All times are UTC - 7 hours





PostPosted: Tue Sep 03, 2013 2:35 am 

Joined: Fri Nov 12, 2004 2:49 pm
Posts: 7111
Location: Jongny, VD, Switzerland
It's really annoying. If I search for "foo bar", search will give me posts that contain either "foo" or "bar", but I want to look up posts that contain both "foo" and "bar". It does that no matter how the radio button is set.


PostPosted: Tue Sep 03, 2013 3:37 am 

Joined: Sun Sep 19, 2004 9:28 pm
Posts: 3192
Location: Mountain View, CA, USA
The default mode for the search box is AND, with the terms delimited by spaces. An example would be to enter into the textbox the literal string graphics snake. The results you'll get back are individual posts that contain the word graphics and also contain the word snake. Proof:

search.php?keywords=graphics+snake&terms=all

The | (pipe) operator is used to change the logic from AND to OR; for example graphics | snake would look for posts that contain the word graphics or the word snake and return all those results. I believe this is the same thing as choosing the "Search for any terms" radio button.
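To make the AND/OR distinction concrete, here is a minimal Python sketch of the matching logic described above. It is not phpBB's code (phpBB is written in PHP and works from a word index); the function names `tokenize` and `search` and the sample posts are made up for illustration.

```python
import re

def tokenize(text):
    """Lowercase a post and split it into a set of words."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))

def search(posts, query):
    """Return indices of matching posts.

    Space-delimited terms are combined with AND; a query containing
    the pipe character is treated as an OR of its terms.
    """
    if "|" in query:
        terms = [t.strip().lower() for t in query.split("|") if t.strip()]
        combine = any   # OR: at least one term must be present
    else:
        terms = query.lower().split()
        combine = all   # AND: every term must be present
    if not terms:
        return []
    hits = []
    for index, post in enumerate(posts):
        words = tokenize(post)
        if combine(term in words for term in terms):
            hits.append(index)
    return hits

# Hypothetical sample posts, purely for illustration.
posts = [
    "Graphics for my snake game",
    "Snake movement logic",
    "Graphics glitches on the title screen",
]

print(search(posts, "graphics snake"))    # AND: only the first post has both words
print(search(posts, "graphics | snake"))  # OR: every post has at least one of them
```

Note that matching is per word and ignores word order, which is why a quoted phrase gains nothing in a scheme like this.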

Use of double quotes (e.g. "graphics snake") does not cause the search function to look for the string graphics snake as a single phrase. phpBB does not support phrase-searching like this.

There are also numerous settings in phpBB itself. For example, when the search results are built (this happens behind the scenes), a minimum and a maximum number of characters limit what gets matched; those are set to 3 and 14 right now, respectively. So if you're looking for a short word such as ok, you probably won't get accurate results. There is also a "common word threshold" percentage that causes words meeting certain criteria to be ignored in queries; we have this set to 20% right now. For example, the word if would probably meet that condition, since it's an incredibly common word in posts.
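A rough Python sketch of how limits like these could filter a query before it is run. The values 3, 14 and 20% come from the settings mentioned above; everything else is an assumption, in particular that "common" means the word appears in more than that share of all posts. The name `usable_terms` and the sample data are hypothetical, and this is not phpBB's implementation.

```python
MIN_CHARS = 3            # minimum term length (setting quoted above)
MAX_CHARS = 14           # maximum term length (setting quoted above)
COMMON_THRESHOLD = 0.20  # common word threshold (setting quoted above)

def usable_terms(query_terms, post_word_sets):
    """Drop terms that are too short, too long, or too common.

    post_word_sets is a list with one set of words per post.
    ASSUMPTION: a term counts as "common" when the share of posts
    containing it exceeds COMMON_THRESHOLD.
    """
    total = len(post_word_sets)
    kept = []
    for term in query_terms:
        term = term.lower()
        if not (MIN_CHARS <= len(term) <= MAX_CHARS):
            continue  # outside the allowed length range
        containing = sum(1 for words in post_word_sets if term in words)
        if total and containing / total > COMMON_THRESHOLD:
            continue  # appears in too many posts to be useful
        kept.append(term)
    return kept

# Hypothetical data: ten posts, all containing "the", one containing "snake".
post_word_sets = [{"the", "snake"}] + [{"the", "palette"} for _ in range(9)]

# "ok" is too short, "the" is too common, so only "snake" survives.
print(usable_terms(["ok", "the", "snake"], post_word_sets))
```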


PostPosted: Tue Sep 03, 2013 4:18 am 

Joined: Fri Nov 12, 2004 2:49 pm
Posts: 7111
Location: Jongny, VD, Switzerland
Quote:
phpBB does not support phrase-searching like this.

Oh that sucks because that's exactly what I would be looking for. This is weird, especially considering it's simpler to search for 1 phrase than 2 words (I mean, computationally).

PS: It's also weird that a search for "snake graphics" doesn't show your post, which contains both terms (and now this very post should be included in the list too, because of this very sentence).


PostPosted: Tue Sep 03, 2013 4:31 am 

Joined: Sun Sep 19, 2004 9:28 pm
Posts: 3192
Location: Mountain View, CA, USA
Bregalad wrote:
Quote:
phpBB does not support phrase-searching like this.

Oh that sucks because that's exactly what I would be looking for. This is weird, especially considering it's simpler to search for 1 phrase than 2 words (I mean, computationally).

I'd urge you to join the phpBB forums and complain, then. The general response given by the community there is "use Google" (I see the legitimacy of this but also roll my eyes). The colloquial term you're looking for, BTW, is "phrase search" or "phrase match". If you find an actively maintained phpBB mod(ule) that does what you want and works with our version of phpBB, then I'd be happy to install it and try it. What I found, though, is pretty abysmal (meaning I think this kind of functionality has to be added to the phpBB core itself rather than as an add-on module).

Bregalad wrote:
PS: It's also weird that a search for "snake graphics" doesn't show your post, which contains both terms (and now this very post should be included in the list too, because of this very sentence).

It will eventually. The search index does not get rebuilt every single time someone posts something; search results are cached server-side (part of phpBB). The cache expires every 1800 seconds (30 minutes). Try again in 30-60 minutes and you'll get the result you expect. The caching is done to keep the server load down; there are literally 113108 posts, and the current number of indexed words is at 103264. I think the last time the index got rebuilt (induced by tepples) it took quite some time (hours). Large forums with lots of posts over many, many years do not scale well.
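For anyone curious why a fresh post can be missing from a repeated search, here is a minimal Python sketch of a time-based result cache. Only the 1800-second lifetime comes from the setting mentioned above; the class name `SearchCache`, its methods and the stand-in search function are hypothetical and do not reflect how phpBB implements its cache.

```python
import time

CACHE_TTL = 1800  # seconds; the cache lifetime quoted above

class SearchCache:
    """Keep search results per query until they are CACHE_TTL seconds old."""

    def __init__(self, ttl=CACHE_TTL, clock=time.monotonic):
        self.ttl = ttl
        self.clock = clock   # injectable so the demo can fake the passage of time
        self._entries = {}   # query -> (time stored, results)

    def get(self, query, compute):
        """Return cached results while still fresh, else recompute and store."""
        now = self.clock()
        entry = self._entries.get(query)
        if entry is not None and now - entry[0] < self.ttl:
            return entry[1]          # still fresh: newer posts are not seen
        results = compute(query)     # expired or missing: run the real search
        self._entries[query] = (now, results)
        return results

# Demo with a fake clock and a stand-in search that counts how often it runs.
fake_now = [0]
runs = []

def run_search(query):
    runs.append(query)
    return "result set #%d" % len(runs)

cache = SearchCache(clock=lambda: fake_now[0])
print(cache.get("snake graphics", run_search))  # first call runs the search
fake_now[0] = 1000
print(cache.get("snake graphics", run_search))  # within 1800 s: cached copy
fake_now[0] = 1800
print(cache.get("snake graphics", run_search))  # expired: search runs again
```

The trade-off is the one described above: repeated queries stay cheap for the server, at the cost of results being up to half an hour stale.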

