IGNORE SPECIAL MEANINGS PROXIMITY SEARCHES USING PATTERNS TO SEARCH SOUNDEX ABOUT ()
For example, the search expression: death AND cause will select only those documents that contain both the word "death" and the word "cause"
death AND cause
For example, the search expression: death OR mortality will select only those documents that contain either the word "death" or the word "mortality".
death OR mortality
For example, the search expression: death NOT cause will select only those documents that contain the word "death" but not the word "cause"
death NOT cause
For example: (disease AND cause of death) NOT natural causes would select only documents containing both the word "disease" and the phrase "cause of death", but not documents containing the phrase "natural causes".
(disease AND cause of death) NOT natural causes
will be interpreted as a search for documents containing both the words "death" and "taxes", but not the intended phrase "death and taxes".
You can tell the search engine to ignore the special meaning of words in the search expression by writing portions of the search expression in the form {phrase}
For example: death {and} taxes to be {or not} to be {to be or not to be}water {within} house(death {and} taxes AND health) NOT social security
death {and} taxes to be {or not} to be {to be or not to be}water {within} house(death {and} taxes AND health) NOT social security
The CIESIN search engine allows for nearness of words to be described in two ways.
word1 NEAR word2 and NEAR( (word1, word2, ...), n)
word1 NEAR word2
NEAR( (word1, word2, ...), n)
In the first form above, the expression will select only those documents containing word1 and word2, and only when word1 and word2 occur within 100 words of each other somewhere in the document.
For example: death NEAR cause
death NEAR cause
Consider the following example: near((red tide,cause,sewage),50)
near((red tide,cause,sewage),50)
Consider the following scenario: Document 1: ... red tide...(30 words)...sewage...(30 words)...cause... Document 2: ...cause...(20 words)...red tide...(30 words)...sewage...
Document 1: ... red tide...(30 words)...sewage...(30 words)...cause... Document 2: ...cause...(20 words)...red tide...(30 words)...sewage...
(death AND cause) WITHIN SENTENCE (death AND unnatural) WITHIN PARAGRAPH
Words can be misspelled, can occur in difference tenses, can be pluralized, and can have other forms that make it more difficult to find matches using exact matching of words and phrases. To address these problems, the search engine supports numerous pattern matching tools to allow for more flexible searching. Here we will discuss only a few of them: wildcards, word stemming, soundex, and fuzzy searches. Wildcard (%)
A wildcard, %, matches any number of characters. It is used when it is desirable to specify only a portion of a word when searching. Examples are as follows: polluti% matches words beginning with the "polluti", such as pollution and polluting. pol%ing matches words beginning with 'pol' and ending with 'ing' such as polling, polluting, and politicking. %lution% matches words containing the sequence of letters "lution", such as pollution, solution, and resolutions.
The stem pattern finds words with the same stem form. This is useful for finding "GOING" and "WENT" from "GO", for instance. Examples: $go matches words having the same stem as "go", including going, gone, and went. $pollution matches word having word stem as pollution, e.g. polluting, pollute, pollutant
Soundex (!)
Soundex query finds words which sound similar.Examples: !hog
!hog
Fuzzy (?)
The fuzzy pattern finds words with similar form. This is useful for finding mis-typed or mis-OCR'd words. The fuzzy operator is ?. Example: ?dog
?dog
About applies word stem, wildcards, and other patterns to find variations on the words and phrases given in the query. It uses a variety of strategies to find the most information that might be relevant to your search expression.
about(temperature) about(global climate change in the southern hemisphere)
about(causes of disease that result in unnatural death) AND ($cause near water) AND ( (pollut% AND infect%) WITHIN SENTENCE )
Copyright © 1997-2007. The Trustees of Columbia University in the City of New York.
Privacy, Security, Notices