Resources
How to Construct a Search Strategy
A General Guide...
By Ruth M Sladek, updated September 2005
A Search Strategy is a plan.
You will find it easier if you have spent time framing your question
in terms of:
- Population
- Intervention
- Comparison
- Outcome
A Search Strategy consists of:
- Identifying the key concepts of interest (ie, PICO)
- Identifying where you will search
- Combining concepts with each other and applying appropriate
limits
Example
Step 1:
Is St Johns Wort more effective than traditional
antidepressants in the treatment of depression?
P = depressed people
I = St Johns Wort
C = antidepressants
O = reduction of depressive symptons
Step 2
Identify the databases you wish to check.
Step 3
Plan how you will combine your searches. This
will vary slightly for each database, but basically requires you
to decide which concepts you wish to combine. Generally you would
start with a broad search, and then narrow it down. Typically
this would mean combining the Population and Intervention
concepts first.
Using the example and a fictional database, the
first concept, population, is depressed people. Identify
a term likely to be used in common language for this, eg, depression.
The next concept is the intervention,
St Johns Wort. Databases usually have specific rules for searching
phrases, but ignore this for this example.
| |
Search Term
Depression
St Johns Wort
|
|
You really want all records that include both
concepts. To do this, combine them using the 'AND' operator.
| |
Search Term
Depression
St Johns Wort
1 and 2
|
|
You might find 1300 records, ie, 1300 records
contain both the 'population' and the intervention concepts. If
you have a specific comparison, such as ' antidepressants'
, you could then 'and' this. By doing this you are narrowing your
search. This might reduce your search down to 100. In other words,
100 records share all three concepts of Population, Intervention
and Comparison.
| |
Search Term
Depression
St Johns Wort
1 and 2
Antidepressants
3 and 4
|
Hits
10,000
2,000
1,300
30,000
100
|
In the example, we have used 'natural language'
to describe our concepts. Databases use both natural language
and controlled language subject terms. Natural language
is sometimes referred to as textword or keyword searching. Most
Internet sites use natural language, and most bibliographic databases
allow you to use both.
One of the drawbacks of textwords, is that you need to be able
to identify possible alternative expressions for the same concepts.
Using our example, what if there was an excellent systematic review
on the topic, but it didn't use the term 'St Johns Wort'. What
if it were called 'a systematic review of hypericum in patients
with mild to moderate depression'?.
There are two strategies that you can use. Firstly, identify alternative
words to describe the concepts and combine them with an 'OR'
operator. Looking at our example,
| |
Search Term
Depression
St Johns Wort
Hypericum
2 or 3
1 and 4
Antidepressant$
3 and 4
|
Hits
10,000
2,000
3,000
4,000
1,600
35,000
150
|
Without using the word 'hypericum' we retrieved
100 references. By including it, we broadened our search strategy
to retrieve 150 records. Set 4 does not equal 2,000+3000, because
some of these records contain both 'hypericum' and 'St Johns Wort'.
The second strategy is that of truncation.
Notice how Set 6 contains the symbol '$' . It is shorthand way
of using the word 'or'. It actually means, search for 'antidepressant
or antidepressants'. Hence it also broadens your search.
For example, child$ would search for 'child
OR children' . The truncation rule is that you take the last common
letter:
eg Amput$ will find:
- Amput ate OR
- Amput ates OR
- Amput ee OR
- Amput ees OR
- Amput ation OR
- Amput ations OR
Truncation symbols allow you to search
for plurals. They vary across databases. In OVID Medline a $
sign is used, but in PubMed and the Cochrane Library an * is used.
The use of the words AND and OR is commonplace in
all databases and searching the Internet. It is called Boolean
Logic. A handy way to remember this is that OR
means more. It broadens your search. AND means less and
narrows your search.
Some databases even have wildcards to use in the
middle of words. For example, OVID Medline
- uses # to replace 1 letter (eg, wom#n) and
- ? to replace 0-1 letters (eg colo?r, or orthop?edic)
The other type of searching is called 'controlled
language' or 'subject searching'. The most commonly known
one in medicine is called MESH. This is a list of thesaurus terms.
When a database uses a controlled language such as MESH, it means
that someone has read the articles indexed in the database, and
given them a range of subject headings taken from the Thesaurus.
You can then search the database using these. Databases such as
Medline, Embase and Cinahl allow you to do textword searching
and subject searching.
The best way to explain the difference, is by
use of an example where the concept of interest is the 'hand'
. If you search Medline using it as a textword, you will retrieve
a large number of records. Many of these will focus on the 'hand'
but many will be irrelevant because they will contain expressions
such as 'on the other hand' or 'hand it over' .
If you search for it using a MESH heading, you
will restrict your search to only those items that are about the
'hand'.
Subject searching is more specific and gives
you more control. Subjects normally have a range of subheadings
- making searching easier. For example, myocardial Infarction/rh
will find everything on myocardial infarction and rehabilitation.
Textword searching is good for recent developments
(where there may be no formal subject heading), or when something
is so rare that you are looking for anything at all that might
use the term. Experienced searchers use both to build up complex
strategies.
Most databases have online help in providing
you with the controlled vocabulary. It is sometimes called mapping.
In OVID Medline for example, if you enter a natural language term
such as stroke, it automatically directs you to the MESH heading
"cerebrovascular accidents". Not all databases use the
same language. Embase, Medline and CINAHL all use different controlled
languages.
The next step in searching is to limit
your results. There are some standard limits built into most databases,
such as year and language. However an evidence based approach
means we are really looking to narrow our search down to particular
types of evidence, in particular, the highest quality ones, so
we need to have some strategies for doing this. These are referred
to as "quality filters" or "hedges". There
are some quite complicated ones available via the Internet, but
there are some simple choices you can make when using databases
restrict your search results to higher quality evidence. When
you use PubMed Clinical Queries, you are actually using such filters,
which have been developed and saved by experienced searchers.
In terms of Boolean Logic, logically you are
combining your final search results from a strategy (developed
from PICO) and the quality filter.
Consider a search on Medline. You can restrict
your search to a particular study type, eg, a RCT, clinical trial,
or a Cochrane Review. Similarly on CINAHL, you can restrict to
a systematic review, a clinical trial, or research.
If 'formal limits' are unavailable, you can always
add in the words 'review' or 'systematic review' to your search
to limit the findings.
A few other comments:
Sensitivity is the likelihood of retrieving relevant
items, whereas specificity is the likelihood of excluding irrelevant
items.
You can find developed filters on the Web
Summary
To summarise, developing a search strategy consists
of 3 steps
- Identifying the key concepts of interest (ie. PICO)
- Identifying where you will search
- Combining concepts with each other and applying appropriate
limits
- Typically start with Population and Intervention
- Use OR to broaden your search
- Use AND to combine your PICO concepts
- If using textwords, identify all possible alternatives and
consider truncation symbol for plurals, alternative spelling
and all words with the same word stem
- If you get few or no results, check your spelling!
- Once you have completed your search, consider applying a
filter (ie, this will AND your final search with a limit or
search which will identify better quality evidence)
- A database will only ever be as good as the information
it indexes in the first place. There is no one database that
will provide you with all answers. If you do find something,
it isn' t necessarily good evidence. If you don' t find anything,
it doesn' t mean it doesn' t exist.
|