What is truncation and when should I use it in database searches?

Truncation is a search technique that uses a wildcard symbol — usually an asterisk (*) — at the end of a word stem to retrieve all words sharing that stem. Searching for educat* retrieves education, educational, educator, educators, educating, and educate in a single term. Truncation prevents searches from missing relevant records because of word form variations. Use it on word stems where the variants are all relevant: child* captures children, childhood, childlike. Avoid truncating at too short a stem — work* would retrieve work, worker, working, workplace, workload, workshop, worldwide, and word in many databases — producing retrieval too broad to be useful. Some databases also support wildcard symbols within words (typically ? or #) to handle spelling variants: wom?n retrieves woman and women; colo?r retrieves color and colour. Check the specific database's search guide for its truncation and wildcard characters, as they vary between platforms.

What is citation searching and how does it extend a database search?

Citation searching uses a known relevant source to find additional relevant literature in two directions: backward citation searching traces the references cited in a known article (finding older literature the author drew on) and forward citation searching finds all subsequent articles that have cited the known article (finding newer literature that engaged with it). Together, backward and forward citation searching build the scholarly conversation around a key source, often retrieving highly relevant articles that keyword and subject heading searches missed — especially articles that use different terminology. Tools for citation searching include Google Scholar's Cited by function, Web of Science's Citing Articles, and Scopus's Cited by. Citation searching is most productive after a keyword search has identified two to three central articles; those central articles become the anchors for citation chains that extend coverage in both temporal directions.

How do I know when I have done enough searching?

The practical indicators of search saturation are: new searches are returning sources already identified in earlier searches (you are reaching the same literature from different entry points); citation chains from key articles are converging on the same core set of sources; additional synonym and terminology variants are producing diminishing returns in new relevant results; and you have covered at least two to three databases with appropriate disciplinary coverage. For a systematic or scoping review, saturation is defined more formally by protocol and documented in a PRISMA flow diagram. For a dissertation literature review, the combination of keyword searches across multiple databases, citation searching from key sources, and a hand-search of the most relevant journals' recent issues is normally sufficient. The error to avoid is searching until you find enough sources to fill a section, rather than searching until you have covered the literature — the former produces convenience sampling; the latter produces genuine evidence of the field.

Library Database Navigation

Home / Academic Skills / Library Database Navigation

RESEARCH METHODOLOGY · INFORMATION LITERACY · ACADEMIC SEARCH SKILLS

Library Database Navigation

How to search academic databases with precision — from Boolean operators and controlled vocabulary through discipline-specific platforms, systematic search strategy, citation searching, interlibrary loan, and the search habits that turn a research question into a comprehensive, auditable evidence base for academic writing.

Research Paper Help Literature Review Help

50–60 min read All academic levels 20+ databases covered 10,000+ words

Custom University Papers Research Skills Team

Specialists in academic information literacy, search strategy design, and the research practices that connect a question to a literature base — drawing on experience supporting students across disciplines from undergraduate level through doctoral research, and the specific challenges that arise when institutional database access meets complex, interdisciplinary research questions.

Most students learn to search academic databases the way they learned to type: by doing it until something worked, without anyone explaining the underlying logic. The result is a characteristic set of search habits — typing a full research question into a single database, scrolling through the first page of results, and treating whatever appears as the literature on the topic. This approach produces a convenience sample of whatever the algorithm surfaces, not a representative picture of the scholarship. The gap between those two things is the gap between a literature review that describes existing research accurately and one that describes the fraction of it a student happened to encounter. Library database navigation — the structured, strategic use of search tools, operators, controlled vocabulary, and multi-database methodology — is what closes that gap.

What This Guide Covers

What library database navigation involves How academic databases are organised Boolean operators and search logic Subject headings and controlled vocabulary Filters, limiters, and result refinement Navigating discipline-specific databases Full-text versus bibliographic databases Citation searching and reference tracing Building a systematic search strategy Interlibrary loan and access barriers Grey literature and supplementary sources Managing and organising search results Search errors that undermine literature coverage Frequently asked questions

Library database navigation refers to the structured process of identifying, accessing, and querying the organised electronic repositories that hold academic literature — journal articles, conference papers, theses, dissertations, and reports indexed for scholarly retrieval. The term captures more than just logging into a database and entering a search term. It encompasses the full methodology of literature discovery: selecting the right repositories for the research question, constructing searches with Boolean logic and controlled vocabulary, applying appropriate filters, managing and screening results, and documenting the process in a way that makes the search auditable and reproducible.

Understanding why this methodology matters begins with understanding what a database is not. Academic databases are not neutral archives that surface all relevant scholarship equally. They are curated, indexed collections with specific disciplinary coverage, inclusion criteria, and search architectures. What your search retrieves is determined as much by the logic of your query as by the content of the database — two searches on the same topic using different terminology and operators in the same database can return radically different results. A student who searches for “stress in students” retrieves different records than a student searching for “psychological distress AND university enrollment” in the same database, even though both are searching the same platform for the same conceptual territory.

72%of students rely on a single database for their literature searches, according to information literacy studies at university libraries

3–5×more relevant records retrieved when Boolean operators and controlled vocabulary are used versus natural language keyword searching alone

28%of relevant literature on a topic not indexed in any single database — multi-database searching is required for comprehensive coverage

60%of systematic review search strategies require revision after a librarian audit, most commonly for missing synonyms and absent subject headings

The consequence of inadequate search methodology is not just an incomplete bibliography. It is a structurally biased one — because the literature that is easiest to find through casual searching (highly cited papers, recent publications in high-profile journals, English-language sources with common terminology) is not randomly distributed across the scholarly conversation. It skews toward consensus, toward dominant methodological traditions, and toward the specific linguistic conventions of one disciplinary community. A search methodology that retrieves only this easily-surfaced literature consistently misses counter-evidence, methodological diversity, and the scholarship of disciplines that address the same question with different vocabulary.

This guide addresses the full scope of academic information retrieval — the technical operations, strategic principles, and practical habits that determine whether a literature search is thorough and reproducible or convenient and porous. Whether you are writing an undergraduate essay, constructing a master’s dissertation literature review, or conducting the search protocol for a systematic review, the principles are the same: the methodology shapes the evidence base, and the evidence base shapes the argument.

How Academic Databases Are Organised — the Structure Beneath the Search Box

Every academic database has an architecture — a structure that determines how records are stored, indexed, and retrieved. Understanding this architecture is not a technical prerequisite for searching; it is the foundation that makes sense of why some searches work and others fail. The most common reason students get poor search results is not that the relevant literature does not exist in the database — it is that their query does not match the way the database has indexed that literature.

The Record — The Basic Unit of a Database

Each item in an academic database is stored as a record: a structured entry containing multiple fields — title, author(s), abstract, publication year, journal name, volume, issue, page numbers, DOI, subject headings, and often a full-text link. When you search a database, you are querying these structured fields, not necessarily the full text of articles. Most keyword searches query the title, abstract, and subject heading fields by default — which is why an article’s full text can discuss a concept extensively without the record being retrieved by that term if the concept does not appear in the abstract or assigned headings.

Indexing — How Records Are Categorised

Indexing is the process by which database staff — or, in some databases, automated systems — assign subject headings and descriptors to each record based on its content. This is the process that creates controlled vocabulary coverage: an article about “adolescent mental health” might be indexed under “Youth — Psychological Aspects,” “Mental Disorders — Diagnosis,” and “Age Factors” in one database’s controlled vocabulary system. A search using those subject headings retrieves this article even if the exact phrase “adolescent mental health” never appears in the title or abstract. The thoroughness and currency of indexing varies significantly between databases — older literature may have minimal subject heading coverage, and very recent articles may not yet be fully indexed.

The Thesaurus — The Controlled Vocabulary Map

Most major databases provide a searchable thesaurus — a hierarchical map of the controlled vocabulary used for indexing. The thesaurus shows how subject headings relate to each other: broader terms (hypernyms), narrower terms (hyponyms), related terms, and preferred terms for concepts that could be described in multiple ways. Before running a substantive database search, browsing the relevant database’s thesaurus for your key concepts is the single highest-return preparatory action available. It identifies the authorised terms under which your topic is indexed and reveals the hierarchical relationships that allow you to broaden or narrow searches systematically.

Search Fields — Where Your Query Lands

Database search interfaces offer multiple field-specific search options: All Fields (queries all searchable text in the record), Title, Abstract, Subject Heading, Author, Journal Name, and in full-text databases, the body of the article. The scope of a keyword search is directly determined by which field it queries. Searching in All Fields is the broadest and most likely to produce noise; searching in Title AND Abstract strikes a balance between precision and recall; searching in Subject Headings alone is the most precise but risks missing recently indexed or poorly indexed material. Advanced database users combine field-specific searches: subject heading searches for established literature, keyword searches in title and abstract for recent or emerging literature.

Filters — Post-Search Refinement Tools

Filters narrow retrieved results by attributes stored in the record fields: date range, publication type (journal article, conference paper, book chapter, review), peer-review status, language, subject area, and in some databases, study design or methodology. Filters are applied after the initial search and should be used in sequence, recording how each filter changes the result count. The order matters: applying overly restrictive filters early can remove relevant records before you have assessed their potential relevance. Full-text availability is a filter that should almost never be applied at the database level — it systematically excludes records your library may be able to provide through interlibrary loan.

Database Coverage vs. Full-Text Access — A Critical Distinction

Many students conflate database coverage with full-text availability and assume that a record appearing in a database search means the full article is accessible. These are two separate things. A database record is a description of an article — its citation, abstract, and indexing data. Full-text access to the article itself depends on your institution’s subscription to the journal where it was published. A database may index thousands of journals that your institution does not subscribe to in full text. The record appears in your search; the full text does not open.

The implication for search methodology is direct: do not apply “full text available” as a search filter. Doing so limits your search to the subset of relevant literature your institution happens to have subscribed to — which is a function of budget and collection policy, not of relevance to your research question. Identify the full relevant literature first, then pursue access to specific items through your library’s full-text links, open-access repositories, or interlibrary loan.

Boolean Operators and Search Logic: The Syntax of Precise Database Retrieval

Boolean operators are the logical connectors that determine how multiple search terms relate to each other in a database query. Named after the nineteenth-century mathematician George Boole, they are the fundamental mechanism by which a vague research interest becomes a precise retrieval instruction. Three operators — AND, OR, and NOT — form the core of Boolean logic; a fourth operation, parentheses grouping, controls the order in which operators are evaluated in complex queries. Together, these four elements give you explicit control over the scope and direction of your search.

Boolean search string construction — worked example Search Syntax

RESEARCH QUESTION:
What is the effect of social media use on sleep quality in adolescents?

STEP 1 — Identify core concepts (columns for AND):
Concept A: social media
Concept B: sleep / sleep quality / sleep disturbance
Concept C: adolescents / teenagers / young people

STEP 2 — Generate synonyms for each concept (rows for OR within column):
Concept A: "social media" OR "social networking" OR Instagram OR TikTok OR Twitter OR Facebook OR "online platforms"
Concept B: sleep OR "sleep quality" OR "sleep disturbance" OR insomnia OR "sleep duration" OR "sleep deprivation"
Concept C: adolescent* OR teenager* OR "young people" OR youth OR "secondary school" OR "high school"

STEP 3 — Combine with AND between concepts, OR within concepts:
( "social media" OR "social networking" OR Instagram OR TikTok )
AND
( sleep OR "sleep quality" OR "sleep disturbance" OR insomnia OR "sleep duration" )
AND
( adolescent* OR teenager* OR "young people" OR youth )

RESULT: Retrieves records containing at least one term from each concept group —
more precise than keyword-only, more comprehensive than phrase-only.

AND — Narrowing by Intersection

The AND operator requires both (or all) connected terms to appear in the same record. It narrows the search — every additional AND term reduces the result set to the intersection of multiple concept sets. AND is the operator that makes a search topic-specific rather than topic-adjacent: without AND, a search for sleep and adolescents as separate terms retrieves records on each topic independently; with AND between them, it retrieves only records on both simultaneously. The practical rule is to AND together your core concepts — the essential components of the research question that must all be present in a record for it to be relevant — and no more. Over-ANDing produces retrieval so narrow that genuinely relevant records are excluded.

OR — Broadening by Equivalence

The OR operator retrieves records containing either connected term. It broadens the search — every additional OR term expands the result set to include records using any of the equivalent terms. OR is the operator that makes a search terminologically comprehensive: the scholarly literature on a topic uses multiple terms for the same concept (adolescent, teenager, young person, youth, juvenile), and a search that uses only one of these terms misses all records that use the others. OR should connect every synonym, variant spelling, related term, abbreviation, and alternative terminology within each concept group. The practical rule is to generate OR synonyms exhaustively before searching, because missing a widely-used synonym is one of the most common sources of incomplete literature coverage.

NOT — Excluding by Definition

The NOT operator excludes records containing the following term. It is useful for disambiguation — removing records on a similarly-named but different topic (mercury NOT planet, for medical literature on mercury exposure; depression NOT weather, for psychological literature). NOT should be used cautiously: it is a blunt instrument that removes records based on the presence of a term regardless of the context in which that term appears. A record primarily about psychological depression that mentions weather in its introduction is excluded by depression NOT weather, even though it is relevant. Use NOT only where the excluded term is a systematic source of irrelevant results that cannot be addressed by more precise construction of the main query.

Phrase Searching

Enclosing a multi-word term in quotation marks instructs the database to search for those words as a phrase, appearing together and in sequence. “Social media” retrieves records containing that exact phrase; without quotes, social AND media retrieves records containing those words anywhere in the record, including unrelated uses. Phrase searching is more precise than keyword searching for compound concepts that only mean something specific as a unit: “cognitive load,” “evidence-based practice,” “climate change adaptation.” Use phrase searching for any concept whose individual words have common but unrelated meanings when separated.

Truncation and Wildcards

Truncation uses a symbol (typically *) at the end of a word stem to retrieve all words sharing that stem: educat* retrieves educate, education, educational, educator. This prevents searches from missing plural forms, verb conjugations, and noun/adjective variants. Most databases also support internal wildcards (? or #) for spelling variations: wom?n retrieves woman and women; colo?r retrieves color and colour. Check the specific database’s help documentation for its truncation characters — they vary. Apply truncation at a point in the stem where all variants are relevant: communicat* is useful; com* retrieves far too many irrelevant words to be useful.

The purpose of Boolean logic in database searching is not to be clever — it is to be explicit. A Boolean search string is a precise, auditable specification of what you are asking the database to retrieve. The more explicit the specification, the more reliably the retrieval reflects your research intent rather than the database algorithm’s judgment about what you probably meant. — Principle reflected in systematic review methodology guidelines and academic information literacy frameworks

Subject Headings and Controlled Vocabulary: Searching the Way Databases Think

Controlled vocabulary is the most powerful and most underused tool in academic database searching. Every major database uses a structured, standardised terminology system — a set of authorised terms that indexers assign to records to categorise their content, regardless of the specific wording used in the original text. When you search using these authorised terms, you retrieve all records indexed under that concept: records that use different terminology, records with minimal abstracts, and records where the concept is central but the exact keyword you would use does not appear in the retrievable text. Keyword searching finds what you know to look for; controlled vocabulary searching finds what has been indexed under a concept, including material you would not have found by keyword alone.

PubMed / MEDLINE

MeSH — Medical Subject Headings

The National Library of Medicine’s controlled vocabulary for biomedical literature — a hierarchical structure of over 30,000 terms covering clinical medicine, biological sciences, public health, and nursing. MeSH terms can be searched as subject headings in PubMed’s MeSH database, which allows you to browse the hierarchy, identify related terms, and add the term directly to your search. MeSH also supports subheadings (called qualifiers) that narrow a term to a specific aspect: “Diabetes Mellitus/therapy” retrieves records about diabetes treatment specifically, not all diabetes literature. For clinical and biomedical research, MeSH searching is the primary retrieval methodology; keyword searching supplements it for very recent literature not yet fully indexed. Available at PubMed.

EBSCO Databases

EBSCO Thesaurus

EBSCO hosts multiple databases — Academic Search Complete, PsycINFO, CINAHL, ERIC, Business Source Complete — each with its own discipline-specific thesaurus. The EBSCO thesaurus is accessible through the “Subjects” or “Thesaurus” tab in the advanced search interface. It shows the preferred term for a concept, its broader and narrower terms, and related terms, allowing systematic refinement of subject heading searches. In PsycINFO, for example, the preferred term for studies involving human participants is “Human” — without this, animal studies appear in human psychology searches. EBSCO thesaurus browsing before searching is strongly recommended for any systematic or comprehensive literature search.

JSTOR

Discipline Classification System

JSTOR organises its humanities, social sciences, and sciences literature by discipline and journal, with browsable subject area classification. While JSTOR does not use a traditional subject heading thesaurus like MeSH or EBSCO, it supports discipline-specific filtering and advanced Boolean searching across its journal archive. JSTOR is particularly strong for historical depth — many journals have archives extending to their founding issues, making it the primary source for literature review of established scholarly debates in the humanities. Available at JSTOR.org.

Web of Science / Scopus

Keyword Plus and Author Keywords

Web of Science uses KeyWords Plus — algorithmically generated terms derived from the titles of articles cited by an indexed paper — as an additional search field supplementing author-supplied keywords. Scopus similarly indexes author keywords and aggregated subject classifications. Both databases support structured Boolean searching across multiple fields and are the primary platforms for citation analysis and bibliometric searching. Their broad multidisciplinary coverage makes them appropriate for first-pass searches across interdisciplinary research questions where disciplinary database selection is uncertain.

ERIC (Education)

ERIC Thesaurus

The Educational Resources Information Center (ERIC) maintains a subject thesaurus for education literature — a controlled vocabulary covering curriculum, pedagogy, learning theory, educational policy, assessment, and related fields. ERIC thesaurus terms are used as “Descriptors” in ERIC records and can be searched specifically in the Descriptor field for precise subject retrieval. ERIC also indexes grey literature — government reports, policy documents, practitioner publications — alongside peer-reviewed journals, making thesaurus-guided searching particularly important to distinguish research literature from policy documents in result sets.

Library of Congress

LCSH — Library of Congress Subject Headings

The standard controlled vocabulary for cataloguing books and monographs in library catalogues across the English-speaking world. Library of Congress Subject Headings (LCSH) are used in library catalogues including your institution’s catalogue, WorldCat, and HathiTrust. When searching for books, edited volumes, and monographs — rather than journal articles — understanding LCSH allows you to find all catalogued books on a concept even when their titles use different terminology. LCSH is also the basis of most public library catalogues and is accessible through the Library of Congress catalogue at loc.gov.

Using the Thesaurus: A Practical Workflow

The productive sequence for incorporating controlled vocabulary into a database search is: first identify the concept you need to search, then look it up in the database’s thesaurus before building your query. The thesaurus shows the preferred (authorised) term, which you add to your search as a subject heading. It also shows the broader term (to use if initial results are too narrow), narrower terms (to refine a broad initial result set), and related terms (to identify adjacent concepts you may have missed). Many students skip this step and search entirely by keyword — which works adequately for well-known concepts with stable, widely-used terminology, but fails systematically for concepts with multiple disciplinary names, recently renamed conditions, or jargon that varies between US and UK academic traditions.

Database filters are refinement tools applied after an initial search to reduce the result set to the subset most appropriate for a specific research purpose. Applied strategically, they improve search efficiency; applied carelessly, they introduce systematic bias into the literature retrieved. The key is understanding which filters narrow results by genuine relevance criteria and which narrow them by access or availability criteria that have nothing to do with whether a source is useful.

Appropriate Use

Use With Caution

Avoid as Default

Filter Type

What it does

When to apply it

Risk if misapplied

Date range

Limits results to items published within a specified year range

When currency is a genuine research criterion (clinical guidelines, technology policy, recent developments) — or to find the most recent literature after a comprehensive search has already been run

Excludes foundational and seminal literature; misses the historical development of a debate; produces literature reviews with no theoretical grounding in established scholarship

Peer-reviewed / scholarly

Limits results to publications that have undergone peer review

When the research question requires peer-reviewed evidence specifically — most undergraduate and postgraduate academic work

Excludes grey literature that may be the primary source for policy, practice, and current data; not all peer review is equal in rigour

Document / publication type

Limits to specific item types: journal articles, reviews, meta-analyses, books, conference papers

When evidence hierarchy requirements specify a particular study design — systematic reviews searching for RCTs, for example

Excludes highly relevant work in other formats; conference papers often contain cutting-edge research not yet published in journals

Language

Limits results to a specified language of publication

Practically, when you cannot read or access translation for other languages — though this should be acknowledged as a limitation

Introduces language bias — well-documented in systematic review methodology as a source of publication and selection bias, particularly in fields with strong non-English-language research traditions

Full text available

Limits results to items your institution can provide in full text from the database directly

As a convenience filter after a comprehensive, unfiltered search has been documented — not as a primary search filter

Systematically biases literature coverage toward what your institution subscribes to; highly relevant items obtainable via interlibrary loan are excluded without being assessed for relevance

Subject area / discipline

Limits results to a specified disciplinary classification

When a broad database search returns excessive disciplinary noise for a narrowly disciplinary question

Excludes interdisciplinary literature; a psychology question filtered to Psychology excludes relevant sociology, public health, and neuroscience research

The most consistently misapplied filter is the full-text availability filter, applied at the outset of a search as a convenience measure. A student who begins a search with “full text available” checked is not searching for relevant literature — they are searching for literature their institution happens to own. The overlap between those two sets varies by topic, institution, and time period, but it is never complete. The correct sequence is always: search without full-text filter, document the complete relevant set, then pursue access to specific items through whatever means your library provides. The literature your institution does not subscribe to in full text is not less relevant than the literature it does.

Navigating Discipline-Specific Databases: Matching the Tool to the Research Question

Selecting the right database is as important as constructing the right search. General multidisciplinary databases — Scopus, Web of Science, Academic Search Complete — offer broad coverage but cannot match the depth, indexing quality, and specialist thesaurus systems of discipline-specific databases in their home fields. A clinical question searched only in Scopus misses PubMed’s superior medical indexing and MeSH system; a psychological research question searched only in PubMed misses PsycINFO’s comprehensive coverage of psychological literature including dissertations, book chapters, and specialist journals not indexed in PubMed.

Health and Biomedical Sciences

PubMed/MEDLINE — the standard for biomedical and clinical literature with MeSH controlled vocabulary. CINAHL (EBSCO) — nursing and allied health. Cochrane Library — systematic reviews and clinical trials. EMBASE — pharmaceutical and European clinical literature. For comprehensive clinical searches, PubMed plus EMBASE plus CINAHL is the standard three-database combination for systematic reviews in healthcare.

Social Sciences and Psychology

PsycINFO (EBSCO/APA) — the definitive psychology database covering journals, books, and dissertations with the APA Thesaurus. Sociological Abstracts — sociology and related social sciences. Web of Science Social Science Citation Index — multidisciplinary with citation analysis. JSTOR — strong archive for established social science journals. For psychology dissertations, ProQuest Dissertations and Theses is essential.

Humanities and Arts

JSTOR — deep archives in history, literature, philosophy, and art history. Arts and Humanities Citation Index (Web of Science) — multidisciplinary humanities coverage with citation analysis. MLA International Bibliography — language, linguistics, literature, and folklore. Historical Abstracts — world history excluding North America. Philosopher’s Index — philosophy literature with specialist indexing.

Law

Westlaw and LexisNexis — the primary legal research databases for case law, legislation, and legal commentary. HeinOnline — law review articles and legal history. OSCOLA resources for international law. Jurisdiction-specific legal databases supplement these for national legal systems. Law searches require understanding of legal citation conventions and jurisdiction filtering alongside standard Boolean searching.

Education

ERIC (Institute of Education Sciences) — the primary education database, freely accessible at eric.ed.gov, covering journals and grey literature with a specialist thesaurus. British Education Index — UK education literature. Australian Education Index — Australian and Pacific education research. Education searches frequently require grey literature coverage alongside journal databases to capture policy and practice documents that peer-reviewed literature does not represent.

Business, Economics, and Management

Business Source Complete (EBSCO) — comprehensive business and management literature. EconLit — economics literature with specialist indexing. ABI/Inform (ProQuest) — business and management with strong industry report coverage. SSRN — pre-print and working paper repository especially important for economics research at the frontier. Available through institutional access or open access depending on the database.

Google Scholar: Useful Supplement, Not a Replacement for Database Searching

Google Scholar indexes a broad range of academic literature — journal articles, theses, conference papers, pre-prints, and court opinions — and is freely accessible without institutional login. Its strengths are breadth of coverage, citation searching (the “Cited by” function), and finding open-access versions of paywalled articles. For these purposes it is a genuine research tool, not just a convenience shortcut.

Its limitations are equally genuine: Google Scholar does not support controlled vocabulary searching, its indexing is less systematic and consistent than specialist databases, it indexes non-peer-reviewed material without differentiation, its coverage is biased toward English-language and open-access sources, and the algorithm ranking its results is opaque and not optimised for comprehensive retrieval. For systematic or comprehensive literature searches, Google Scholar alone is insufficient. Use it for citation searching, identifying open-access versions of specific articles, and rapid relevance checking — not as a primary retrieval source for disciplinary literature.

Full-Text Databases vs. Bibliographic Databases — Understanding What You Are Searching

The distinction between full-text and bibliographic databases is one of the most consequential differences in academic database searching, and it is frequently misunderstood. Knowing which type you are searching determines what your query is actually doing and why the same search in different databases produces different results even when both databases appear to cover the same subject area.

Bibliographic Databases

Full-Text Databases

What Is IndexedRecords contain bibliographic information and abstracts — the article’s citation data, abstract, and subject headings. The full body of the article is not stored in the database and is not searched by default.

What Is IndexedRecords include the complete text of the article or document. Searches can query the full body of the text — every word in every article — not just the title, abstract, and subject headings.

Search BehaviourA keyword search queries title, abstract, and subject heading fields. An article that discusses a concept extensively in its body but does not mention it in the abstract may not be retrieved. Subject heading searches are especially important in bibliographic databases for this reason.

Search BehaviourFull-text searching retrieves articles where the search term appears anywhere in the document. This produces higher recall (fewer relevant articles missed) but often lower precision — terms appearing in passing references, methods sections, or discussion footnotes are retrieved alongside articles where the term is central.

ExamplesPubMed/MEDLINE (bibliographic with links to full text); EMBASE; PsycINFO; Scopus; Web of Science; ERIC — these databases index and abstract literature; full-text access depends on separate journal subscriptions.

ExamplesJSTOR (full text of journals in its archive); ScienceDirect; Academic Search Complete (EBSCO full text) for subscribed journals; ProQuest databases where full text is available. Note that even full-text databases may not have full-text access for every journal they cover.

Precision vs. RecallBibliographic searches with well-constructed Boolean strings and controlled vocabulary tend toward higher precision — the retrieved records are more consistently relevant to the search concept. The risk is lower recall: relevant articles with atypical abstracts or indexing may be missed.

Precision vs. RecallFull-text searches offer higher recall in theory but require careful query construction to avoid noise: retrieving articles where your search term appears once in passing among thousands of records. Highly specific phrase searches and proximity operators are more important in full-text databases.

For most academic research at undergraduate and postgraduate level, the most effective approach combines both modes: use bibliographic databases with controlled vocabulary for primary comprehensive searching, then use full-text search capabilities in platforms like JSTOR or ScienceDirect for targeted searching of specific journals or document collections. The combination exploits the precision of structured bibliographic indexing and the recall advantages of full-text coverage in specialised archives.

Citation Searching and Reference Tracing: Extending the Literature Search Temporally

Citation searching is a literature retrieval methodology that uses a known relevant source as the entry point for discovering additional relevant literature, rather than search terms. It operates in two directions — backward through the reference list of a known article, and forward through all articles that have subsequently cited it — and produces a temporally complete picture of the scholarly conversation around a topic that keyword searching alone cannot match.

Backward Citation Searching — The Literature an Author Drew On

Every reference cited in a relevant article is a potentially relevant source. Working backward through a reference list is the fastest way to identify the foundational and seminal literature in a field — the sources that established the theoretical frameworks, key findings, and methodological approaches that later work built upon. Systematic backward searching begins with two or three central articles identified through keyword searching; their combined reference lists typically yield twenty to forty potentially relevant sources, each of which can be assessed for relevance and then used for further backward searching if warranted. This “snowballing” technique can be a productive source of older literature that predates comprehensive database indexing.

Forward Citation Searching — Who Has Cited This Work Since Publication

Forward citation searching identifies all articles published after a known source that have cited it. This traces the development of an idea, finding how a foundational study has been extended, applied, critiqued, or replicated in subsequent research. It is the most reliable method for finding the most recent literature on a topic when keyword searches are returning predominantly older results — because recent articles cite the key older articles in the field, forward citation searching from those older articles reliably surfaces recent contributions. Google Scholar’s “Cited by” function, Web of Science’s “Citing Articles,” and Scopus’s “Cited by” provide this functionality.

Lateral Citation Searching — Articles That Cite the Same Sources

Lateral citation searching — finding articles that cite the same set of sources as a known relevant article — identifies literature that addresses the same intellectual problem using related but potentially different terminology. Web of Science supports this through its “Related Records” function; Scopus has a similar feature. This approach is particularly valuable for interdisciplinary topics where different disciplinary communities address the same empirical question using their own terminology and citing their own disciplinary canon. Lateral citation searching can reveal an entire disciplinary literature on your topic that keyword searching in a different disciplinary database may have missed.

Journal Hand-Searching — Targeted Coverage of Core Publications

For a research question closely associated with a specific journal, hand-searching — browsing the journal’s table of contents issue by issue over a defined date range — provides a level of coverage that database indexing delays and indexing gaps can miss. Major systematic review guidelines recommend hand-searching at least one or two core journals as a supplement to database searching. Most journals’ websites allow browsing by volume and issue; alternatively, journals that publish in major databases allow table-of-contents searches limited to a specific source title. Hand-searching is time-intensive and is most appropriate for comprehensive reviews where thoroughness of coverage is a primary methodological requirement.

Reference Manager Cross-Referencing — Finding Patterns in Your Collection

As a research collection grows, reference manager tools like Zotero and Mendeley can identify which sources are most frequently cited across your collected literature — indicating the most influential works in the field — and surface sources cited by multiple items in your collection that you have not yet retrieved directly. This cross-referencing within a managed collection is an additional form of citation analysis that requires no additional database searching: it exploits the reference data already present in the literature you have collected to surface gaps and high-priority sources you have not yet assessed.

2–3

Seed Articles Needed

The minimum number of highly relevant “anchor” articles needed to begin productive citation searching — identified through initial keyword searches before citation methods are applied

30%

Additional Coverage Gain

Approximate proportion of relevant literature in a field typically identified through citation searching that was not retrieved by the initial database keyword and subject heading search

Tools to Use

Google Scholar (Cited by), Web of Science (Citing Articles and Related Records), and Scopus (Cited by and Reference Search) — each with slightly different coverage, making all three worth checking

Building a Systematic Search Strategy: From Question to Documented Protocol

A systematic search strategy is a pre-specified, structured, and documented approach to searching the academic literature that is designed to be comprehensive, reproducible, and transparent. The word “systematic” has a specific methodological meaning in research contexts — it does not simply mean “thorough.” It means that the search was conducted according to a defined protocol that specifies what was searched, how, and why; that the protocol was followed consistently; and that the results are documented in a way that allows another researcher to replicate the search independently and arrive at the same result set.

The PICO Framework — Structuring Clinical and Social Science Questions for Search

PICO (Population, Intervention, Comparison, Outcome) is a structured question format used in clinical and social science research to decompose a research question into its searchable components. Each element of the PICO question becomes a concept group in the search matrix, connected by AND between groups and OR within groups for synonym variants.

P — Population: Who are the participants? (e.g., adults with type 2 diabetes, primary school children, nursing home residents). This generates the participant/population search terms.

I — Intervention: What is being done or studied? (e.g., cognitive behavioural therapy, dietary intervention, digital health tools). This generates the intervention or exposure search terms.

C — Comparison: What is being compared to? (e.g., usual care, no intervention, alternative treatment). The comparison element is sometimes omitted in initial searches to maximise recall, then used as a filter in the screening stage.

O — Outcome: What outcomes are measured? (e.g., HbA1c levels, quality of life, hospital readmission rates). Outcome terms are often the most variable in the literature and may be omitted from the initial search string if they significantly reduce recall.

PICO variants exist for different question types: PICOS adds Study design; SPIDER (Sample, Phenomenon of Interest, Design, Evaluation, Research type) is used for qualitative research questions; ECLIPSE is used for health policy questions. The appropriate framework depends on the type of research question and the methodology of studies likely to address it.

Search Strategy Documentation Checklist

Research question stated clearly
PICO or equivalent framework applied
All databases listed with access dates
Full Boolean search strings recorded for each database
Subject headings and thesaurus terms documented
Filters applied listed with rationale
Number of results per database recorded
Deduplication method noted
Inclusion/exclusion criteria pre-specified
PRISMA flow diagram completed
Citation searching sources listed
Grey literature sources searched

Related Research Support

The Search Term Matrix: Organising Synonyms Before Searching

The search term matrix is a pre-search planning tool that organises your concepts and their synonym variants into a table before any searching begins. Each column represents a core concept from your research question; each row within a column contains a synonym, variant spelling, abbreviation, or related term for that concept. Terms within a column are connected by OR; columns are connected by AND. The matrix makes the logical structure of the search string visible before it is coded into a database query, allows you to check for missing synonyms before results are retrieved, and provides the documentation record that systematic search methodology requires.

85%

Proportion of systematic review search strategies that include explicit synonym mapping before database entry

Analysis of Cochrane systematic review search strategies consistently shows that high-quality searches include exhaustive synonym mapping as a pre-search step. Studies comparing searches with and without synonym mapping show that missing synonyms account for the majority of relevant literature not retrieved in initial searches — making pre-search synonym generation the highest-return single investment in search quality.

Interlibrary Loan and Access Barriers: Reaching Literature Your Institution Does Not Subscribe To

Interlibrary loan (ILL) is the mechanism through which university libraries obtain copies of academic items — journal articles, book chapters, conference papers, theses — not available through their own subscriptions or holdings. It is one of the most powerful and least used research tools available to students, primarily because students conflate full-text availability in their institutional database interface with the full extent of accessible literature. These are not the same. When an article appears in your database search but the full text is not immediately accessible, the article is not unavailable — it is obtainable, typically within days to a week, through an ILL request to your library.

How Interlibrary Loan Works in Practice

The ILL process begins when you identify a relevant item — through database searching, citation searching, or a reference in another source — that your institution does not have in full text. You submit an ILL request through your library’s portal (typically accessible from the catalogue or library website), providing the citation details: author, title, journal, volume, issue, year, and page numbers for articles; author, title, publisher, and edition for books. The library locates a holding institution and requests a copy — digitally for journal articles (typically delivered to your institutional email within one to five business days), physically for books (typically one to three weeks for loan).

The critical habit is building ILL requests into your research timeline rather than discovering access problems at the writing stage. When you complete a database search and screen results for relevance, submit ILL requests for all relevant items without immediate full-text access at the same time — before moving to the reading stage. This eliminates the stop-start pattern of discovering inaccessible items one by one during writing and waiting days for each delivery.

For students supporting substantial research projects, our research consultant service and personalised academic assistance can help manage the full research and retrieval workflow, including identifying and accessing literature across multiple institutional and open-access sources.

Open Access as a Parallel Access Route

Beyond institutional subscriptions and interlibrary loan, a substantial and growing proportion of academic literature is available in open access — freely available to anyone regardless of institutional affiliation. Open access articles appear in three main locations: publisher open-access journals and open-access articles within hybrid journals (Gold OA), author-deposited pre-prints and accepted manuscripts in institutional and disciplinary repositories (Green OA), and repositories like PubMed Central (PMC), arXiv, SSRN, and philarchive. Before requesting an item through ILL, check Google Scholar’s “All versions” link and Unpaywall (a browser extension that automatically surfaces legal open-access versions of paywalled articles) — a significant proportion of items assumed to be paywalled have freely accessible author versions in repositories that predate or coincide with the journal publication.

Open access availability by discipline — proportion of literature with legal free access (approximate, based on open access studies)

Physics and Astronomy (arXiv)

~91%

Mathematics

~73%

Biomedical Sciences (PMC)

~68%

Social Sciences

~53%

Humanities

~38%

Law (jurisdiction-specific)

~42%

Grey Literature and Supplementary Sources: Beyond the Peer-Reviewed Journal Article

Grey literature refers to academic and research-adjacent content produced outside traditional commercial journal publishing channels — government reports, policy documents, institutional publications, working papers, pre-prints, conference proceedings, theses and dissertations, standards documents, and the publications of non-governmental and international organisations. For many research questions, grey literature is not a peripheral supplement to peer-reviewed evidence — it is the primary source of current data, practice-level evidence, and real-world implementation findings that peer-reviewed literature lags significantly in documenting.

📊

Government and Official Statistics

National statistics agencies, government departments, and official bodies publish primary data and reports that peer-reviewed literature cites but does not reproduce. For empirical research in public health, education, social policy, economics, and criminology, government statistical publications are foundational sources: ONS (UK), Census Bureau (US), WHO, OECD, World Bank, Eurostat. These are accessed through the publishing agency’s website, not through academic databases.

📝

Theses and Dissertations

Doctoral theses and master’s dissertations contain primary research, comprehensive literature reviews, and methodological detail not available in the published articles derived from them. ProQuest Dissertations and Theses, EThOS (British Library), DART-Europe, and institutional repositories provide access. For literature reviews in established fields, high-quality theses often provide the most thorough treatment of the existing evidence base available.

📄

Pre-Prints and Working Papers

Pre-print repositories — arXiv (sciences and mathematics), SSRN (social sciences and economics), bioRxiv (biology), medRxiv (medicine) — publish research before peer review, often representing the cutting edge of a field six to twenty-four months before journal publication. Working papers from research centres and economics institutes similarly circulate important research ahead of formal publication. Pre-prints require appropriate caveats about their unreviewed status when cited.

🌐

International Organisation Reports

WHO, UNICEF, UNESCO, IMF, World Bank, OECD, ILO, and regional equivalents publish research, policy analyses, and data collections that frequently constitute primary sources for global health, education, economic, and social policy questions. These reports are accessible through the organisations’ websites and are sometimes indexed in databases like Global Health or PAIS International, but direct source website searching is often more current and comprehensive.

📰

Conference Proceedings

Conference papers in disciplines like computer science, engineering, and some social sciences represent the primary venue for new research — often more current and more methodologically innovative than the same work later published in journals. IEEE Xplore, ACM Digital Library, and conference-specific proceedings repositories provide access. In other disciplines, conference papers are preliminary work that precedes journal publication; understanding the publication norms of your specific field determines how to weight conference proceedings evidence.

🔬

Clinical Trial Registries and Protocols

Clinical trial registries (ClinicalTrials.gov, WHO International Clinical Trials Registry Platform) register trials at inception, providing protocol data and outcomes registration that identifies unpublished trials — critical for systematic reviews assessing publication bias. Registered trials with no subsequent publication are evidence of reporting bias; their registration data can inform the scope of the evidence base even when no published results are available.

Grey literature searching requires different strategies from database searching because grey literature is not systematically indexed in academic databases. Effective grey literature coverage uses targeted website searching, database searching for grey literature-specific repositories (Grey Literature Report, OpenGrey/GRIS), and hand-searching of the websites of relevant organisations and government departments. For systematic and scoping reviews, grey literature searching is a formal methodological requirement documented in the search protocol. For dissertation literature reviews, covering at least the most relevant government and organisational sources — even informally — produces significantly more comprehensive evidence bases than journal database searching alone.

Managing and Organising Search Results: Reference Management and the Research Record

A comprehensive literature search across multiple databases, citation chains, and grey literature sources can yield hundreds of records. Managing this volume of material — deduplicating records retrieved from multiple databases, screening titles and abstracts against inclusion criteria, storing full-text articles, and recording the search process — requires a systematic approach and appropriate tools. The absence of systematic result management is where well-constructed searches fail at the translation stage: good retrieval undermined by disorganised storage produces a research record almost as inadequate as poor retrieval.

Step 1: Export before screening

Export all search results from each database immediately after searching, before any screening. Most databases support export in standard formats: RIS, BibTeX, CSV, or direct export to reference managers. Exporting before screening preserves the complete search record — including records later excluded — which is required for systematic review documentation and recommended for any reproducible search. Export includes citation data, abstract, and DOI; do not rely on database-internal saving, which may be lost if your session expires.

Step 2: Import to a reference manager

Import all exported records into a reference management tool — Zotero (free, open-source), Mendeley (free with limitations), or EndNote (institutional access). Reference managers deduplicate records retrieved from multiple databases (a critical function — the same article may appear in five databases as five records), organise records into folders or collections by topic or project, store PDFs of full-text articles linked to their citation records, generate citations and bibliographies in any style, and export annotated note collections alongside citation data. Establishing a reference manager workflow at the start of a research project eliminates the citation management chaos that typically arrives at the writing stage.

Step 3: Deduplication

Records retrieved from multiple databases frequently overlap — the same article indexed in PubMed, CINAHL, and Scopus appears as three separate records when all three searches are exported. Deduplication removes these duplicates before screening. Zotero’s duplicate detection, Rayyan’s deduplication function, and Covidence (for systematic reviews) all support automated deduplication. Manual deduplication of large result sets is time-consuming and error-prone; use automated tools and review the duplicate pairs identified before confirming deletion.

Step 4: Title and abstract screening

Screen the deduplicated record set against pre-specified inclusion and exclusion criteria, working from title and abstract only. Apply inclusion criteria consistently: date range, population, topic focus, study design, and language restrictions should all be specified before screening begins, not adjusted in response to what the results contain. Record the number of records screened and the number included and excluded at each stage. For systematic reviews, screening is conducted by two independent reviewers with a third resolving disagreements; for dissertation literature reviews, single-reviewer screening with documented criteria is standard.

Step 5: Full-text retrieval and assessment

For records passing title and abstract screening, retrieve the full text and assess against the same inclusion/exclusion criteria. Records excluded at full-text stage should be logged with the reason for exclusion — this documentation is required for systematic reviews (the PRISMA exclusion table) and supports retrospective justification of the literature base in any form of research. Retrieve full texts through institutional access links, open-access repositories (Unpaywall), or interlibrary loan. The full-text screening stage typically reduces the included set by a further 50–70% from the title/abstract-included set.

Step 6: Build the documented search record

Complete the search record documentation: all databases searched, dates of searching, all Boolean search strings in full, all filters applied, numbers of records retrieved per database, numbers after deduplication, numbers after title/abstract screening, and numbers after full-text assessment. For systematic reviews, this is presented as a PRISMA flow diagram; for dissertation literature reviews, as a methods subsection describing search methodology. This documentation is the evidence that the literature review is based on a systematic, reproducible search rather than ad hoc selection.

From Search Record to Written Literature Review

The bridge between a completed, documented database search and a written literature review is the synthesis of what the retrieved literature collectively shows — not a summary of each source in turn, but an analysis of what the body of evidence establishes, where it is contested, what methodological patterns shape its findings, and where the gaps lie that your research addresses. The search provides the evidence base; the synthesis produces the argument. Students who need support translating a documented search into a written, analytical literature review can access expert guidance through our specialist research writing services.

Literature Reviews Dissertation Help

Search Errors That Undermine Literature Coverage — and How to Correct Them

Most inadequate database searches share a small number of structural problems that produce predictable gaps in literature coverage. Understanding these errors — and the specific correction for each — is more practically useful than a general injunction to search more carefully. Each error has a specific cause, a specific consequence for the literature retrieved, and a specific technical or methodological fix.

The most common single cause of incomplete literature coverage is not searching in the wrong database — it is failing to generate enough synonyms for the concepts being searched. The literature uses more terminology variation than any individual researcher’s vocabulary reflects.

Reflected in systematic review methodology literature and information literacy research on search term generation

Searching only in databases your institution’s library homepage links to first is not search strategy — it is institutional convenience mapped onto a research methodology. The databases prominently featured in any institution’s library interface reflect licensing decisions, not research relevance.

Principle reflected in academic library information literacy guidelines and systematic review search methodology

Error 1

Single-Database Searching

Searching only one database, however comprehensive, produces systematic gaps because no single database covers all relevant literature. Each database has disciplinary biases, indexing gaps, and date-of-coverage boundaries. The standard for comprehensive searching is two to four databases, selected for their coverage of different aspects of the research question. The correction: identify the two or three databases most relevant to your discipline and question before searching, run equivalent searches in each, and combine the results after deduplication.

Error 2

Natural Language Question Entry

Entering the full research question as a search term produces poor results in all databases because databases do not interpret natural language the way search engines attempt to. “What are the effects of poor sleep on university student performance?” is not a database search string. The correction: decompose the question into its core concepts, generate synonyms for each, and construct a Boolean string. This takes ten minutes of preparation and produces dramatically better retrieval than iterative natural language entry.

Error 3

Omitting Subject Headings

Searching by keyword only — even with well-constructed Boolean strings — misses literature indexed under subject headings that uses different terminology in titles and abstracts. This is especially problematic for older literature and for topics where terminology has changed over time. The correction: before constructing the keyword search, look up each core concept in the relevant database’s thesaurus and add the authorised subject heading terms to the search, combined with OR alongside the keyword synonyms.

Error 4

Applying Full-Text Filters at Search Stage

Filtering to “full text available” before screening results limits the search to what the institution subscribes to — not to what is relevant. This is one of the most consequential search errors because it produces a literature base that appears complete (the researcher retrieved and read everything the database showed them) while being systematically incomplete (many relevant items were excluded before being seen). The correction: never apply full-text filters during searching; apply them only after screening, for non-essential convenience purposes, and use ILL for important items.

Error 5

No Citation Searching

Stopping at database keyword searching without conducting citation searching misses a significant proportion of relevant literature — particularly older foundational work and newer work using different terminology. Citation searching through Google Scholar, Web of Science, or Scopus routinely identifies twenty to thirty percent additional relevant articles not retrieved by keyword and subject heading searches. The correction: after keyword searching is complete and two or three central articles have been identified, conduct systematic backward and forward citation searching from each of these anchor articles.

Error 6

Undocumented Searches

Running searches without recording the exact search strings, databases searched, dates, and filters applied means the search cannot be reproduced, reported, or refined systematically. When a supervisor or reviewer asks how the literature search was conducted, the only honest answer is a reconstruction from memory — unreliable, imprecise, and academically problematic. The correction: document every search in a dedicated search log before beginning to screen results. This documentation takes two minutes per search and protects months of work.

Error 7

Ignoring Grey Literature

For research questions with policy, practice, or current-data dimensions — public health, education policy, social work, environmental management, economics — ignoring grey literature produces a literature base that lags two to five years behind current practice and misses the primary data sources the peer-reviewed literature itself cites. The correction: identify the major governmental, intergovernmental, and institutional sources relevant to your topic and conduct targeted searches of their publications alongside database searching. This does not require the same systematic approach as database searching — targeted website searching of three to five relevant organisations is often sufficient.

Error 8

Over-Restrictive Date Filters

Applying a five- or ten-year date filter as a default practice — rather than as a methodologically justified decision — systematically excludes foundational and seminal literature. Literature reviews that cite only recent sources produce discussions of current debates without the theoretical and empirical context that explains why those debates exist. The correction: search without date restrictions as the default, then use date filters selectively and with explicit rationale — for currency-sensitive topics or as a post-comprehensive-search refinement for very large result sets.

How Search Strategy Connects to Literature Review Quality

The relationship between the quality of a database search and the quality of the literature review it produces is direct and structural. A literature review is only as comprehensive as the evidence base it draws from — which is only as comprehensive as the search that produced it. The specific writing failures that supervisors and markers most commonly flag in literature reviews — missing important sources, presenting a biased picture of the field, failing to engage with counter-evidence, producing a list of summaries rather than a synthesis — are not writing problems. They are search problems, or search-to-synthesis problems, that manifested in the written product.

Bias in the Evidence Base

A search that retrieves only the most-cited, most recent, or most accessible literature produces a biased evidence base that skews toward dominant perspectives and methodologies. The literature review built from this base presents a consensus that may not reflect the actual distribution of views and findings in the field — and a marker with broader disciplinary knowledge will recognise the gap. Systematic multi-database searching with controlled vocabulary reduces this bias by retrieving literature the algorithm would not have surfaced through casual searching.

Missing the Scholarly Conversation

A literature review that cites sources as individual authorities rather than as participants in a conversation with each other is the product of a search that never identified the connections between sources. Citation searching — finding who cites whom and how — is what makes those connections visible. A well-searched literature has a map of relationships between sources; a poorly searched one has a list of sources without visible relationships. The map is the literature review’s structure; the list is just a bibliography.

Currency Without Depth

Literature reviews citing only recent publications — typically a consequence of default date filtering — miss the theoretical and methodological foundations of current debates. Every current controversy in a field has a history; a reviewer who does not know that history cannot accurately characterise the debate. Backward citation searching from current sources is the specific tool that builds temporal depth into a literature base constructed primarily through recent-publication database searching.

For students working on research-intensive assignments — dissertations, systematic reviews, extended research papers — where the quality of the literature search is directly assessed or where the argument depends on comprehensive literature coverage, professional research support can make a material difference to both the process and the outcome. Our literature review service, research consultancy, and dissertation support are available across all disciplines and degree levels. Our team of subject specialists includes researchers with direct experience of systematic search methodology across clinical, social science, and humanities research contexts. For an overview of everything available, visit our full services page.

Frequently Asked Questions About Library Database Navigation

What is the difference between a library database and a search engine?

A library database is a structured, curated collection of academic literature — journal articles, conference papers, theses, reports — selected and indexed according to quality criteria and organised with controlled vocabulary to enable precise scholarly searching. A general web search engine crawls publicly available web content without systematic quality filtering or scholarly indexing, returning results ranked primarily by algorithmic relevance signals including popularity and link structure. Library databases retrieve content from subscription-based and peer-reviewed sources that do not appear in general web search results. They offer structured search fields — author, title, subject heading, abstract — date and publication-type filters, and controlled vocabulary tools that enable targeted retrieval of academic literature. For academic research, databases are the primary search environment. Tools like Google Scholar occupy a middle ground: broader than a specific database, more academically focused than a general search engine, but lacking the precision of controlled vocabulary and systematic indexing that specialist databases provide.

What are Boolean operators and how do they work in database searching?

Boolean operators are logical connectors — AND, OR, and NOT — used to combine search terms in academic database queries. AND narrows a search by requiring all connected terms to appear in the same record: climate change AND agriculture retrieves only records containing both terms. OR broadens a search by retrieving records containing either connected term: adolescent OR teenager retrieves records using either word. NOT excludes records containing the following term: depression NOT weather excludes meteorological records. Parentheses control the order of operations in complex strings: (adolescent OR teenager) AND (depression OR anxiety) AND therapy is a correctly structured three-concept Boolean string. These operators form the syntax of precise database retrieval — they convert a vague research interest into an explicit retrieval instruction. Most academic databases accept Boolean operators in their advanced search interface; some databases treat AND as the default between unconnected terms (Google Scholar does this, for instance), so always check the specific database’s search documentation before assuming default behavior.

What is controlled vocabulary in database searching?

Controlled vocabulary is the set of standardised, authorised terms that database indexers assign to records to categorise their content — regardless of the specific wording used in the original article. When you search using these authorised terms (called subject headings or descriptors), you retrieve all records indexed under that concept, including records whose authors used different terminology. The major controlled vocabulary systems include MeSH (Medical Subject Headings) in PubMed, the EBSCO Thesaurus in EBSCO databases, the APA Thesaurus in PsycINFO, and ERIC Descriptors in the education database ERIC. To use controlled vocabulary, look up your core concepts in the relevant database’s thesaurus tool before searching, identify the authorised subject heading, and add it to your search alongside keyword synonyms. The combination of subject heading searches (for comprehensively indexed literature) and keyword searches (for recently indexed or non-standard terminology) produces the most complete retrieval.

How do I choose which academic databases to search?

Database selection should be driven by disciplinary coverage of your research question, not by institutional familiarity or interface convenience. Identify which databases cover the primary discipline, then consider whether your question has interdisciplinary dimensions requiring additional coverage. Core database selections by discipline: for biomedicine and clinical topics, PubMed is essential; for psychology, PsycINFO; for education, ERIC; for nursing and allied health, CINAHL; for law, Westlaw or LexisNexis; for humanities and social sciences archival depth, JSTOR; for broad multidisciplinary coverage with citation analysis, Web of Science and Scopus. For most research questions, two to four databases covering different aspects of the topic is appropriate — one discipline-specific, one broader multidisciplinary, and one specifically covering the literature type most relevant to the question. Document every database searched with dates and search strings as part of the reproducible search record.

What is truncation and when should I use it?

Truncation uses a wildcard symbol — usually an asterisk (*) — at the end of a word stem to retrieve all words sharing that stem in a single search term: educat* retrieves educate, education, educational, educator, educators, educating. This prevents searches from missing relevant records due to word form variations (plural, past tense, noun versus adjective). Use truncation when: the variants of a word stem are all relevant to your search (child* for child, children, childhood); when a concept appears in both noun and verb forms (communicat*); and when US-UK spelling variants matter. Avoid truncating at too short a stem — com* retrieves far too many unrelated words. Some databases use different truncation characters ($ in LexisNexis, ? in some EBSCO interfaces), so check the specific database’s help page for the correct symbol. Internal wildcards — ? or # within a word — handle spelling variations: wom?n retrieves woman and women; behavio?r retrieves behavior and behaviour.

What is interlibrary loan and when should I use it?

Interlibrary loan (ILL) is a library service that obtains academic items — journal articles, book chapters, conference papers, theses — not available through your institution’s own subscriptions, from other libraries on your behalf. Use it whenever you identify a relevant source through searching that you cannot access in full text through your institutional login. ILL is not a last resort or a slow process reserved for urgent situations — it should be a routine part of the research workflow. When you screen your database search results and identify relevant items without full-text access, submit ILL requests at that point, not after you have tried to find the source through other means for three days. Digital delivery of journal articles typically takes one to five business days. The key error to avoid is treating ILL as optional or exceptional: every relevant article you do not access because it was not immediately available is a gap in your literature coverage that had a simple institutional solution.

What is citation searching and how does it improve literature coverage?

Citation searching uses a known relevant article as the entry point for finding additional relevant literature in two directions. Backward citation searching follows the reference list of a known article to find the sources it drew on — identifying older, foundational literature. Forward citation searching finds all articles that have since cited the known article — identifying newer literature that engaged with it. Together, they trace the full scholarly conversation around a topic through time. Forward citation searching is performed using the “Cited by” function in Google Scholar, “Citing Articles” in Web of Science, or “Cited by” in Scopus. Citation searching consistently adds fifteen to thirty percent additional relevant literature not retrieved by keyword and subject heading database searches — particularly articles using different terminology that would not have appeared in the initial keyword search. It is most productive when the initial database search has already identified two or three highly relevant anchor articles from which to begin citation tracing.

How do I know when my database search is sufficiently comprehensive?

Practical indicators of search adequacy include: new searches using synonym variants are returning sources already identified in earlier searches; citation chains from multiple key articles are converging on the same core set; the same central sources appear in the reference lists of the majority of your retrieved literature (indicating you have identified the field’s foundational work); additional database searching is producing diminishing returns of new relevant results; and you have covered at least two to three databases with appropriate disciplinary coverage for your question. For a dissertation literature review, the combination of Boolean keyword searches across two to four databases, controlled vocabulary searches, citation searching from three to five key articles, and a scan of grey literature from relevant organisations is typically sufficient to demonstrate methodological rigour. The error to avoid is stopping when you have found enough sources to fill the required section length — sufficiency is a coverage criterion, not a quantity criterion.

Related Academic Research and Writing Resources

Extend your research skills: research paper writing · literature review writing · dissertation support · critical analysis papers · data analysis help · citation and referencing · challenging research topics · proofreading and editing · academic integrity · personalised academic help · research consulting · statistical analysis · master’s essay support · essay introduction guide

Library Database Navigation

What Library Database Navigation Involves — and Why Search Methodology Determines Literature Coverage

How Academic Databases Are Organised — the Structure Beneath the Search Box

The Record — The Basic Unit of a Database

Indexing — How Records Are Categorised

The Thesaurus — The Controlled Vocabulary Map

Search Fields — Where Your Query Lands

Filters — Post-Search Refinement Tools

Boolean Operators and Search Logic: The Syntax of Precise Database Retrieval

AND — Narrowing by Intersection

OR — Broadening by Equivalence

NOT — Excluding by Definition

Phrase Searching

Truncation and Wildcards

Subject Headings and Controlled Vocabulary: Searching the Way Databases Think

MeSH — Medical Subject Headings

EBSCO Thesaurus

Discipline Classification System

Keyword Plus and Author Keywords

ERIC Thesaurus

LCSH — Library of Congress Subject Headings

Using the Thesaurus: A Practical Workflow

Filters, Limiters, and Result Refinement: Narrowing Without Losing Relevance

Navigating Discipline-Specific Databases: Matching the Tool to the Research Question

Health and Biomedical Sciences

Social Sciences and Psychology

Humanities and Arts

Law

Education

Business, Economics, and Management

Full-Text Databases vs. Bibliographic Databases — Understanding What You Are Searching

Citation Searching and Reference Tracing: Extending the Literature Search Temporally

Backward Citation Searching — The Literature an Author Drew On

Forward Citation Searching — Who Has Cited This Work Since Publication

Lateral Citation Searching — Articles That Cite the Same Sources

Journal Hand-Searching — Targeted Coverage of Core Publications

Reference Manager Cross-Referencing — Finding Patterns in Your Collection

Seed Articles Needed

Additional Coverage Gain

Tools to Use

Building a Systematic Search Strategy: From Question to Documented Protocol

The PICO Framework — Structuring Clinical and Social Science Questions for Search

Search Strategy Documentation Checklist

Related Research Support

The Search Term Matrix: Organising Synonyms Before Searching

Proportion of systematic review search strategies that include explicit synonym mapping before database entry

Interlibrary Loan and Access Barriers: Reaching Literature Your Institution Does Not Subscribe To

How Interlibrary Loan Works in Practice

Open Access as a Parallel Access Route

Grey Literature and Supplementary Sources: Beyond the Peer-Reviewed Journal Article

Government and Official Statistics

Theses and Dissertations

Pre-Prints and Working Papers

International Organisation Reports

Conference Proceedings

Clinical Trial Registries and Protocols

Managing and Organising Search Results: Reference Management and the Research Record

From Search Record to Written Literature Review

Search Errors That Undermine Literature Coverage — and How to Correct Them

Single-Database Searching

Natural Language Question Entry

Omitting Subject Headings

Applying Full-Text Filters at Search Stage

No Citation Searching

Undocumented Searches

Ignoring Grey Literature

Over-Restrictive Date Filters

How Search Strategy Connects to Literature Review Quality

Bias in the Evidence Base

Missing the Scholarly Conversation

Currency Without Depth

Expert Research Support Across All Academic Disciplines

Frequently Asked Questions About Library Database Navigation

Leave a Comment Cancel