corpora: meaning, definition, pronunciation and examples
C1Academic, Technical, Formal
Quick answer
What does “corpora” mean?
The plural form of 'corpus', meaning collections of written or spoken texts, assembled for linguistic analysis or study.
Audio
Pronunciation
Definition
Meaning and Definition
The plural form of 'corpus', meaning collections of written or spoken texts, assembled for linguistic analysis or study.
Collections of data or body of information used as a basis for research or analysis, especially in linguistics, computational linguistics, and law.
Dialectal Variation
British vs American Usage
Differences
Both regions primarily use 'corpora'. 'Corpuses' is slightly more common in general American English than in British English, where 'corpora' is strongly preferred in academic contexts.
Connotations
In the UK, 'corpora' strongly connotes academic linguistics or lexicography (e.g., the British National Corpus). In the US, it can also be used in corporate or legal contexts (e.g., 'corpora of case law').
Frequency
The word is low-frequency in general usage but high-frequency within linguistics, computational linguistics, and digital humanities in both regions.
Grammar
How to Use “corpora” in a Sentence
corpora of + [noun phrase] (e.g., corpora of legal documents)analyse/build/compile + corporacorpora + [passive verb] (e.g., corpora are used)Vocabulary
Collocations
Examples
Examples of “corpora” in a Sentence
adjective
British English
- corpora-based research
American English
- corpora-driven analysis
Usage
Meaning in Context
Business
Rare, except in data analytics or AI companies developing language models ('training corpora').
Academic
Primary context. Used in linguistics, computational linguistics, language teaching, and digital humanities research.
Everyday
Virtually never used.
Technical
Standard term in Natural Language Processing (NLP), corpus linguistics, and lexicography.
Vocabulary
Synonyms of “corpora”
Strong
Neutral
Weak
Vocabulary
Antonyms of “corpora”
Watch out
Common Mistakes When Using “corpora”
- Using it as a singular noun (e.g., 'a large corpora' is wrong).
- Misspelling as 'corporea' or 'corpera'.
- Mispronouncing with stress on the second syllable.
FAQ
Frequently Asked Questions
Yes, 'corpuses' is an accepted anglicised plural, especially in non-specialist contexts. However, 'corpora' is the standard Latin plural and is strongly preferred in academic and technical writing.
A corpus is a specific type of database: a systematically assembled collection of authentic texts (written or spoken) designed to be representative of a language or variety, used for linguistic analysis. A general database may not have these specific design criteria.
In British English: /ˈkɔː.pər.ə/ (KOR-puh-ruh). In American English: /ˈkɔːr.pɚ.ə/ (KOR-per-uh). The stress is always on the first syllable.
While its primary use is linguistic, it can be extended metaphorically to other organised collections of evidence or data, such as in legal studies ('corpora of case law') or art history ('corpora of Renaissance drawings'), though this is less common.
The plural form of 'corpus', meaning collections of written or spoken texts, assembled for linguistic analysis or study.
Corpora is usually academic, technical, formal in register.
Phrases
Idioms & Phrases
- “A corpus delicti (a related legal term, not plural)”
- “To mine the corpora”
Learning
Memory Aids
Mnemonic
Think: 'CORPS of data' – but instead of soldiers (corps), it's a 'CORPUS' of texts. The plural 'corpora' sounds like 'core' data 'for a' study.
Conceptual Metaphor
CORPORA ARE MINES (sources of valuable material to be extracted).
Practice
Quiz
What is the primary context for the word 'corpora'?