corpora: meaning, definition, pronunciation and examples

C1
UK/ˈkɔː.pər.ə/US/ˈkɔːr.pɚ.ə/

Academic, Technical, Formal

My Flashcards

Quick answer

What does “corpora” mean?

The plural form of 'corpus', meaning collections of written or spoken texts, assembled for linguistic analysis or study.

Audio

Pronunciation

Definition

Meaning and Definition

The plural form of 'corpus', meaning collections of written or spoken texts, assembled for linguistic analysis or study.

Collections of data or body of information used as a basis for research or analysis, especially in linguistics, computational linguistics, and law.

Dialectal Variation

British vs American Usage

Differences

Both regions primarily use 'corpora'. 'Corpuses' is slightly more common in general American English than in British English, where 'corpora' is strongly preferred in academic contexts.

Connotations

In the UK, 'corpora' strongly connotes academic linguistics or lexicography (e.g., the British National Corpus). In the US, it can also be used in corporate or legal contexts (e.g., 'corpora of case law').

Frequency

The word is low-frequency in general usage but high-frequency within linguistics, computational linguistics, and digital humanities in both regions.

Grammar

How to Use “corpora” in a Sentence

corpora of + [noun phrase] (e.g., corpora of legal documents)analyse/build/compile + corporacorpora + [passive verb] (e.g., corpora are used)

Vocabulary

Collocations

strong
large corporalinguistic corporatext corporaanalyse corporabuild corporabalanced corporaparallel corporahistorical corpora
medium
available corporaelectronic corporaspecialised corporaaccess corporacorpora ofuse corporamonitor corpora
weak
different corporaseveral corporavarious corporaexisting corporamajor corpora

Examples

Examples of “corpora” in a Sentence

adjective

British English

  • corpora-based research

American English

  • corpora-driven analysis

Usage

Meaning in Context

Business

Rare, except in data analytics or AI companies developing language models ('training corpora').

Academic

Primary context. Used in linguistics, computational linguistics, language teaching, and digital humanities research.

Everyday

Virtually never used.

Technical

Standard term in Natural Language Processing (NLP), corpus linguistics, and lexicography.

Vocabulary

Synonyms of “corpora”

Strong

text collectionslanguage data sets

Neutral

collectionsdatabasesarchivescompilations

Weak

assembliesanthologies

Vocabulary

Antonyms of “corpora”

individual textsingle sourceanecdote

Watch out

Common Mistakes When Using “corpora”

  • Using it as a singular noun (e.g., 'a large corpora' is wrong).
  • Misspelling as 'corporea' or 'corpera'.
  • Mispronouncing with stress on the second syllable.

FAQ

Frequently Asked Questions

Yes, 'corpuses' is an accepted anglicised plural, especially in non-specialist contexts. However, 'corpora' is the standard Latin plural and is strongly preferred in academic and technical writing.

A corpus is a specific type of database: a systematically assembled collection of authentic texts (written or spoken) designed to be representative of a language or variety, used for linguistic analysis. A general database may not have these specific design criteria.

In British English: /ˈkɔː.pər.ə/ (KOR-puh-ruh). In American English: /ˈkɔːr.pɚ.ə/ (KOR-per-uh). The stress is always on the first syllable.

While its primary use is linguistic, it can be extended metaphorically to other organised collections of evidence or data, such as in legal studies ('corpora of case law') or art history ('corpora of Renaissance drawings'), though this is less common.

The plural form of 'corpus', meaning collections of written or spoken texts, assembled for linguistic analysis or study.

Corpora is usually academic, technical, formal in register.

Phrases

Idioms & Phrases

  • A corpus delicti (a related legal term, not plural)
  • To mine the corpora

Learning

Memory Aids

Mnemonic

Think: 'CORPS of data' – but instead of soldiers (corps), it's a 'CORPUS' of texts. The plural 'corpora' sounds like 'core' data 'for a' study.

Conceptual Metaphor

CORPORA ARE MINES (sources of valuable material to be extracted).

Practice

Quiz

Fill in the gap
Modern lexicographers rely heavily on digital to identify new words and meanings.
Multiple Choice

What is the primary context for the word 'corpora'?