How Many Idioms Are There in the English Language?
The English language, with its rich history of borrowing, evolution, and innovation, contains an extraordinary number of idioms. These figurative expressions, often deeply entrenched in cultural and historical contexts, challenge even fluent speakers due to their non-literal meanings. But just how many idioms does English actually have? Despite decades of linguistic inquiry, the answer remains elusive and somewhat speculative. However, several linguistic theories and methodologies help estimate, explain, and contextualize the potential number of idioms in English.
To understand the scale of idiomatic expression, one must first establish a working definition. In its broadest sense, an idiom is a phrase whose meaning cannot be inferred from the literal interpretation of its individual words. Common examples include expressions like “kick the bucket” (to die) or “spill the beans” (to reveal a secret). However, definitions can vary:
Strict Idioms: Fixed expressions with non-compositional meanings, such as “let the cat out of the bag.”
Semi-idioms: Phrases with partially transparent meanings, like “give someone the cold shoulder.”
Collocational idioms: Highly frequent word combinations that have become conventionalized, such as “strong tea” or “make a decision.”
The breadth of this definition affects how one counts idioms. A narrow definition results in a smaller corpus; a broader one captures thousands more.
There is no universally accepted number of idioms in English. The figure varies greatly depending on what is classified as an idiom, the inclusion of regional and historical idioms, and whether new idiomatic constructions such as memes or slang-derived phrases are considered.
1. Lexicographical Estimates
Major English dictionaries and idiom-specific reference works list between 5,000 and 25,000 idioms. However, even the most exhaustive idiom dictionaries acknowledge that they capture only a portion of idiomatic usage. These figures typically exclude obsolete, regional, or emerging idioms. Moreover, dictionaries tend to curate idioms for common usage or pedagogical value, not for completeness.
2. Corpus-Based Studies
Corpus linguistics—the study of language through real-world text collections—offers a more empirical approach. Large corpora like the British National Corpus (BNC) or the Corpus of Contemporary American English (COCA) help identify frequent idiomatic expressions. By analyzing lexical bundles, phrasal verbs, and fixed collocations, some researchers suggest that English could have well over 100,000 idiomatic or semi-idiomatic expressions in active and passive use.
Such studies reveal the dynamic and productive nature of idioms: new ones emerge (e.g., “throw under the bus”), while older ones fade. Additionally, idioms evolve syntactically while retaining their idiomatic core, complicating exact counts.
A. Productivity of Language
From a generative grammar perspective, idiomatic expressions can be seen as a productive aspect of language. Idioms arise from metaphorical extensions, cultural references, and pragmatic patterns. For example, idioms can be generated through analogy or variation:
“Kick the bucket” → “Kick the proverbial bucket”
“Hit the roof” → “Hit the ceiling”
This suggests an open-ended potential for idiom creation, especially as societal and technological contexts change. Social media, for instance, has introduced idioms such as “slide into DMs” or “cancel someone.”
B. Cognitive Linguistics and Conceptual Metaphor Theory
According to cognitive linguists, idioms are not just memorized phrases but manifestations of deeper cognitive structures. Conceptual Metaphor Theory (CMT), developed in the 1980s, argues that many idioms are rooted in metaphorical thinking. For example:
ANGER IS HEAT → “He blew his top.”
LIFE IS A JOURNEY → “She’s at a crossroads.”
Since our conceptual metaphors are finite but recursively applicable, idioms can proliferate based on conceptual combinations. This opens the door to thousands of potential idiomatic formations, many of which may not be codified but are readily understood in context.
C. Sociolects, Dialects, and Cultural Variation
The number of idioms also expands when considering regional varieties of English (e.g., Indian English, African American Vernacular English, Australian English) and sociolects. Each community brings its own idiomatic expressions, some of which are intelligible only within that context. These idioms may not be represented in formal registers but are vital to linguistic diversity.
Several factors complicate the enumeration of idioms in English:
Fuzziness of Boundaries: Where does an idiom end and a collocation begin? What about proverbs, clichés, or slogans?
Context Dependence: Some expressions are idiomatic only in certain contexts or registers.
Evolving Usage: Language is fluid. Idioms evolve, hybridize, or die out.
Emergent Idioms: Memes, internet slang, and popular culture continuously generate novel idiomatic expressions.
While it is impossible to provide a definitive number of idioms in the English language, expert estimates and linguistic theory suggest that the number could range from tens of thousands to well over 100,000, depending on criteria and methodology. Idioms are not a finite set but a living, evolving dimension of English. They reflect the creativity, history, and culture of its speakers—infinitely generative, cognitively grounded, and socially embedded. The true number of idioms may never be precisely known, but their richness and pervasiveness are undeniable.