Wordlists |
A collection of wordlists useful for language education and research.
Available wordlists:
The zip file contains 10 XML files, each corresponding to the dominant vocabulary extracted from spoken corpora of as many English varieties, namely, Canada, East Africa, Hong Kong, India, Ireland, Jamaica, the Philippines, Singapore, the UK, and the USA.
The documentation is available at Dominant vocabularies of 10 spoken corpora from as many varieties of English.
Download link | Size (KB) | File type | Notes |
DOVOs 10 EV | 218 KB | zip | Corpus data, word units,... |
Function words are characterized by their ambiguous lexical meaning and by their capacity to organize grammatical relationships between words within a sentence. There are a relatively small and fixed number of function words (as opposed to verbs, nouns, adjectives, and adverbs, which are limited but expandable sets). Prepositions, conjunctions, determiners, pronouns, and auxiliary verbs are all considered function words. Most of these words are uninflected although a few are inflected and may take affixes.
Ideally, it would be possible to list all function words (since they comprise a closed class) but this is a surprisingly difficult thing to do. Nonetheless, the objective is to provide exhaustive lists of the function words in the English language. Contributions are welcome.
Download link | Size (KB) | File type | Notes |
English Function Words | 4.08 kb | zip | Function words of English |
Types of function words:
Auxiliary Verbs are verbs whose function is to characterize the main verbs they accompany with shades of meaning pertaining to tense and/or modality. Regarding tense, the core meaning of the verb can be modified to express perfect, progressive, or passive voices. Regarding modality, the main verb is altered to denote judgement or opinion in terms of ability, advice, expectation, intention/willingness, likelihood, necessity, permission/prohibition, or degrees of politeness.
Auxiliary verbs are necessary to form questions and negatives in English. If auxiliary verbs are used only to serve these functions, they are referred to as dummy auxiliaries. Additionally, the auxiliaries 'do', 'does', and 'did' can be inserted preceding the main verb for emphasis. Modal verbs are distinguished from other auxiliary verbs by their inability to function as main verbs and their lack of complete conjugations (infinitive for example).
Conjunctions are uninflected function words that serve to conjoin words, clauses, phrases, or sentences. There are three basic forms: single word (however), compound (as long as), and correlative (so ... that). In terms of function, conjunctions can be grouped into additive (so, thus), adversative (but, instead), causative (so, because), and temporal (after, then).
Conjunctions are not structural elements in a clause. Rather, they are external elements that establish grammatical relations (coordination, correlation, subordination) between clauses. Certain adverbial and prepositional phrases can also act as conjunctions (subsequently, in addition to that).
Determiners are inflected function words employed as noun modifiers and that serve to alter the referents of noun phrases in terms of amount, location, possession, and general versus specific. In terms of form, determiners are simple (two, their, the) or compound (a number of, one half, a little). Also, possessive and demonstrative adjectives are considered determiners.
The determiner class is often divided into articles (a, an, the), determiners (both, neither, whichever), and quantifiers (much, various, little).
Prepositions are uninflected function words that combine with nouns, pronouns, or noun phrases to form prepositional phrases that can have, in turn, adverbial or adjectival relationships with other words. Prepositions can be simple (as, of) or compound (next to, in view of) forms. In terms of function, at least the following types of preposition can be distinguished: time (until, circa), location (along, amid), logical (since, given), possession (including, pertaining to), and movement (toward, to).
Prepositions can also occur in post position with: nouns (interest in, need for), adjectives (familiar with, sure of), participles (married to, made of), and verbs (give up, look forward). In this situation, the composite can be though of as a unit.
Pronouns are inflected function words employed in place of nouns or noun phrases. In terms of form, pronouns are simple (nothing, herself) and compound (each other, one another). Also, some pronoun composites are used in relative clauses (all of whom, several of which).
Pronouns are classified into the following classes: subject personal (I, he, we), object personal (me, him, us), possessive (mine, his, ours), reflexive (myself, himself, ourselves), demonstrative (this, these, such), relative (who, all, that), indefinite (each, anybody, none), reciprocal (each other, one another), and interrogative (how, who, why). Additionally, reflexives also operate as so-called intensive pronouns when are employed to emphasize an antecedent noun or pronoun (as in, "The boss himself prepared the coffee" or "I myself could not believe it").
The General Service List (GSL) by West (A General Service List of English Words, 1953, London; Longman, Green & Co.) is a well-known list that has withstood the test of time and comprises around 2,000 of the most frequent words in English.
The download below contains several word lists related to the GSL and, in particular, the GSL headwords, family words by item and by headword, minimal pairs, and headwords by part of speech.
Download link | Size (KB) | File type | Notes |
GSL Set | 81.1 kb | zip | Headwords, families, MPs,... |
The Academic Word List (AWL) by Coxhead (A New Academic Word List. TESOL Quarterly, 34(2), 2000: 213-238) contains some 570 words specific to academic texts where they account for about 9-10% of running words.
The download below contains several word lists related to the AWL and, in particular, the AWL headwords, family words by item and by headword, and headwords by part of speech.
Download link | Size (KB) | File type | Notes |
AWL Set | 30.7 KB | zip | Headwords, families,... |