General Service List


The General Service List (GSL) is a list of roughly 2,000 words published by Michael West in 1953.[1] The words were chosen to represent the most frequent words of English and were drawn from a corpus of written English. The intended audience was learners of English and ESL teachers. To maximize the list's utility, some frequent words whose meanings overlapped broadly with words already on the list were omitted. The original publication also gave the relative frequencies of the various senses of each word.

Details

The list matters because a person who knows all of its words and their related families would understand approximately 90–95 percent of colloquial speech and 80–85 percent of common written texts. The list consists only of headwords: the word "be", for example, is high on the list, and the learner is assumed to know all of its forms, e.g. am, is, are, was, were, being, and been.
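Coverage figures of this kind are the share of running words (tokens) in a text that belong to a word family on the list. The following is a minimal sketch of that calculation, not West's procedure: the `FAMILIES` table, the function names, and the simple regular-expression tokenizer are all assumptions made for illustration, and a real calculation would load the full set of roughly 2,000 headwords with their families.

```python
import re

# Hypothetical miniature word families: headword -> inflected forms.
# The real GSL has roughly 2,000 headwords; these five are illustrative only.
FAMILIES = {
    "be": {"am", "is", "are", "was", "were", "being", "been"},
    "the": set(),
    "cat": {"cats"},
    "sit": {"sits", "sat", "sitting"},
    "on": set(),
}

def build_lookup(families):
    """Map every headword and every family member to its headword."""
    lookup = {}
    for head, forms in families.items():
        lookup[head] = head
        for form in forms:
            lookup[form] = head
    return lookup

def coverage(text, families):
    """Return the fraction of tokens in `text` that belong to a listed family."""
    lookup = build_lookup(families)
    # Assumed tokenizer: lowercase runs of letters and apostrophes.
    tokens = re.findall(r"[a-z']+", text.lower())
    if not tokens:
        return 0.0
    covered = sum(1 for token in tokens if token in lookup)
    return covered / len(tokens)

sample = "The cats were sitting on the veranda."
print(f"{coverage(sample, FAMILIES):.0%}")  # 6 of 7 tokens covered -> 86%
```

Because "were" and "sitting" are matched through their families, knowing the headwords "be" and "sit" counts as covering them; only "veranda" falls outside this toy list.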

Researchers have expressed doubts about the adequacy of the GSL because of its age and the relatively low coverage provided by the words beyond the first 1,000 on the list.[2] Engels, in particular, was critical of the limited vocabulary chosen by West (1953). He agreed that the first 1,000 words of the GSL were good selections, given their high frequency and wide range, but argued that the words beyond the first 1,000 could not be considered general service words because their range and frequency were too low. More recent research by Billuroğlu and Neufeld (2005) confirmed that the GSL needed minor revision, although its headwords still provide approximately 80% text coverage in written English. That research showed that the GSL contains a small number of archaic terms, such as shilling, while excluding words that have gained currency since the first half of the twentieth century, such as plastic, television, battery, okay, victim, and drug.
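The concern about words beyond the first 1,000 can be examined by splitting a text's tokens into frequency bands and comparing each band's share. The sketch below assumes two hypothetical toy bands named `K1` and `K2`; the word sets, the `band_profile` function, and the tokenizer are illustrative assumptions, not the actual GSL bands or any published tool's method.

```python
import re
from collections import Counter

# Hypothetical toy bands. A real profile would load the first and second
# 1,000 GSL headwords (with their families) instead of these few words.
BANDS = {
    "K1": {"the", "a", "is", "on", "in", "house", "man"},
    "K2": {"roof", "ladder"},
}

def band_profile(text, bands):
    """Return each band's share of tokens; unmatched tokens are 'off-list'."""
    counts = Counter()
    for token in re.findall(r"[a-z']+", text.lower()):
        for name, words in bands.items():
            if token in words:
                counts[name] += 1
                break
        else:
            # No band contained the token.
            counts["off-list"] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {name: n / total for name, n in counts.items()}

profile = band_profile("The man is on the roof in a hailstorm.", BANDS)
for name, share in profile.items():
    print(f"{name}: {share:.0%}")
```

In this toy sentence seven of the nine tokens fall in the first band and one in the second, which mirrors the general pattern the critics describe: the second band adds far less coverage than the first.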

The GSL evolved over several decades before West’s publication in 1953. It is not based solely on frequency; it also includes groups of words selected on a semantic basis.[3] Today no version of the GSL is in print; it exists only online. Several versions circulate on the Internet, and attempts have been made to improve the list.[4]

There are two major updates of the GSL:

  1. the New General Service List (new-GSL) by Brezina & Gablasova, published in Applied Linguistics in 2013. This word list is based on an analysis of four language corpora with a combined size of over 12 billion words.[5]
  2. the New General Service List (NGSL), published in March 2013 by Browne, Culligan and Phillips. The NGSL is based on a 273-million-word subsection of the 2-billion-word Cambridge English Corpus. Preliminary results show that the new list provides substantially higher coverage with fewer words.[6]

Notes

  1. West, M. 1953. A General Service List of English Words. London: Longman, Green and Co.
  2. Engels, 1968
  3. Nation & Waring, 2004; Dickins
  4. Bauman & Culligan, 1995
  5. Brezina, V. & Gablasova, D. (2013) "Is There a Core General Vocabulary? Introducing the New General Service List" Applied Linguistics
  6. Browne, Culligan & Phillips, 2013

External links

  • Bauman's revised GSL: a 1995 revised version of the GSL with minor changes, along with a more detailed discussion of the problems in the GSL.
  • A short history of Bauman's revised GSL: a short history of the GSL with links to translations into 11 languages.
  • PC-based vocabulary profiling software that includes the GSL:
  • Lextutor Vocabulary Profilers, provided free by Tom Cobb, include several web-based vocabulary profilers into which you can paste any text; the words are then 'coloured' according to frequency band. Here are two:
    • Classic Vocabulary Profiler, which produces coloured output: blue for K1 (the first 1,000 words of the GSL), green for K2 (the second 1,000 words of the GSL), yellow for the Academic Word List (AWL), and red for words that are not in any of the lists.
    • BNL profiler, which uses a revised word list for students learning English that overcomes the problems of treating the GSL and AWL as separate and distinct constructs.
  • Other web-based vocabulary profilers include:
    • OGTE (Online Graded Text Editor) provided free by Charles Browne and Rob Waring. The tool allows teachers and authors to analyze and edit texts to a specific level using the GSL, NGSL, AWL and other important vocabulary lists.
    • OUP3000 text checker.
    • OKAPI returns formatted CBA probes or a readability analysis, with bands for Grades 1-3 (US) and Grades 4+.
    • WORDLE provides a graphic representation of words by frequency in any text, but is not yet linked to any specific vocabulary profiling bands.