Talk:Most common words in English

From Simple English Wikipedia, the free encyclopedia
Jump to navigation Jump to search

Confiusing and confused[change source]

I don't understand this article. I checked the references. The first two do not seem to have any lists at all. The later ref has a list, but it is not of lemmas, rather of actual words (called "word forms"). In addition, that list is based on literature -- which is perhaps why the word "case" comes in so high (maybe due to "In case"). In any case, this list is a little suspect, and the article confusing. Kdammers (talk) 05:06, 16 September 2012 (UTC)

Everyone's entitled to their opinion. I've added to the intro a definition of what is meant by a word (in this context). Macdonald-ross (talk) 12:56, 16 September 2012 (UTC)

short lists[change source]

Why does the noun list have only 21 words? (I can guess that the prep list is short because of rarity of additional preps.) Kdammers (talk) 15:53, 30 April 2015 (UTC)

Word Frequencies in Written and Spoken English: based on the British National Corpus.[change source]

This book by Geoffrey Leech, Paul Rayson and Andrew Wilson [1] has a companion website (web archive) which provides wordlists under a Creative Commons Attribution-Share Alike 2.0 UK: England & Wales License. These lists include all words that occur more than ten times in the 100,000,000 word corpus of (then) present-day English. The British National Corpus project "was carried out and is managed by an industrial/academic consortium lead [sic] by Oxford University Press, of which the other members are major dictionary publishers Addison-Wesley Longman and Larousse Kingfisher Chambers; academic research centres at Oxford University Computing Services, Lancaster University's Centre for Computer Corpus Research on Language, and the British Library's Research and Innovation Centre." [2]

I believe some of the above would be a useful addition to the article but that is protected. I would particularly like to add this external link: https://web.archive.org/web/20191118100553/http://ucrel.lancs.ac.uk/bncfreq/lists/1_2_all_freq.txt. I'm thinking of creating a shared Google Sheet from the data but it will take a little time to do that properly. (Note: I have used citation templates above so that I can easily make citations in other articles.)--GrounderUK (talk) 20:02, 5 June 2020 (UTC)
  1. Leech, Geoffrey; Rayson, Paul; Wilson, Andrew (2001). Word Frequencies in Written and Spoken English: based on the British National Corpus. Longman, London. ISBN 0582-32007-0.
  2. "Companion Website for: Word Frequencies in Written and Spoken English: based on the British National Corpus". Archived from the original on 6 November 2019. Retrieved 2020-06-05.