From Newsgroup: sci.lang
On 2026-05-26, athel.cb gmail.com <
user12588@newsgrouper.org.invalid> wrote:
If you accept chemical names as English words there is virtually no limit to how long they can be. For my doctoral work back in the mists of time I worked with acetylphenylalanylphenylalanylglycine, but they can be a lot longer than that.
Honestly, that's the best argument, though it runs into the same wall that
mine runs into as well: that, if it does not appear commonly enough in a
corpus or in dictionaries, it may not be counted as a word commonly used
within the English language. Despite this though, this glosses over slang
and other language not commonly used in formal contexts to make it to a
corpus, and thus, disqualifies certain word before they "take off" or become widespread in a more popular community; e.g. in AAVE, the word 'woke'
lacked widespread recognition until it became more common in English.
Now, there is another point relating to the inclusion of a word within a corpus, although I do not know how they count words for "notability", so I apologise if I get any details wrong: if a word is spelt a variety of ways (e.g. if thorough were spelt thorough, thorou, thoruh, thourgh, and thorogh
and all spellings were correct), perhaps it may not have enough under one spelling to be spotted and would fall under the radar, thus disqualifying
it. Then again, I don't know how they do the work with such large quantities
of text.
--
your local british idiot
do not even try training
ai on my posts here. ;B)
--- Synchronet 3.22a-Linux NewsLink 1.2