A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to commotion in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
commotion (0) - 32 freq
'commotion' (2) - 1 freq
commutin (2) - 1 freq
communion (2) - 17 freq
condition (3) - 36 freq
formation (3) - 11 freq
commitit (3) - 5 freq
commonity (3) - 177 freq
compston (3) - 2 freq
commodity (3) - 1 freq
committin (3) - 2 freq
companion (3) - 26 freq
commontie (3) - 2 freq
comfortin (3) - 14 freq
summation (3) - 1 freq
commination (3) - 2 freq
motion (3) - 47 freq
communin (3) - 1 freq
comission (3) - 1 freq
commission (3) - 54 freq
coalition (3) - 8 freq
commeision (3) - 1 freq
completion (3) - 4 freq
common (3) - 301 freq
communions (3) - 2 freq
commotion (0) - 32 freq
commutin (2) - 1 freq
communion (3) - 17 freq
computin (4) - 3 freq
summation (4) - 1 freq
commoun (4) - 3 freq
communin (4) - 1 freq
committin (4) - 2 freq
commination (4) - 2 freq
commentin (4) - 4 freq
commadin (4) - 1 freq
common (4) - 301 freq
commitit (4) - 5 freq
commeision (4) - 1 freq
commin (4) - 8 freq
competin (4) - 3 freq
'commotion' (4) - 1 freq
commuity (5) - 1 freq
competan (5) - 2 freq
commonties (5) - 18 freq
promotion (5) - 21 freq
commited (5) - 2 freq
competeen (5) - 1 freq
commits (5) - 7 freq
commone (5) - 2 freq
SoundEx code - C535
continent - 19 freq
content - 119 freq
contents - 35 freq
contempt - 27 freq
contained - 11 freq
continue - 71 freq
continuin - 13 freq
commotion - 32 freq
condemn - 19 freq
countin - 13 freq
continued - 69 freq
contemptuous - 4 freq
coontin - 47 freq
chauntin - 3 freq
cantiness - 2 freq
contain - 15 freq
contemplate - 8 freq
countenance - 3 freq
condemned - 22 freq
conteins - 2 freq
continents - 9 freq
contenders - 1 freq
cuntin - 1 freq
contend - 4 freq
canteen - 20 freq
contineed - 1 freq
'continue - 1 freq
continental - 10 freq
conteenwal - 1 freq
contint - 1 freq
contemporary - 29 freq
continues - 32 freq
commitment - 52 freq
coontenance - 3 freq
condense - 2 freq
condammt - 1 freq
contemplation - 8 freq
chantin - 28 freq
centenary - 8 freq
continuation - 2 freq
contented - 8 freq
contentit - 24 freq
contemplative - 1 freq
contentment - 5 freq
contint's - 1 freq
cummetmint's - 1 freq
cummedien - 1 freq
comitmint's - 1 freq
cuntain - 1 freq
contented-like - 1 freq
canadian - 10 freq
comedian - 5 freq
contemplated - 4 freq
continuous - 12 freq
contemplatit - 3 freq
condone - 1 freq
contaminated - 1 freq
comedians - 7 freq
contemplatin - 8 freq
countdown - 1 freq
contaminate - 3 freq
containers - 2 freq
contemplates - 2 freq
coontin's - 1 freq
contints - 1 freq
containin - 7 freq
continuing - 5 freq
continuan - 1 freq
contemptuously - 6 freq
contemporarie - 4 freq
cantons - 1 freq
canton - 2 freq
container - 6 freq
contentious - 1 freq
continually - 6 freq
contains - 13 freq
conteinuitie - 1 freq
continuum - 10 freq
commïttin - 2 freq
conteens - 3 freq
countence - 1 freq
containues - 1 freq
contentedly - 1 freq
committin - 2 freq
coundna - 1 freq
conteened - 3 freq
condensation - 2 freq
condenser - 1 freq
conteenual - 1 freq
continuity - 6 freq
cantonese - 3 freq
commitments - 36 freq
contemporar - 11 freq
cantankerous - 3 freq
coontan - 3 freq
condensed - 5 freq
contened - 1 freq
contined - 4 freq
contenin - 1 freq
containment - 2 freq
contamienaetion - 1 freq
counting - 4 freq
condoned - 1 freq
condimunts - 1 freq
committment - 6 freq
committin' - 1 freq
committments - 4 freq
containt - 2 freq
contumacious - 2 freq
conteen - 3 freq
countan - 1 freq
conteenue - 2 freq
conteenuous - 1 freq
cantin - 4 freq
conteenas - 3 freq
commeitment - 3 freq
chauntan - 1 freq
conteined - 2 freq
conteint - 1 freq
contender - 1 freq
conteenin - 2 freq
centimetres - 2 freq
contending - 1 freq
contemporaneous - 1 freq
condoms - 3 freq
contamination - 2 freq
canadiane - 2 freq
condemnations - 1 freq
conteena - 3 freq
continewit - 1 freq
conteenuum - 1 freq
commutin - 1 freq
centimetre - 1 freq
commadin - 1 freq
conteenually - 1 freq
continua - 1 freq
chanting - 3 freq
contantly - 1 freq
contemporars - 3 freq
conteenye - 1 freq
conteenyed - 5 freq
condamnin - 1 freq
committing - 2 freq
contemplating - 2 freq
€˜commitment - 1 freq
contenna - 1 freq
condemnatory - 1 freq
contaminatit - 1 freq
coantents - 1 freq
commeetments - 1 freq
coontdoon - 2 freq
contemption - 1 freq
€œcoonting - 1 freq
coonting - 4 freq
contentions - 1 freq
camden - 1 freq
centenaries - 1 freq
continuit - 1 freq
continuallie - 1 freq
coontins - 1 freq
conteenued - 1 freq
€œchantan - 1 freq
chantan - 4 freq
contemprir - 3 freq
chindian - 1 freq
condemnation - 1 freq
contemporaries - 1 freq
candomin - 1 freq
continual - 2 freq
condom - 1 freq
'commotion' - 1 freq
continuously - 1 freq
cmtnaogs - 1 freq
contentbible - 1 freq
MetaPhone code - KMXN
commotion - 32 freq
commeision - 1 freq
commeisioun - 3 freq
'commotion' - 1 freq
COMMOTION
Time to execute Levenshtein function - 0.204420 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.383132 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.031491 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037430 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000878 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.