A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to contents in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
contents (0) - 35 freq
contexts (1) - 15 freq
content (1) - 117 freq
contests (1) - 6 freq
contints (1) - 1 freq
coantents (1) - 1 freq
conceits (2) - 2 freq
convent (2) - 3 freq
conters (2) - 1 freq
contenna (2) - 1 freq
contint's (2) - 1 freq
connects (2) - 4 freq
contened (2) - 1 freq
continents (2) - 9 freq
conteens (2) - 3 freq
contest (2) - 16 freq
context- (2) - 1 freq
contentit (2) - 20 freq
comments (2) - 65 freq
conteins (2) - 2 freq
contint (2) - 1 freq
conteint (2) - 1 freq
convenes (2) - 1 freq
concepts (2) - 10 freq
conteenas (2) - 3 freq
contents (0) - 35 freq
contints (1) - 1 freq
coantents (1) - 1 freq
contexts (2) - 15 freq
content (2) - 117 freq
contests (2) - 6 freq
conteint (3) - 1 freq
contint (3) - 1 freq
conteins (3) - 2 freq
conteenas (3) - 3 freq
contentit (3) - 20 freq
contentious (3) - 1 freq
contorts (3) - 1 freq
intents (3) - 2 freq
contacts (3) - 7 freq
contented (3) - 8 freq
contint's (3) - 1 freq
conteens (3) - 3 freq
continents (3) - 9 freq
coontins (4) - 1 freq
convent (4) - 3 freq
contantly (4) - 1 freq
concerts (4) - 14 freq
continues (4) - 30 freq
coontin's (4) - 1 freq
SoundEx code - C535
continent - 17 freq
content - 117 freq
contents - 35 freq
contempt - 27 freq
contained - 10 freq
continue - 69 freq
continuin - 13 freq
commotion - 31 freq
condemn - 18 freq
countin - 13 freq
continued - 69 freq
contemptuous - 4 freq
coontin - 47 freq
chauntin - 3 freq
cantiness - 2 freq
contain - 14 freq
contemplate - 8 freq
countenance - 3 freq
condemned - 22 freq
conteins - 2 freq
continents - 9 freq
contenders - 1 freq
cuntin - 1 freq
contend - 4 freq
canteen - 20 freq
contineed - 1 freq
'continue - 1 freq
continental - 10 freq
conteenwal - 1 freq
contint - 1 freq
contemporary - 29 freq
continues - 30 freq
commitment - 52 freq
coontenance - 3 freq
condense - 2 freq
condammt - 1 freq
contemplation - 8 freq
chantin - 28 freq
centenary - 8 freq
continuation - 2 freq
contented - 8 freq
contentit - 20 freq
contemplative - 1 freq
contentment - 5 freq
contint's - 1 freq
cummetmint's - 1 freq
cummedien - 1 freq
comitmint's - 1 freq
cuntain - 1 freq
contented-like - 1 freq
contemplatit - 3 freq
condone - 1 freq
contaminated - 1 freq
comedians - 7 freq
contemplatin - 8 freq
countdown - 1 freq
contaminate - 3 freq
contemplated - 3 freq
containers - 2 freq
contemplates - 2 freq
coontin's - 1 freq
contints - 1 freq
containin - 7 freq
continuing - 5 freq
continuan - 1 freq
contemptuously - 6 freq
continuous - 11 freq
contemporarie - 4 freq
cantons - 1 freq
canton - 2 freq
container - 6 freq
contentious - 1 freq
continually - 6 freq
contains - 13 freq
conteinuitie - 1 freq
continuum - 10 freq
commïttin - 2 freq
conteens - 3 freq
countence - 1 freq
containues - 1 freq
contentedly - 1 freq
committin - 2 freq
coundna - 1 freq
conteened - 3 freq
canadian - 5 freq
condensation - 2 freq
condenser - 1 freq
conteenual - 1 freq
continuity - 6 freq
cantonese - 3 freq
comedian - 4 freq
commitments - 36 freq
contemporar - 11 freq
cantankerous - 3 freq
coontan - 3 freq
condensed - 5 freq
contened - 1 freq
contined - 4 freq
contenin - 1 freq
containment - 2 freq
contamienaetion - 1 freq
counting - 4 freq
condoned - 1 freq
condimunts - 1 freq
committment - 6 freq
committin' - 1 freq
committments - 4 freq
containt - 2 freq
contumacious - 2 freq
conteen - 3 freq
countan - 1 freq
conteenue - 2 freq
conteenuous - 1 freq
cantin - 4 freq
conteenas - 3 freq
commeitment - 3 freq
chauntan - 1 freq
conteined - 2 freq
conteint - 1 freq
contender - 1 freq
conteenin - 2 freq
centimetres - 2 freq
contending - 1 freq
contemporaneous - 1 freq
condoms - 3 freq
contamination - 2 freq
canadiane - 2 freq
condemnations - 1 freq
conteena - 3 freq
continewit - 1 freq
conteenuum - 1 freq
commutin - 1 freq
centimetre - 1 freq
commadin - 1 freq
conteenually - 1 freq
continua - 1 freq
chanting - 3 freq
contantly - 1 freq
contemporars - 3 freq
conteenye - 1 freq
conteenyed - 5 freq
condamnin - 1 freq
committing - 2 freq
contemplating - 2 freq
€˜commitment - 1 freq
contenna - 1 freq
condemnatory - 1 freq
contaminatit - 1 freq
coantents - 1 freq
commeetments - 1 freq
coontdoon - 2 freq
contemption - 1 freq
€œcoonting - 1 freq
coonting - 4 freq
contentions - 1 freq
camden - 1 freq
centenaries - 1 freq
continuit - 1 freq
continuallie - 1 freq
coontins - 1 freq
conteenued - 1 freq
€œchantan - 1 freq
chantan - 4 freq
contemprir - 3 freq
chindian - 1 freq
condemnation - 1 freq
contemporaries - 1 freq
candomin - 1 freq
continual - 2 freq
condom - 1 freq
'commotion' - 1 freq
continuously - 1 freq
cmtnaogs - 1 freq
contentbible - 1 freq
MetaPhone code - KNTNTS
contents - 35 freq
contint's - 1 freq
contints - 1 freq
coantents - 1 freq
CONTENTS
Time to execute Levenshtein function - 0.295114 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.526330 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.073577 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037522 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000806 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.