A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie


Similar words to content in Corpus

Methods: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated

Levenshtein (distance in brackets)
content (0) - 119 freq
contest (1) - 16 freq
contend (1) - 4 freq
contint (1) - 1 freq
convent (1) - 3 freq
context (1) - 80 freq
contents (1) - 35 freq
consent (1) - 11 freq
conteint (1) - 1 freq
conteena (2) - 3 freq
contenna (2) - 1 freq
contints (2) - 1 freq
intent (2) - 37 freq
countet (2) - 1 freq
contenin (2) - 1 freq
conteen (2) - 3 freq
cogent (2) - 1 freq
oontment (2) - 1 freq
contexts (2) - 15 freq
conteins (2) - 2 freq
concept (2) - 40 freq
concernt (2) - 15 freq
cantelt (2) - 1 freq
continent (2) - 19 freq
convert (2) - 12 freq
Double Levenshtein (distance in brackets)
content (0) - 119 freq
contint (1) - 1 freq
conteint (1) - 1 freq
consent (2) - 11 freq
contents (2) - 35 freq
containt (2) - 2 freq
context (2) - 80 freq
contest (2) - 16 freq
contend (2) - 4 freq
convent (2) - 3 freq
contened (3) - 1 freq
cantelt (3) - 1 freq
contentit (3) - 24 freq
continent (3) - 19 freq
entent (3) - 1 freq
contented (3) - 8 freq
contact (3) - 91 freq
constant (3) - 72 freq
coantents (3) - 1 freq
coontert (3) - 1 freq
conteens (3) - 3 freq
conteins (3) - 2 freq
intent (3) - 37 freq
contints (3) - 1 freq
contenna (3) - 1 freq
SoundEx code - C535
continent - 19 freq
content - 119 freq
contents - 35 freq
contempt - 27 freq
contained - 11 freq
continue - 71 freq
continuin - 13 freq
commotion - 32 freq
condemn - 19 freq
countin - 13 freq
continued - 69 freq
contemptuous - 4 freq
coontin - 47 freq
chauntin - 3 freq
cantiness - 2 freq
contain - 15 freq
contemplate - 8 freq
countenance - 3 freq
condemned - 22 freq
conteins - 2 freq
continents - 9 freq
contenders - 1 freq
cuntin - 1 freq
contend - 4 freq
canteen - 20 freq
contineed - 1 freq
'continue - 1 freq
continental - 10 freq
conteenwal - 1 freq
contint - 1 freq
contemporary - 29 freq
continues - 32 freq
commitment - 52 freq
coontenance - 3 freq
condense - 2 freq
condammt - 1 freq
contemplation - 8 freq
chantin - 28 freq
centenary - 8 freq
continuation - 2 freq
contented - 8 freq
contentit - 24 freq
contemplative - 1 freq
contentment - 5 freq
contint's - 1 freq
cummetmint's - 1 freq
cummedien - 1 freq
comitmint's - 1 freq
cuntain - 1 freq
contented-like - 1 freq
canadian - 10 freq
comedian - 5 freq
contemplated - 4 freq
continuous - 12 freq
contemplatit - 3 freq
condone - 1 freq
contaminated - 1 freq
comedians - 7 freq
contemplatin - 8 freq
countdown - 1 freq
contaminate - 3 freq
containers - 2 freq
contemplates - 2 freq
coontin's - 1 freq
contints - 1 freq
containin - 7 freq
continuing - 5 freq
continuan - 1 freq
contemptuously - 6 freq
contemporarie - 4 freq
cantons - 1 freq
canton - 2 freq
container - 6 freq
contentious - 1 freq
continually - 6 freq
contains - 13 freq
conteinuitie - 1 freq
continuum - 10 freq
commïttin - 2 freq
conteens - 3 freq
countence - 1 freq
containues - 1 freq
contentedly - 1 freq
committin - 2 freq
coundna - 1 freq
conteened - 3 freq
condensation - 2 freq
condenser - 1 freq
conteenual - 1 freq
continuity - 6 freq
cantonese - 3 freq
commitments - 36 freq
contemporar - 11 freq
cantankerous - 3 freq
coontan - 3 freq
condensed - 5 freq
contened - 1 freq
contined - 4 freq
contenin - 1 freq
containment - 2 freq
contamienaetion - 1 freq
counting - 4 freq
condoned - 1 freq
condimunts - 1 freq
committment - 6 freq
committin' - 1 freq
committments - 4 freq
containt - 2 freq
contumacious - 2 freq
conteen - 3 freq
countan - 1 freq
conteenue - 2 freq
conteenuous - 1 freq
cantin - 4 freq
conteenas - 3 freq
commeitment - 3 freq
chauntan - 1 freq
conteined - 2 freq
conteint - 1 freq
contender - 1 freq
conteenin - 2 freq
centimetres - 2 freq
contending - 1 freq
contemporaneous - 1 freq
condoms - 3 freq
contamination - 2 freq
canadiane - 2 freq
condemnations - 1 freq
conteena - 3 freq
continewit - 1 freq
conteenuum - 1 freq
commutin - 1 freq
centimetre - 1 freq
commadin - 1 freq
conteenually - 1 freq
continua - 1 freq
chanting - 3 freq
contantly - 1 freq
contemporars - 3 freq
conteenye - 1 freq
conteenyed - 5 freq
condamnin - 1 freq
committing - 2 freq
contemplating - 2 freq
‘commitment - 1 freq
contenna - 1 freq
condemnatory - 1 freq
contaminatit - 1 freq
coantents - 1 freq
commeetments - 1 freq
coontdoon - 2 freq
contemption - 1 freq
“coonting - 1 freq
coonting - 4 freq
contentions - 1 freq
camden - 1 freq
centenaries - 1 freq
continuit - 1 freq
continuallie - 1 freq
coontins - 1 freq
conteenued - 1 freq
“chantan - 1 freq
chantan - 4 freq
contemprir - 3 freq
chindian - 1 freq
condemnation - 1 freq
contemporaries - 1 freq
candomin - 1 freq
continual - 2 freq
condom - 1 freq
'commotion' - 1 freq
continuously - 1 freq
cmtnaogs - 1 freq
contentbible - 1 freq
MetaPhone code - KNTNT
content - 119 freq
contained - 11 freq
continued - 69 freq
contend - 4 freq
contineed - 1 freq
contint - 1 freq
conteinuitie - 1 freq
conteened - 3 freq
continuity - 6 freq
contened - 1 freq
contined - 4 freq
condoned - 1 freq
containt - 2 freq
conteined - 2 freq
conteint - 1 freq
continuit - 1 freq
conteenued - 1 freq
Manually curated - CONTENT
Time to execute Levenshtein function - 0.192595 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
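As a rough illustration of that definition, here is a minimal Python sketch of the standard dynamic-programming edit distance. It is not the code this site runs; the function name `levenshtein` and the unit cost for every operation are assumptions made for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance where each insert, delete or replace costs 1 (assumed costs)."""
    # previous[j] = distance between the prefix of a seen so far and the first j chars of b
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]  # turning i characters of a into an empty string takes i deletions
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # delete ch_a
                current[j - 1] + 1,      # insert ch_b
                previous[j - 1] + cost,  # replace ch_a with ch_b, or keep it
            ))
        previous = current
    return previous[-1]


if __name__ == "__main__":
    for word in ("content", "contest", "contents", "intent"):
        print(word, levenshtein("content", word))
```

With these costs, "contest" and "contents" come out at distance 1 from "content" and "intent" at distance 2, in line with the Levenshtein listing on this page.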
Time to execute Double Levenshtein function - 0.365768 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the words as written and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
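A hedged Python sketch of that idea follows. The vowel set (a, e, i, o, u only), the helper names `strip_vowels` and `double_levenshtein`, and the unit edit costs are all assumptions for illustration; the site's own implementation may differ, for instance in how it treats "y" or accented letters.

```python
VOWELS = set("aeiou")  # assumption: only these five letters count as vowels


def levenshtein(a: str, b: str) -> int:
    """Plain edit distance with unit costs for insert, delete and replace."""
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(previous[j] + 1, current[j - 1] + 1, previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Return the word with the assumed vowels removed, leaving its consonant skeleton."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


if __name__ == "__main__":
    for word in ("contint", "contest", "contact"):
        print(word, double_levenshtein("content", word))
```

Under these assumptions "contint" scores 1 (only a vowel differs), "contest" scores 2 and "contact" scores 3, which agrees with the Double Levenshtein listing on this page: a changed consonant is counted in both passes, a changed vowel in only one.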
Time to execute SoundEx function - 0.028788 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
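For readers who want to see how such a code is derived, below is a small Python sketch of the common American Soundex rules: keep the first letter, map the remaining consonants to digits, skip vowels, collapse adjacent letters with the same digit, and pad or truncate to four characters. The function name `soundex` and the choice to ignore non-ASCII characters and punctuation are assumptions for this example, not a description of the site's code.

```python
# Digit groups used by American Soundex
_CODES = {}
for _letters, _digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                         ("l", "4"), ("mn", "5"), ("r", "6")):
    for _ch in _letters:
        _CODES[_ch] = _digit


def soundex(word: str) -> str:
    """Return a four-character Soundex code, or '' if the word has no ASCII letters."""
    # Assumption: anything outside a-z (quotes, accented letters) is simply dropped
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    code = letters[0].upper()
    last = _CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate two letters that share a digit
        digit = _CODES.get(ch, "")  # vowels and y map to ''
        if digit and digit != last:
            code += digit
            if len(code) == 4:
                break
        last = digit  # a vowel resets this, so a repeated digit after it is kept
    return code.ljust(4, "0")


if __name__ == "__main__":
    for word in ("content", "commotion", "canadian"):
        print(word, soundex(word))
```

All three example words encode to C535, which is why words as different as "commotion" and "canadian" appear in the SoundEx listing for "content".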
Time to execute MetaPhone function - 0.038176 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000932 milliseconds
Manual Curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
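A lookup of this kind can be sketched in a few lines of Python. The table name `LEXICON`, the function `lemma_for`, and every entry other than "content" mapping to "CONTENT" are hypothetical, invented here purely to show the shape of the data; the real lexicon's contents are not shown on this page.

```python
from typing import Optional

# Hypothetical sample entries; only content -> CONTENT is taken from the output above
LEXICON = {
    "content": "CONTENT",
    "contint": "CONTENT",   # hypothetical spelling-variant entry
    "contents": "CONTENT",  # hypothetical inflected-form entry
}


def lemma_for(word: str) -> Optional[str]:
    """Return the curated lemma for a word, or None when the word is not covered."""
    return LEXICON.get(word.lower())


if __name__ == "__main__":
    print(lemma_for("content"))
    print(lemma_for("sonsie"))  # not in the sample table, so None
```

A plain dictionary lookup does no string comparison across the corpus, which fits with it being by far the fastest of the five methods timed above.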