A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to countries in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
countries (0) - 36 freq
countrie' (1) - 1 freq
counties (1) - 2 freq
countrie (1) - 8 freq
coonties (2) - 5 freq
contrives (2) - 1 freq
bounties (2) - 2 freq
counties' (2) - 1 freq
courries (2) - 1 freq
country's (2) - 12 freq
foundries (2) - 1 freq
conrie (3) - 1 freq
courses (3) - 13 freq
controls (3) - 5 freq
chanties (3) - 1 freq
couriet (3) - 1 freq
courtier (3) - 1 freq
chuntie (3) - 4 freq
curries (3) - 7 freq
couttie's (3) - 3 freq
countin (3) - 13 freq
councils (3) - 3 freq
counts (3) - 8 freq
courtiers (3) - 16 freq
contrived (3) - 4 freq
countries (0) - 36 freq
counties (2) - 2 freq
countrie (2) - 8 freq
countrie' (2) - 1 freq
centres (3) - 38 freq
centeries (3) - 1 freq
cuntras (3) - 4 freq
country's (3) - 12 freq
coonties (3) - 5 freq
centuries (3) - 83 freq
contrives (3) - 1 freq
cuntre (4) - 2 freq
intries (4) - 1 freq
conteins (4) - 2 freq
contrite (4) - 1 freq
coonters (4) - 1 freq
kintries (4) - 4 freq
sentries (4) - 3 freq
gantries (4) - 1 freq
conters (4) - 1 freq
contraks (4) - 1 freq
count's (4) - 2 freq
country' (4) - 1 freq
cottaries (4) - 1 freq
contre (4) - 1 freq
SoundEx code - C536
canterin - 1 freq
central - 130 freq
country - 299 freq
cantrips - 23 freq
contrair - 8 freq
centre - 275 freq
contreibutions - 2 freq
contreibutor - 3 freq
century - 290 freq
contraction - 8 freq
canterbury - 10 freq
contradicted - 3 freq
chunter - 1 freq
contracted - 12 freq
countries - 36 freq
control - 129 freq
contrar - 10 freq
chanter - 14 freq
centres - 38 freq
counter - 20 freq
centre-stage - 2 freq
coonter - 72 freq
contermynit - 1 freq
cantrip - 7 freq
chunterin - 3 freq
cemetery - 32 freq
counterpart - 3 freq
contribution' - 1 freq
ceemetrie - 1 freq
counterfeitit - 1 freq
conter - 47 freq
centuries - 83 freq
counterfeit - 1 freq
contraflow - 1 freq
countryside - 43 freq
cintre - 3 freq
contrack - 3 freq
contribution - 38 freq
contraption - 8 freq
canters - 1 freq
contermacious - 5 freq
contradiction - 6 freq
country's - 12 freq
contrition - 1 freq
contrite - 1 freq
contribute - 21 freq
contracks - 4 freq
cantraips - 2 freq
cuntraptien - 3 freq
cuntroal - 1 freq
contrary - 7 freq
contrast - 24 freq
controversial - 5 freq
chanters - 9 freq
cooooouuuntry - 1 freq
contributes - 5 freq
cinderella - 34 freq
centred - 6 freq
conterin - 7 freq
conthrairy - 1 freq
cinderellae - 5 freq
countrie - 8 freq
contractit - 1 freq
contracts - 2 freq
center - 9 freq
cinderella's - 3 freq
contributed - 7 freq
countrymen - 1 freq
contradict - 3 freq
centrally-heatit - 1 freq
centralised - 5 freq
centurie - 16 freq
centrality - 2 freq
centralized - 1 freq
contreebute - 2 freq
contract - 26 freq
coonterpairts - 3 freq
contreebutors - 2 freq
contributor - 28 freq
contributions - 16 freq
canter - 3 freq
cinders - 7 freq
controls - 5 freq
controlled - 14 freq
contramantious - 1 freq
contradictet - 1 freq
contortions - 2 freq
«century - 1 freq
centerprece - 1 freq
contours - 4 freq
centurion - 6 freq
contrairie - 1 freq
centurion's - 2 freq
'control - 1 freq
contraptions - 4 freq
contrived - 4 freq
countrie' - 1 freq
contradictory - 5 freq
contrairiwise - 1 freq
'cemeterio - 1 freq
coonter' - 1 freq
cuntra - 29 freq
cuntras - 4 freq
cuntraside - 2 freq
cemeteries - 2 freq
candour - 1 freq
contractions - 12 freq
contortit - 2 freq
century's - 1 freq
contradictions - 2 freq
contradickit - 1 freq
coontert - 1 freq
conter-gaits - 1 freq
contrastit - 1 freq
centre' - 1 freq
center' - 1 freq
counterculture - 1 freq
counterclockwise - 1 freq
coontér - 1 freq
centèrs - 9 freq
conterdiction - 7 freq
centrin - 1 freq
contermaister - 1 freq
contrairitie - 1 freq
contermit - 4 freq
contrak - 2 freq
conters - 1 freq
centir - 1 freq
centauri - 5 freq
country-luvin - 1 freq
contraception - 2 freq
cantripin - 1 freq
cantar - 1 freq
contrasts - 8 freq
centrally - 1 freq
contributin' - 1 freq
centuries' - 1 freq
chanterin - 1 freq
counterpynts - 1 freq
contrives - 1 freq
centre-furrit - 1 freq
contradeiction - 1 freq
centrifugal - 1 freq
countriemen - 1 freq
centuries-auld - 1 freq
contrasted - 1 freq
cuntre - 2 freq
contrastive - 4 freq
cantrip-wirds - 1 freq
contergaits - 8 freq
conter't - 2 freq
canntaireachd - 1 freq
centralisation - 1 freq
contributioun - 2 freq
cantraip - 4 freq
contractor - 2 freq
coonterdichtit - 1 freq
cuntry - 2 freq
contreibutit - 2 freq
contributiouns - 1 freq
contreibutors - 2 freq
contreibutioun - 1 freq
contreibutiouns - 1 freq
contradick - 1 freq
caunterberry - 1 freq
controvertit - 4 freq
contreebutit - 2 freq
contradeictory - 1 freq
cindert - 1 freq
contermashious - 2 freq
cantor - 1 freq
coonterbalance - 1 freq
conter-creenge - 1 freq
contreibyut - 1 freq
contributit - 3 freq
centre't - 1 freq
contributin - 3 freq
centered - 1 freq
contributors - 3 freq
coonterpairt - 2 freq
cinder - 2 freq
coonter-productive - 1 freq
contractin - 1 freq
contre - 1 freq
centaurs - 2 freq
contrack-eyn - 1 freq
contraks - 1 freq
contrakkars - 1 freq
countrafait - 1 freq
contergates - 2 freq
cimiterie - 1 freq
countra - 5 freq
contradeictions - 1 freq
controversy - 3 freq
committaris - 1 freq
contreebution - 5 freq
contradicts - 1 freq
contreibute - 1 freq
commodore - 2 freq
controversie - 2 freq
contra - 2 freq
coonterin - 3 freq
counterpane - 1 freq
controllin - 4 freq
coonters - 1 freq
contorts - 1 freq
counterbalanced - 1 freq
commuter - 1 freq
coonterfeitin - 1 freq
coonterpanes - 1 freq
contrivance - 1 freq
countryfile - 1 freq
chemotherapy - 1 freq
contortin - 1 freq
contracting - 2 freq
coontèr - 1 freq
€˜control - 1 freq
contraband - 1 freq
contered - 2 freq
coonter-blast - 1 freq
centeries - 1 freq
cemetary - 2 freq
€˜comehither - 1 freq
centaur - 1 freq
centuar - 2 freq
counterpoint - 1 freq
contrapuntal - 1 freq
coonter-jihad - 1 freq
contraceptive - 3 freq
controlling - 3 freq
centuries-lang - 1 freq
centre-left - 1 freq
centre-richt - 1 freq
contermaschious - 1 freq
central-haeteen - 1 freq
central-european - 1 freq
contributing - 1 freq
contreebyeutan - 1 freq
controllers - 1 freq
coonterculture - 2 freq
centre-back - 1 freq
contractarianism - 1 freq
contractarian - 1 freq
cantrebreiniol - 1 freq
contraversial - 1 freq
coonterborin - 1 freq
chanterranter - 17 freq
chantywrastler - 2 freq
'chantiewrastler' - 1 freq
cantrol - 1 freq
countryÂ’s - 1 freq
control-freakery - 1 freq
centre's - 1 freq
centreÂ’s - 1 freq
chantywrassler - 1 freq
country' - 1 freq
canterberry - 1 freq
centraltaxised - 1 freq
cemetry - 1 freq
conturbat - 1 freq
contrarily - 1 freq
country” - 1 freq
MetaPhone code - KNTRS
gantries - 1 freq
countries - 36 freq
kintrae's - 2 freq
canters - 1 freq
kintras - 57 freq
country's - 12 freq
kintra's - 11 freq
kintraes - 8 freq
contours - 4 freq
cuntras - 4 freq
kintries - 4 freq
kintrie's - 1 freq
conters - 1 freq
gantrees - 1 freq
gantry's - 1 freq
coonters - 1 freq
countryÂ’s - 1 freq
COUNTRIES
Time to execute Levenshtein function - 0.578222 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 1.113204 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.098523 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.106483 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.071991 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.