A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

Similar words to caledonia in Corpus

Results by method: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated
Levenshtein (distance in brackets)
caledonia (0) - 16 freq
caledonia' (1) - 1 freq
caledonian (1) - 10 freq
macedonia (2) - 5 freq
caledons (2) - 2 freq
caledones (2) - 1 freq
“caledonia (2) - 1 freq
caledonians (2) - 2 freq
caledonia's (2) - 1 freq
caladonian (2) - 1 freq
cameronian (3) - 3 freq
calidon (3) - 1 freq
laberdonia (3) - 5 freq
cadona (3) - 6 freq
cydonia (3) - 2 freq
california (3) - 9 freq
catalonia (3) - 8 freq
cameronians (4) - 3 freq
catalunia (4) - 1 freq
madonna (4) - 4 freq
cannonba (4) - 1 freq
camelon (4) - 7 freq
cafeteria (4) - 1 freq
calendula (4) - 1 freq
calmdoon (4) - 1 freq
Double Levenshtein (combined distance in brackets)
caledonia (0) - 16 freq
caledonian (2) - 10 freq
caledonia' (2) - 1 freq
calidon (3) - 1 freq
caledones (3) - 1 freq
caledons (3) - 2 freq
caladonian (3) - 1 freq
culdna (4) - 34 freq
cydonia (4) - 2 freq
cadona (4) - 6 freq
macedonia (4) - 5 freq
“caledonia (4) - 1 freq
caledonians (4) - 2 freq
caledonia's (4) - 1 freq
haldon (5) - 1 freq
goldoni (5) - 8 freq
cleanin (5) - 48 freq
clonin (5) - 2 freq
calton (5) - 10 freq
culdnae (5) - 2 freq
cweedna (5) - 2 freq
cleedin (5) - 1 freq
couldni (5) - 7 freq
couldna (5) - 414 freq
cleidin (5) - 3 freq
SoundEx code - C435
couldna - 414 freq
couldnae - 623 freq
couldnae've - 4 freq
couldnue - 1 freq
cleidin - 3 freq
collidin - 1 freq
couldn't - 18 freq
clootin - 2 freq
caledonia's - 1 freq
clautin - 1 freq
caledonia' - 1 freq
culloden - 13 freq
caledonian - 10 freq
culdna - 34 freq
could'na - 1 freq
couldna'v - 1 freq
cloodin - 3 freq
clattin - 1 freq
couldny - 13 freq
cloddin - 8 freq
calton - 10 freq
could'no - 1 freq
couldno - 5 freq
caledonians - 2 freq
claithe'n - 1 freq
cooldness - 1 freq
caledons - 2 freq
clytemnestra's - 1 freq
clood-man - 1 freq
clothin - 2 freq
caledonia - 16 freq
claddeen - 1 freq
cauldnes - 1 freq
claithin - 1 freq
cauldness - 6 freq
cooldna - 1 freq
claddin - 1 freq
culdnae - 2 freq
calidon - 1 freq
chilton - 1 freq
caledones - 1 freq
coouldn - 1 freq
cuildna - 1 freq
clotting - 1 freq
clothing - 4 freq
clood-hentin - 1 freq
couldn - 3 freq
“couldni - 1 freq
couldni - 7 freq
couldni' - 1 freq
“culloden - 1 freq
cleedin - 1 freq
clathin - 1 freq
“caledonia - 1 freq
“couldn - 1 freq
colluding - 1 freq
coldness - 2 freq
coalition - 8 freq
coalitions - 1 freq
cloutin - 1 freq
cuildnae - 1 freq
caldamac - 6 freq
caladonian - 1 freq
claething - 1 freq
couldnay - 1 freq
couldn’t - 1 freq
colytonwildlife - 1 freq
couldnt - 1 freq
child-hunting - 1 freq
MetaPhone code - KLTN
couldna - 414 freq
golden - 87 freq
couldnae - 623 freq
couldnue - 1 freq
cleidin - 3 freq
collidin - 1 freq
clootin - 2 freq
clautin - 1 freq
caledonia' - 1 freq
glidin - 7 freq
culloden - 13 freq
goulden - 2 freq
culdna - 34 freq
gloatin - 5 freq
could'na - 1 freq
guillotine - 4 freq
goolden - 10 freq
cloodin - 3 freq
clattin - 1 freq
guilden - 1 freq
couldny - 13 freq
cloddin - 8 freq
calton - 10 freq
could'no - 1 freq
couldno - 5 freq
glutton - 5 freq
caledonia - 16 freq
glidan - 1 freq
claddeen - 1 freq
glydan - 1 freq
gowlden - 1 freq
glettan - 1 freq
gleetin - 2 freq
glaidden - 1 freq
cooldna - 1 freq
claddin - 1 freq
goldoni - 8 freq
gluttony - 3 freq
culdnae - 2 freq
gledden - 1 freq
calidon - 1 freq
gluten - 1 freq
coouldn - 1 freq
cuildna - 1 freq
couldn - 3 freq
“couldni - 1 freq
couldni - 7 freq
couldni' - 1 freq
“culloden - 1 freq
cleedin - 1 freq
“caledonia - 1 freq
“couldn - 1 freq
cloutin - 1 freq
cuildnae - 1 freq
quiltin’ - 1 freq
couldnay - 1 freq
quiltin - 1 freq
'golden - 1 freq
goldin - 1 freq
Manually curated - CALEDONIA
Time to execute Levenshtein function - 0.233676 milliseconds
The Levenshtein distance is the minimum number of single-character replacements, insertions or deletions needed to transform one word into another. It's useful for detecting typos and alternative spellings.
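As a rough illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming approach. It is not the code this site runs; the function name `levenshtein` is chosen here for illustration, and it counts Unicode characters rather than bytes, which may differ from the site's own function on words containing non-ASCII characters.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits turning a into b."""
    # Keep the shorter word in the inner loop so each row stays small.
    if len(a) < len(b):
        a, b = b, a
    # previous[j] holds the distance between the prefix of a processed so far
    # and the first j characters of b. Row 0 is "insert j characters".
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]  # deleting the first i characters of a
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(
                previous[j] + 1,         # delete char_a
                current[j - 1] + 1,      # insert char_b
                previous[j - 1] + cost,  # replace char_a (or keep it)
            ))
        previous = current
    return previous[-1]


if __name__ == "__main__":
    for word in ("caledonian", "macedonia", "cadona", "california"):
        print(word, levenshtein("caledonia", word))
```

Run against a few of the neighbours listed for caledonia, this should reproduce the distances shown in brackets in the Levenshtein results.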
Time to execute Double Levenshtein function - 0.424230 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
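The idea can be sketched in Python as follows. This is an illustrative reconstruction, not the site's code: the names `strip_vowels` and `double_levenshtein` are hypothetical, and treating only a, e, i, o and u as vowels is an assumption.

```python
VOWELS = set("aeiou")  # assumption: the site's exact vowel set is not stated


def levenshtein(a: str, b: str) -> int:
    """Standard edit distance (insert, delete, replace), row by row."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop vowels so that only the consonant skeleton remains."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


if __name__ == "__main__":
    for word in ("caledonian", "macedonia", "cadona", "couldna"):
        print(word, double_levenshtein("caledonia", word))
```

Under these assumptions, couldna scores 5: its consonant skeleton (cldn) is identical to that of caledonia, so only the full-word distance contributes.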
Time to execute SoundEx function - 0.037589 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
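For readers who want to see how such a code is built, below is a minimal Python sketch of the classic American Soundex rules: keep the first letter, map the following consonants to digits, skip vowels, collapse adjacent repeats, and pad to four characters. The function name `soundex` and the handling of non-letters are choices made here; whether the site's function behaves identically in every edge case is an assumption.

```python
# Consonant groups and their Soundex digits.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for letter in letters:
        CODES[letter] = digit


def soundex(word: str) -> str:
    """Return a four-character Soundex code, or '' if there are no letters."""
    # Assumption: anything that is not an ASCII letter is ignored.
    letters = [ch for ch in word.lower() if ch.isascii() and ch.isalpha()]
    if not letters:
        return ""
    code = letters[0].upper()
    last = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate two letters with the same digit
        digit = CODES.get(ch, "")
        if digit and digit != last:
            code += digit
            if len(code) == 4:
                break
        last = digit  # a vowel resets 'last', so a repeated digit counts again
    return code.ljust(4, "0")


if __name__ == "__main__":
    for word in ("caledonia", "couldna", "culloden", "coalition"):
        print(word, soundex(word))
```

All four sample words come out as C435, which is why forms as different as couldna and coalition appear together in the SoundEx results.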
Time to execute MetaPhone function - 0.038431 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000970 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
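A lookup of this kind might look like the Python sketch below. The entries in `LEXICON` and the name `curated_lemma` are hypothetical and serve only to show the shape of the table; the site's real lexicon is not reproduced here.

```python
from typing import Optional

# Hypothetical entries for illustration only: each surface form maps to a lemma.
LEXICON = {
    "caledonia": "CALEDONIA",
    "caledonia's": "CALEDONIA",
    "caledonia'": "CALEDONIA",
}


def curated_lemma(word: str) -> Optional[str]:
    """Return the hand-assigned lemma, or None when the word is not covered."""
    return LEXICON.get(word.strip().lower())


if __name__ == "__main__":
    print(curated_lemma("Caledonia's"))
    print(curated_lemma("ahint"))  # not in this toy table
```

Returning `None` for uncovered words keeps the gap in coverage visible, so a caller can fall back to one of the automatic methods.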