A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to latin in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
latin (0) - 92 freq
hatin (1) - 2 freq
eatin (1) - 154 freq
lakin (1) - 1 freq
layin (1) - 45 freq
laitin (1) - 19 freq
lain (1) - 18 freq
lacin (1) - 1 freq
lawin (1) - 2 freq
latyn (1) - 1 freq
matin (1) - 5 freq
atin (1) - 3 freq
lastin (1) - 9 freq
lattin (1) - 53 freq
satin (1) - 12 freq
laein (1) - 9 freq
latib (1) - 1 freq
lazin (1) - 4 freq
ratin (1) - 1 freq
slatin (1) - 2 freq
lapin (1) - 1 freq
ladin (1) - 1 freq
datin (1) - 11 freq
aatin (1) - 1 freq
batin (1) - 4 freq
latin (0) - 92 freq
latyn (1) - 1 freq
laitin (1) - 19 freq
batin (2) - 4 freq
lapin (2) - 1 freq
aatin (2) - 1 freq
datin (2) - 11 freq
lavin (2) - 2 freq
slatin (2) - 2 freq
ladin (2) - 1 freq
eltin (2) - 1 freq
lootin (2) - 4 freq
lotion (2) - 4 freq
leetin (2) - 5 freq
loutin (2) - 3 freq
luton (2) - 2 freq
ratin (2) - 1 freq
elation (2) - 3 freq
lain (2) - 18 freq
lacin (2) - 1 freq
lazin (2) - 4 freq
lakin (2) - 1 freq
hatin (2) - 2 freq
eatin (2) - 154 freq
lawin (2) - 2 freq
SoundEx code - L350
loadin - 5 freq
lettin - 91 freq
lattin - 53 freq
latin - 92 freq
leadin - 66 freq
lowdin - 1 freq
leetin - 5 freq
laedin - 1 freq
lowden - 12 freq
lothian - 81 freq
littin - 10 freq
leiden - 3 freq
loudoun - 18 freq
laden - 13 freq
leadan - 5 freq
lettin' - 2 freq
let-doon - 1 freq
let'm - 1 freq
loathin - 4 freq
loddan - 1 freq
lotion - 4 freq
luttin - 8 freq
lat-doon - 1 freq
lettan - 1 freq
looten - 1 freq
lootin - 4 freq
laudin - 1 freq
leadin' - 3 freq
lutten - 2 freq
'luttin - 1 freq
litany - 1 freq
lydon - 1 freq
latten - 11 freq
litten - 2 freq
loutin - 3 freq
lowtin - 1 freq
letham - 6 freq
laitin - 19 freq
lowdoun - 1 freq
leitanie - 1 freq
latyn - 1 freq
lutton - 1 freq
leaden - 1 freq
ladin - 1 freq
€œleadin - 1 freq
lithuania - 1 freq
ladny - 1 freq
laidin - 2 freq
leatham - 9 freq
laiden - 2 freq
ladym - 2 freq
luton - 2 freq
lettinÂ’ - 1 freq
ludm - 1 freq
leadinÂ’ - 1 freq
MetaPhone code - LTN
loadin - 5 freq
lettin - 91 freq
lattin - 53 freq
latin - 92 freq
leadin - 66 freq
lowdin - 1 freq
leetin - 5 freq
laedin - 1 freq
lowden - 12 freq
littin - 10 freq
leiden - 3 freq
loudoun - 18 freq
laden - 13 freq
leadan - 5 freq
lettin' - 2 freq
loddan - 1 freq
luttin - 8 freq
lettan - 1 freq
looten - 1 freq
lootin - 4 freq
laudin - 1 freq
leadin' - 3 freq
lutten - 2 freq
'luttin - 1 freq
litany - 1 freq
lydon - 1 freq
latten - 11 freq
litten - 2 freq
loutin - 3 freq
lowtin - 1 freq
laitin - 19 freq
lowdoun - 1 freq
leitanie - 1 freq
latyn - 1 freq
lutton - 1 freq
leaden - 1 freq
ladin - 1 freq
€œleadin - 1 freq
ladny - 1 freq
laidin - 2 freq
laiden - 2 freq
luton - 2 freq
lettinÂ’ - 1 freq
hldnn - 1 freq
leadinÂ’ - 1 freq
LATIN
Time to execute Levenshtein function - 0.171328 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.334218 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028523 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.039600 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000806 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.