A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to madainn in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
madainn (0) - 1 freq
modairn (2) - 7 freq
sasainn (2) - 1 freq
maginn (2) - 1 freq
maain (2) - 2 freq
martins (3) - 2 freq
majin (3) - 1 freq
undain (3) - 1 freq
markin (3) - 17 freq
magink (3) - 1 freq
dain' (3) - 2 freq
martian (3) - 7 freq
again (3) - 2708 freq
medsin (3) - 1 freq
margin (3) - 3 freq
dinn (3) - 6 freq
ma-an (3) - 4 freq
daddin (3) - 1 freq
markins (3) - 4 freq
padair (3) - 8 freq
mankind (3) - 13 freq
mavin (3) - 1 freq
massin (3) - 2 freq
madass (3) - 2 freq
mains (3) - 59 freq
madainn (0) - 1 freq
madonna (3) - 4 freq
maginn (3) - 1 freq
modairn (3) - 7 freq
amadan (4) - 1 freq
mcann (4) - 1 freq
midian (4) - 1 freq
mainin (4) - 3 freq
maddnin (4) - 1 freq
madmen (4) - 1 freq
madden (4) - 3 freq
madigan (4) - 1 freq
maudlin (4) - 1 freq
mann (4) - 15 freq
mydnins (4) - 1 freq
mannin (4) - 1 freq
madeline (4) - 1 freq
dann (4) - 9 freq
mydins (4) - 1 freq
mawnin (4) - 5 freq
madkeen (4) - 1 freq
madman (4) - 7 freq
sasainn (4) - 1 freq
dinn (4) - 6 freq
medsin (4) - 1 freq
SoundEx code - M350
midden - 92 freq
meetin - 148 freq
maiden - 18 freq
madam - 19 freq
motion - 47 freq
mootin - 1 freq
mutton - 24 freq
midden' - 2 freq
medium - 71 freq
matin - 5 freq
midtown - 1 freq
matinee - 4 freq
midtoun - 1 freq
meeteen - 5 freq
meeten - 1 freq
motien - 2 freq
meetin' - 1 freq
madainn - 1 freq
meet'n - 1 freq
modem - 4 freq
meetan - 6 freq
madonna - 4 freq
madame - 32 freq
matthan - 4 freq
mithna - 2 freq
mdn - 1 freq
mitten - 15 freq
midian - 1 freq
mowten - 1 freq
middeen - 2 freq
'modem' - 1 freq
mautioun - 1 freq
maetin - 3 freq
muttin - 3 freq
mahatma - 1 freq
madden - 3 freq
moothin - 3 freq
mideen - 1 freq
medna - 1 freq
mettin - 2 freq
mitton - 1 freq
meytime - 1 freq
maitin - 1 freq
methane - 12 freq
maiden' - 1 freq
mdma - 3 freq
mutiny - 1 freq
modena - 1 freq
'mitten' - 2 freq
mtm - 1 freq
mtn - 1 freq
meetin” - 1 freq
MetaPhone code - MTN
midden - 92 freq
meetin - 148 freq
maiden - 18 freq
mootin - 1 freq
mutton - 24 freq
midden' - 2 freq
matin - 5 freq
matinee - 4 freq
meeteen - 5 freq
meeten - 1 freq
motien - 2 freq
meetin' - 1 freq
madainn - 1 freq
meet'n - 1 freq
meetan - 6 freq
madonna - 4 freq
matthan - 4 freq
mdn - 1 freq
mitten - 15 freq
midian - 1 freq
mowten - 1 freq
middeen - 2 freq
maetin - 3 freq
muttin - 3 freq
madden - 3 freq
mideen - 1 freq
medna - 1 freq
mettin - 2 freq
mitton - 1 freq
maitin - 1 freq
maiden' - 1 freq
mutiny - 1 freq
modena - 1 freq
'mitten' - 2 freq
mtn - 1 freq
meetin” - 1 freq
MADAINN
Time to execute Levenshtein function - 0.581190 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 1.187547 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.103095 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.111719 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000905 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.