A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to maiden in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
maiden (0) - 18 freq
madden (1) - 3 freq
maiden' (1) - 1 freq
laiden (1) - 2 freq
maide (1) - 46 freq
raiden (1) - 2 freq
maidens (1) - 8 freq
maen (2) - 5 freq
aden (2) - 2 freq
paiyen (2) - 1 freq
camden (2) - 1 freq
marten (2) - 1 freq
taiken (2) - 7 freq
made (2) - 2115 freq
raiken (2) - 1 freq
maaken (2) - 1 freq
handen (2) - 1 freq
warden (2) - 6 freq
maiden's (2) - 1 freq
maidens' (2) - 1 freq
madder (2) - 5 freq
baidan (2) - 1 freq
bairen (2) - 1 freq
mailed (2) - 2 freq
madey (2) - 1 freq
maiden (0) - 18 freq
maidens (2) - 8 freq
mideen (2) - 1 freq
maide (2) - 46 freq
raiden (2) - 2 freq
madden (2) - 3 freq
maiden' (2) - 1 freq
laiden (2) - 2 freq
hauden (3) - 49 freq
makeen (3) - 1 freq
laden (3) - 13 freq
minen (3) - 1 freq
madmen (3) - 1 freq
maikin (3) - 1 freq
maitin (3) - 1 freq
widen (3) - 13 freq
aidan (3) - 5 freq
wuiden (3) - 24 freq
suiden (3) - 1 freq
midden (3) - 92 freq
maine (3) - 1 freq
maison (3) - 4 freq
maid (3) - 94 freq
mainin (3) - 3 freq
heiden (3) - 1 freq
SoundEx code - M350
midden - 92 freq
meetin - 148 freq
maiden - 18 freq
madam - 19 freq
motion - 47 freq
mootin - 1 freq
mutton - 24 freq
midden' - 2 freq
medium - 71 freq
matin - 5 freq
midtown - 1 freq
matinee - 4 freq
midtoun - 1 freq
meeteen - 5 freq
meeten - 1 freq
motien - 2 freq
meetin' - 1 freq
madainn - 1 freq
meet'n - 1 freq
modem - 4 freq
meetan - 6 freq
madonna - 4 freq
madame - 32 freq
matthan - 4 freq
mithna - 2 freq
mdn - 1 freq
mitten - 15 freq
midian - 1 freq
mowten - 1 freq
middeen - 2 freq
'modem' - 1 freq
mautioun - 1 freq
maetin - 3 freq
muttin - 3 freq
mahatma - 1 freq
madden - 3 freq
moothin - 3 freq
mideen - 1 freq
medna - 1 freq
mettin - 2 freq
mitton - 1 freq
meytime - 1 freq
maitin - 1 freq
methane - 12 freq
maiden' - 1 freq
mdma - 3 freq
mutiny - 1 freq
modena - 1 freq
'mitten' - 2 freq
mtm - 1 freq
mtn - 1 freq
meetin” - 1 freq
MetaPhone code - MTN
midden - 92 freq
meetin - 148 freq
maiden - 18 freq
mootin - 1 freq
mutton - 24 freq
midden' - 2 freq
matin - 5 freq
matinee - 4 freq
meeteen - 5 freq
meeten - 1 freq
motien - 2 freq
meetin' - 1 freq
madainn - 1 freq
meet'n - 1 freq
meetan - 6 freq
madonna - 4 freq
matthan - 4 freq
mdn - 1 freq
mitten - 15 freq
midian - 1 freq
mowten - 1 freq
middeen - 2 freq
maetin - 3 freq
muttin - 3 freq
madden - 3 freq
mideen - 1 freq
medna - 1 freq
mettin - 2 freq
mitton - 1 freq
maitin - 1 freq
maiden' - 1 freq
mutiny - 1 freq
modena - 1 freq
'mitten' - 2 freq
mtn - 1 freq
meetin” - 1 freq
MAIDEN
Time to execute Levenshtein function - 0.213049 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.403950 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.034435 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.041870 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000903 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.