A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to maha in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
maha (0) - 3 freq
maa (1) - 16 freq
maya (1) - 2 freq
mama (1) - 2 freq
matha (1) - 2 freq
maga (1) - 4 freq
mahal (1) - 3 freq
mah (1) - 379 freq
maaa (1) - 4 freq
'aha (1) - 1 freq
aha (1) - 8 freq
haha (1) - 66 freq
bah (2) - 4 freq
daia (2) - 2 freq
mona (2) - 7 freq
mla (2) - 1 freq
mesha (2) - 1 freq
ah- (2) - 4 freq
mats (2) - 8 freq
marv (2) - 3 freq
man- (2) - 10 freq
mawe (2) - 6 freq
tah (2) - 2 freq
maed (2) - 38 freq
mang (2) - 17 freq
maha (0) - 3 freq
mah (1) - 379 freq
mhu (2) - 1 freq
haha (2) - 66 freq
mh (2) - 3 freq
meh (2) - 192 freq
umah (2) - 1 freq
aha (2) - 8 freq
'aha (2) - 1 freq
maya (2) - 2 freq
maa (2) - 16 freq
mama (2) - 2 freq
matha (2) - 2 freq
maaa (2) - 4 freq
mahal (2) - 3 freq
maga (2) - 4 freq
mash (3) - 14 freq
mao (3) - 2 freq
mauk (3) - 5 freq
may (3) - 449 freq
aah (3) - 6 freq
macao (3) - 1 freq
uha (3) - 2 freq
mhj (3) - 1 freq
mma (3) - 3 freq
SoundEx code - M000
ma - 15179 freq
me - 12981 freq
may - 449 freq
mou - 178 freq
my - 2963 freq
maw - 392 freq
'ma - 79 freq
me' - 17 freq
moo - 175 freq
mi - 246 freq
me-ah - 1 freq
m- - 8 freq
m - 847 freq
mah - 379 freq
mo - 32 freq
'my - 41 freq
'm - 18 freq
''m - 8 freq
'me - 15 freq
mu - 11 freq
mea - 5 freq
mn - 8 freq
mmm - 21 freq
moo' - 1 freq
mawhi - 1 freq
mey - 87 freq
'moo' - 1 freq
'mah - 2 freq
'may - 2 freq
mm - 19 freq
meeeee - 1 freq
'mmm - 1 freq
'm'a - 1 freq
mae - 361 freq
mei - 86 freq
mma - 3 freq
mie - 3 freq
mee - 23 freq
maa - 16 freq
'mm - 1 freq
ma' - 5 freq
'maw - 2 freq
moi - 2 freq
moe - 2 freq
mia - 8 freq
mmmm - 7 freq
meh - 192 freq
'meh - 1 freq
maaa - 4 freq
m'ma - 2 freq
mai - 4 freq
maya - 2 freq
mew - 2 freq
mön - 10 freq
mow - 4 freq
mayo - 3 freq
®ma - 1 freq
mooo - 2 freq
mmm-mmm-mmm-m'mmm-my - 2 freq
miaow - 1 freq
'mo - 3 freq
mueh - 1 freq
™mo - 2 freq
ˆm - 2 freq
›m - 2 freq
mowe - 1 freq
mi' - 1 freq
'me' - 1 freq
may' - 1 freq
€™m - 1287 freq
€˜maw - 1 freq
meow - 3 freq
€œma - 17 freq
€œmi - 1 freq
€˜my - 8 freq
ðém - 1 freq
€œmy - 20 freq
€˜ma - 8 freq
mawe - 6 freq
€œmey - 1 freq
€˜m - 4 freq
€œmmmm - 1 freq
€œme - 10 freq
mae- - 2 freq
€˜me - 7 freq
meo - 1 freq
meeeh - 3 freq
mw - 2 freq
€™ma - 4 freq
€œmmmmm - 2 freq
my- - 1 freq
€œmooooo - 1 freq
maha - 3 freq
€™me - 3 freq
€œm - 1 freq
€œmeh - 1 freq
moa - 20 freq
me- - 1 freq
àm - 1 freq
m'n - 3 freq
m' - 1 freq
mowi - 1 freq
mooie - 3 freq
myo - 1 freq
mye - 6 freq
mui - 1 freq
mhu - 1 freq
mao - 2 freq
mh - 3 freq
mhw - 1 freq
'maw' - 1 freq
'ma' - 1 freq
meeeeeeeeee - 1 freq
mmmmm - 1 freq
meÂ’ - 1 freq
maw” - 1 freq
mwh - 1 freq
mmw - 1 freq
“me - 1 freq
muu - 2 freq
'mae - 1 freq
‘my - 1 freq
'may' - 1 freq
meeee - 1 freq
MetaPhone code - MH
mawhi - 1 freq
maha - 3 freq
mhu - 1 freq
MAHA
Time to execute Levenshtein function - 0.212318 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.383816 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028575 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.075533 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000842 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.