A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow


Similar words to wm in Corpus

Levenshtein
wm (0) - 12 freq
xm (1) - 1 freq
qm (1) - 1 freq
w (1) - 190 freq
bm (1) - 5 freq
wmt (1) - 1 freq
wy (1) - 11 freq
nm (1) - 2 freq
wmc (1) - 1 freq
wa (1) - 148 freq
wh (1) - 10 freq
wf (1) - 4 freq
wq (1) - 2 freq
wn (1) - 7 freq
wc (1) - 6 freq
wme (1) - 1 freq
wu (1) - 9 freq
w' (1) - 1 freq
wk (1) - 11 freq
wx (1) - 4 freq
lm (1) - 5 freq
tm (1) - 7 freq
'm (1) - 18 freq
km (1) - 4 freq
wum (1) - 1 freq
Double Levenshtein
wm (0) - 12 freq
wme (1) - 1 freq
wam (1) - 1 freq
wum (1) - 1 freq
wmu (1) - 1 freq
wim (1) - 2 freq
wr (2) - 7 freq
wj (2) - 4 freq
sm (2) - 4 freq
gm (2) - 10 freq
hm (2) - 13 freq
m (2) - 847 freq
fm (2) - 90 freq
ws (2) - 6 freq
wd (2) - 6 freq
wo (2) - 2 freq
om (2) - 6 freq
wv (2) - 3 freq
wg (2) - 4 freq
wb (2) - 4 freq
waam (2) - 1 freq
wame (2) - 75 freq
wm- (2) - 1 freq
uiwm (2) - 1 freq
wami (2) - 1 freq
SoundEx code - W500
when - 4506 freq
win - 980 freq
wan - 2472 freq
whinny - 6 freq
wean - 424 freq
wun - 200 freq
wheen - 829 freq
winna - 384 freq
whan - 2757 freq
whein - 50 freq
won - 282 freq
wine - 272 freq
ween - 25 freq
weein - 1 freq
wame - 75 freq
'when - 19 freq
wain - 15 freq
'whan - 7 freq
wunna - 10 freq
wanae - 1 freq
whiney - 1 freq
win' - 7 freq
wane - 12 freq
waa'in - 1 freq
whin - 755 freq
'whin - 1 freq
whim - 9 freq
wham - 21 freq
wheem - 1 freq
winnie - 80 freq
wi'm - 3 freq
wayne - 6 freq
wi'in - 37 freq
whiny - 2 freq
whom - 17 freq
winnae - 46 freq
won' - 1 freq
wyne - 16 freq
weem - 11 freq
wooin - 1 freq
wim - 2 freq
'wan - 6 freq
wanna - 17 freq
whun - 92 freq
wuhan - 2 freq
wanny - 8 freq
wuin - 3 freq
winn - 1 freq
wunnae - 13 freq
wn - 7 freq
weeyin - 22 freq
wunny - 1 freq
wunne - 1 freq
whunny - 1 freq
wi'ïn - 2 freq
wwhan - 1 freq
wen - 18 freq
wona - 1 freq
whin' - 1 freq
waam - 1 freq
weeny - 2 freq
wye'in - 1 freq
wino - 2 freq
wine' - 1 freq
waim'i - 1 freq
wyme - 5 freq
wymie - 1 freq
whunnie - 1 freq
wan-wye - 1 freq
whine - 5 freq
whaen - 3 freq
winny - 7 freq
wunn - 9 freq
whaun - 5 freq
wan- - 2 freq
wan' - 2 freq
wieen - 1 freq
waen - 2 freq
waein - 1 freq
woun - 5 freq
‘when - 4 freq
wone - 4 freq
wyin - 2 freq
'winna' - 2 freq
whinney - 1 freq
“whan - 8 freq
waw-en - 1 freq
wien - 1 freq
weyin - 3 freq
“when - 4 freq
“whin - 1 freq
whain - 1 freq
‘win - 1 freq
whane - 3 freq
wyn - 1 freq
wannae - 6 freq
wine-o - 1 freq
wum - 1 freq
weenie - 1 freq
“winna - 2 freq
‘wan - 5 freq
wu-yun - 14 freq
whyn - 3 freq
whaim - 1 freq
‘wean - 1 freq
wein - 2 freq
whammy - 2 freq
wm- - 1 freq
wuni - 1 freq
wini - 2 freq
wan” - 1 freq
winae - 2 freq
wina - 1 freq
wenna - 1 freq
'winnae' - 2 freq
win- - 2 freq
-wan - 1 freq
weemo - 1 freq
wm - 12 freq
'win' - 1 freq
wooin” - 1 freq
wami - 1 freq
'wean' - 1 freq
wmu - 1 freq
wwn - 1 freq
'wheen' - 1 freq
wme - 1 freq
wean’ - 1 freq
wean' - 3 freq
wayin - 1 freq
weeeannie - 1 freq
wayno - 1 freq
'wheen - 1 freq
wam - 1 freq
MetaPhone code - M
ma - 15179 freq
me - 12981 freq
may - 449 freq
mou - 178 freq
my - 2963 freq
maw - 392 freq
'ma - 79 freq
me' - 17 freq
moo - 175 freq
mi - 246 freq
hmm - 22 freq
me-ah - 1 freq
m- - 8 freq
m - 847 freq
mbe - 5 freq
hm - 13 freq
mah - 379 freq
'hmm - 1 freq
mo - 32 freq
'my - 41 freq
'm - 18 freq
''m - 8 freq
'me - 15 freq
mu - 11 freq
mea - 5 freq
mmm - 21 freq
moo' - 1 freq
mey - 87 freq
'moo' - 1 freq
'mah - 2 freq
'may - 2 freq
mm - 19 freq
meeeee - 1 freq
'mmm - 1 freq
'm'a - 1 freq
mae - 361 freq
mei - 86 freq
mma - 3 freq
mie - 3 freq
mee - 23 freq
maa - 16 freq
'mm - 1 freq
ma' - 5 freq
'hmmm - 2 freq
'maw - 2 freq
moi - 2 freq
moe - 2 freq
mia - 8 freq
mmmm - 7 freq
meh - 192 freq
ym - 2 freq
'meh - 1 freq
maaa - 4 freq
hïm - 575 freq
mai - 4 freq
hmmmm - 4 freq
hym - 5 freq
wyme - 5 freq
mew - 2 freq
wymie - 1 freq
mow - 4 freq
®ma - 1 freq
hæm - 2 freq
mooo - 2 freq
miaow - 1 freq
'mo - 3 freq
mueh - 1 freq
™mo - 2 freq
ˆm - 2 freq
›m - 2 freq
y'm - 1 freq
mi' - 1 freq
'me' - 1 freq
may' - 1 freq
’m - 1287 freq
‘maw - 1 freq
höm - 1 freq
‘hmm - 3 freq
meow - 3 freq
“ma - 17 freq
“mi - 1 freq
‘my - 8 freq
ðém - 1 freq
“my - 20 freq
‘ma - 8 freq
“mey - 1 freq
‘m - 4 freq
“mmmm - 1 freq
“me - 10 freq
mae- - 2 freq
‘me - 7 freq
meo - 1 freq
meeeh - 3 freq
mw - 2 freq
hmi - 2 freq
’ma - 4 freq
“mmmmm - 2 freq
‘hmi - 1 freq
my- - 1 freq
“mooooo - 1 freq
’me - 3 freq
“m - 1 freq
“hmm - 1 freq
“meh - 1 freq
moa - 20 freq
me- - 1 freq
àm - 1 freq
m' - 1 freq
’hm - 1 freq
hmbbw - 1 freq
mb - 5 freq
mooie - 3 freq
wm- - 1 freq
mui - 1 freq
mao - 2 freq
ymm - 1 freq
mh - 3 freq
mby - 1 freq
mhw - 1 freq
hmmm - 7 freq
'maw' - 1 freq
'ma' - 1 freq
meeeeeeeeee - 1 freq
mmmmm - 1 freq
me’ - 1 freq
maw” - 1 freq
hmo - 1 freq
wm - 12 freq
mwh - 1 freq
mmw - 1 freq
“me - 1 freq
muu - 2 freq
'mae - 1 freq
mbu - 1 freq
wmu - 1 freq
wme - 1 freq
‘my - 1 freq
mbh - 1 freq
'may' - 1 freq
meeee - 1 freq
hyme - 6 freq
“hyme - 1 freq
hyme” - 1 freq
WM
Time to execute Levenshtein function - 0.185207 milliseconds
The Levenshtein distance is the minimum number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
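The definition above can be sketched in a few lines of Python. This is a minimal illustration using the standard dynamic-programming approach, not the site's own code. The function name `levenshtein` is chosen here for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Return the minimum number of single-character edits turning a into b."""
    # previous[j] is the distance between the part of a seen so far and b[:j].
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]  # turning a[:i] into the empty string takes i deletions
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # delete ch_a
                current[j - 1] + 1,      # insert ch_b
                previous[j - 1] + cost,  # replace ch_a with ch_b (or keep it)
            ))
        previous = current
    return previous[-1]


# Compare the query word against a few candidates from the results.
for word in ("wm", "xm", "wmt", "w"):
    print(word, levenshtein("wm", word))
```

Ranking every word in a vocabulary by this distance and keeping the smallest values gives a nearest-neighbour list of the kind shown in the results.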
Time to execute Double Levenshtein function - 0.354102 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
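A rough sketch of that idea follows. It assumes the vowels removed are a, e, i, o and u, which agrees with the distances listed in the results (for example wame at 2 and wme at 1), but the site's exact vowel set is not stated. The names `strip_vowels` and `double_levenshtein` are made up for this example.

```python
VOWELS = set("aeiou")  # assumption: the site's actual vowel set is not documented


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance, computed row by row."""
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop the assumed vowels, leaving the consonant skeleton."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the vowel-free words."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("wm", "wame"))  # vowel changes count once
print(double_levenshtein("wm", "wr"))    # consonant changes count twice
```

Because a consonant change shows up in both passes while a vowel change shows up only in the first, words that share a consonant skeleton stay close together.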
Time to execute SoundEx function - 0.027870 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
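As an illustration, here is a small sketch of the classic American Soundex rules: keep the first letter, map the remaining consonants to digits, collapse adjacent repeats, and pad to four characters. How the site handles apostrophes, hyphens and accented letters is not documented, so this sketch simply drops anything that is not an ASCII letter. That choice is an assumption.

```python
# Consonant groups and their Soundex digits; vowels, h, w and y have no digit.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for letter in letters:
        CODES[letter] = digit


def soundex(word: str) -> str:
    """Return a four-character Soundex code, or "" if there are no letters."""
    # Assumption: punctuation and non-ASCII characters are ignored.
    letters = [c for c in word.lower() if c.isascii() and c.isalpha()]
    if not letters:
        return ""
    result = letters[0].upper()
    previous = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w are skipped and do not separate repeated codes
        code = CODES.get(ch, "")
        if code and code != previous:
            result += code
        previous = code  # a vowel resets this, so a later repeat is kept
        if len(result) == 4:
            break
    return result.ljust(4, "0")


for word in ("wm", "when", "winna", "wame"):
    print(word, soundex(word))
```

All four sample words come out as W500, which is why such differently spelled forms land in the same SoundEx group.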
Time to execute MetaPhone function - 0.040970 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000964 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
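A lookup of this kind can be pictured as a plain dictionary. The sketch below is illustrative only: the entries in `LEXICON` are invented sample data, not taken from the site's real table, and `lookup_lemma` is a made-up name.

```python
from typing import Optional

# Hypothetical sample entries (variant spelling -> lemma), invented for this sketch.
LEXICON = {
    "whan": "when",
    "whin": "when",
    "whun": "when",
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the curated lemma for a word, or None if the word is not covered."""
    return LEXICON.get(word.lower())


print(lookup_lemma("Whan"))
print(lookup_lemma("xyzzy"))
```

A dictionary lookup does no distance computation at all, which fits this method having the shortest execution time of the five.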