A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

Similar words to w in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
Levenshtein
w (0) - 188 freq
(1) - 1 freq
sw (1) - 3 freq
wl (1) - 6 freq
ow (1) - 12 freq
dw (1) - 4 freq
tw (1) - 6 freq
v (1) - 266 freq
jw (1) - 4 freq
wb (1) - 4 freq
(1) - 3 freq
d (1) - 462 freq
(1) - 2 freq
(1) - 2 freq
(1) - 4 freq
wa (1) - 145 freq
hw (1) - 1 freq
(1) - 9 freq
(1) - 2 freq
(1) - 3 freq
wu (1) - 9 freq
(1) - 4 freq
wx (1) - 4 freq
(1) - 4 freq
f (1) - 187 freq
Double Levenshtein
w (0) - 188 freq
ew (1) - 5 freq
wy (1) - 11 freq
iw (1) - 2 freq
wa (1) - 145 freq
wo (1) - 2 freq
aw (1) - 8032 freq
wu (1) - 9 freq
wi (1) - 20919 freq
we (1) - 10235 freq
uw (1) - 6 freq
ow (1) - 12 freq
yw (1) - 3 freq
awe (2) - 396 freq
q (2) - 95 freq
wyo (2) - 1 freq
a (2) - 91162 freq
' (2) - 11454 freq
wc (2) - 6 freq
wq (2) - 2 freq
n (2) - 2433 freq
xw (2) - 4 freq
o (2) - 56035 freq
cw (2) - 6 freq
(2) - 1 freq
SoundEx code - W000
wee - 8134 freq
we - 10235 freq
wha - 1876 freq
wi - 20919 freq
wey - 2461 freq
wa - 145 freq
way - 878 freq
who - 1041 freq
whae - 450 freq
why - 804 freq
'wha - 34 freq
'we - 111 freq
wia - 16 freq
'wee - 14 freq
waa - 217 freq
'woe - 1 freq
wye - 1471 freq
wae - 2585 freq
'who - 15 freq
w - 188 freq
wow - 79 freq
wae- - 2 freq
'wae - 3 freq
waw - 155 freq
whee - 1 freq
whoo - 5 freq
wi' - 734 freq
waiy - 98 freq
wa' - 7 freq
wiy - 27 freq
'we' - 1 freq
'why - 22 freq
wei - 78 freq
wee' - 1 freq
wyee - 1 freq
'whae - 3 freq
whew - 2 freq
wi- - 6 freq
whey - 16 freq
'whey - 1 freq
wii - 10 freq
'whoa - 2 freq
whoa - 6 freq
wha' - 3 freq
wauy - 1 freq
woo - 18 freq
wuiy - 15 freq
weh - 27 freq
weih - 1 freq
wooaa - 4 freq
weiy - 23 freq
wha'e - 1 freq
we' - 2 freq
wie - 242 freq
'wi - 8 freq
wah - 11 freq
wh - 10 freq
'w - 2 freq
wye' - 1 freq
www - 176 freq
wwii - 1 freq
wee-' - 1 freq
wei- - 1 freq
wae' - 2 freq
'wh' - 1 freq
way' - 2 freq
wahey - 1 freq
wooooo - 1 freq
woe - 9 freq
'wa - 2 freq
whoohooo - 1 freq
wheyhey - 1 freq
'whoah - 1 freq
wy - 11 freq
waye - 1 freq
weya - 1 freq
'why' - 2 freq
ww - 12 freq
whu - 1 freq
we-' - 2 freq
whie - 1 freq
wow-ee - 1 freq
'wow - 1 freq
waey - 6 freq
'woooooo - 1 freq
wii' - 1 freq
woooo - 1 freq
wooo - 3 freq
wooooooo - 1 freq
wahoo - 1 freq
wooaah - 1 freq
woooooo - 1 freq
we-we - 8 freq
wi'a - 1 freq
wheh - 3 freq
-why - 1 freq
whiy - 7 freq
way- - 1 freq
w' - 1 freq
-wye - 4 freq
wa- - 2 freq
waoa - 1 freq
wou - 7 freq
wyé - 1 freq
wi-wi-wi - 1 freq
'wh - 1 freq
wha - 5 freq
wow-eee - 1 freq
we - 35 freq
wi - 5 freq
waa - 1 freq
whoa - 1 freq
wu - 9 freq
we - 118 freq
wa - 2 freq
why - 7 freq
wee - 6 freq
wey - 1 freq
- 1 freq
we - 2 freq
why- - 1 freq
'whoa' - 1 freq
whae - 1 freq
wi - 4 freq
wee - 3 freq
we - 10 freq
wow - 2 freq
wee- - 1 freq
w - 2 freq
wha - 10 freq
wae - 7 freq
wwww - 1 freq
whaa - 1 freq
why - 5 freq
wh - 1 freq
who - 22 freq
wuiiie - 1 freq
whye - 4 freq
whye - 1 freq
who - 6 freq
wayhay - 1 freq
wh - 1 freq
wu- - 1 freq
whi - 2 freq
wih - 2 freq
wi - 1 freq
wh- - 1 freq
wiye - 2 freq
who - 3 freq
why - 1 freq
wae - 2 freq
wi’ - 10 freq
whau - 2 freq
wwi - 1 freq
weeeeeeee - 1 freq
‘wee’ - 1 freq
wah- - 1 freq
wyo - 1 freq
wue - 1 freq
weyhey - 1 freq
wuhuhuhuhuhu - 1 freq
woohoo - 3 freq
wwiii - 1 freq
whoooo - 5 freq
whoooooo - 5 freq
whooooo - 1 freq
whooooooo - 1 freq
weeeeeeee” - 1 freq
woah - 3 freq
wa’ - 2 freq
wowee - 1 freq
wo - 2 freq
wai - 1 freq
wwa - 2 freq
wahahah - 1 freq
wahaha - 1 freq
“why - 1 freq
way” - 1 freq
“who - 1 freq
wuh - 1 freq
-way - 1 freq
wui - 1 freq
wheeoo - 1 freq
waa- - 1 freq
wao - 1 freq
wuu - 1 freq
'who' - 1 freq
'whae' - 1 freq
waahey - 1 freq
wea - 1 freq
MetaPhone code -
' - 11454 freq
- - 14189 freq
y - 154 freq
'' - 36 freq
w - 188 freq
© - 52 freq
° - 41 freq
£ - 301 freq
à - 2 freq
hy - 5 freq
« - 6 freq
'y - 1 freq
'- - 2 freq
h - 257 freq
-- - 7 freq
-' - 2 freq
¢ - 3 freq
® - 12 freq
- 17 freq
'w - 2 freq
© - 2 freq
° - 2 freq
www - 176 freq
wwii - 1 freq
é - 3 freq
yw - 3 freq
°° - 3 freq
°© - 1 freq
®° - 1 freq
''' - 1 freq
wy - 11 freq
¥ - 3 freq
- 1 freq
'þ' - 1 freq
ð - 3 freq
' - 1 freq
- 3 freq
'y' - 2 freq
- 2 freq
'ö' - 2 freq
ww - 12 freq
- 2 freq
§ - 1 freq
- 3 freq
æ - 2 freq
'ø' - 1 freq
Æ - 2 freq
¾ - 1 freq
- 3 freq
w' - 1 freq
- 3 freq
- 13 freq
- 2 freq
- 1 freq
- 4 freq
- 2 freq
- 2 freq
- 1 freq
- 11 freq
- 1 freq
- 1 freq
- 3 freq
- 1 freq
- 4 freq
- 3 freq
- 3 freq
- 2 freq
- 2 freq
- 1 freq
- 4 freq
¼ - 1 freq
- 1 freq
- 1 freq
- 20 freq
- 4 freq
- 11 freq
ø - 11 freq
- 3 freq
- 5 freq
- 9 freq
- 2 freq
- 4 freq
- 2 freq
- 3 freq
- 4 freq
- 1 freq
- 4 freq
-y - 6 freq
á - 4 freq
- 4 freq
- 3 freq
þæ - 2 freq
½ - 1 freq
þú - 1 freq
wyé - 1 freq
è - 1 freq
híe - 1 freq
- 1422 freq
- 8282 freq
- 5394 freq
- 397 freq
- 345 freq
- 223 freq
- 1 freq
- 1 freq
- 1 freq
- 58 freq
- 58 freq
· - 32 freq
ü - 2 freq
Ðy - 3 freq
- 1 freq
- 27 freq
- - 1 freq
- 1 freq
w - 2 freq
- - 1 freq
wwww - 1 freq
- 1 freq
y - 2 freq
y - 3 freq
Ð - 1 freq
Ó - 2 freq
' - 6 freq
y - 7 freq
- - 1 freq
ö - 6 freq
- 2 freq
- 1 freq
- 1 freq
- 1 freq
- 1 freq
- 1 freq
ç - 2 freq
” - 59 freq
yh - 2 freq
hw - 1 freq
‘ - 18 freq
– - 30 freq
“ - 13 freq
wwi - 1 freq
’ - 17 freq
hh - 3 freq
¯ - 12 freq
wwiii - 1 freq
— - 2 freq
hhy - 1 freq
–’ - 1 freq
hhoh - 1 freq
wwa - 2 freq
hyyh - 1 freq
yye - 2 freq
'h' - 1 freq
yy - 1 freq
… - 4 freq
y-o - 1 freq
W
Time to execute Levenshtein function - 0.181486 milliseconds
The Levenshtein distance is the minimum number of single-character replacements, insertions or deletions needed to transform one word into another. It is useful for detecting typos and alternative spellings.
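The definition above can be sketched in a few lines of Python. This is a minimal illustration of the standard dynamic-programming method, not the code this site runs; the function name `levenshtein` is an assumption made for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance counting single-character inserts, deletes and replaces."""
    # prev[j] holds the distance between the first i-1 characters of a
    # and the first j characters of b.
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]  # turning a[:i] into the empty string takes i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(
                prev[j] + 1,         # delete ca
                curr[j - 1] + 1,     # insert cb
                prev[j - 1] + cost,  # replace ca with cb, or keep a match
            ))
        prev = curr
    return prev[-1]


if __name__ == "__main__":
    for candidate in ("w", "wa", "awe"):
        print(candidate, levenshtein("w", candidate))
```

Run against the query word w, this gives 0 for w, 1 for wa and 2 for awe, which matches the bracketed distances shown in the results.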
Time to execute Double Levenshtein function - 0.291798 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
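A rough sketch of that idea follows. The names `strip_vowels` and `double_levenshtein` are hypothetical, and the vowel set is an assumption: y is counted as a vowel only because that is consistent with yw and wy appearing at distance 1 in the results, which the page itself does not confirm.

```python
VOWELS = set("aeiouy")  # assumption: the site's actual vowel set is not documented


def levenshtein(a: str, b: str) -> int:
    """Plain edit distance (insert, delete, replace), row by row."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost))
        prev = curr
    return prev[-1]


def strip_vowels(word: str) -> str:
    """Remove every character found in the assumed vowel set."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


if __name__ == "__main__":
    for candidate in ("wa", "awe", "wc", "a"):
        print(candidate, double_levenshtein("w", candidate))
```

A vowel-only difference such as w versus wa is charged once, while a consonant difference such as w versus wc is charged in both passes and so scores 2.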
Time to execute SoundEx function - 0.028108 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
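As an illustration, here is a small sketch of the common American Soundex rules: keep the first letter, map the remaining consonants to digits, collapse adjacent repeats, and pad to four characters. Skipping non-letters such as apostrophes and hyphens is an assumption made so that forms like 'wha and wi-wi-wi encode the same way as in the results; the site's own implementation may differ in detail.

```python
# Digit groups used by American Soundex; vowels, h, w and y have no digit.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word: str) -> str:
    """Return a four-character Soundex code, or '' if the word has no a-z letters."""
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    result = letters[0].upper()
    last = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate two letters with the same code
        code = CODES.get(ch, "")
        if code and code != last:
            result += code
        last = code  # a vowel resets this, so a repeated code after it counts again
        if len(result) == 4:
            break
    return result.ljust(4, "0")


if __name__ == "__main__":
    for word in ("w", "wee", "'wha", "wi-wi-wi", "Robert"):
        print(word, soundex(word))
```

Because vowels, h, w and y carry no digit, every word made up only of those letters after an initial w collapses to W000, which is why the list for this code is so long.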
Time to execute MetaPhone function - 0.036804 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.003241 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
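A lookup of this kind might look like the sketch below. The table entries and the names `LEMMA_TABLE`, `lookup_lemma` and `variants_of` are purely illustrative and are not taken from the corpus lexicon; the point is only that a word outside the table returns nothing.

```python
from typing import Optional

# Hypothetical hand-made entries mapping a spelling to its lemma.
LEMMA_TABLE = {
    "wi": "wi",
    "wi'": "wi",
    "wha": "wha",
    "whae": "wha",
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the curated lemma for a word, or None if the word is not covered."""
    return LEMMA_TABLE.get(word.strip().lower())


def variants_of(lemma: str) -> list:
    """List every spelling in the table that points at the given lemma."""
    return sorted(w for w, l in LEMMA_TABLE.items() if l == lemma)


if __name__ == "__main__":
    print(lookup_lemma("Whae"))
    print(lookup_lemma("zzz"))
    print(variants_of("wi"))
```

A plain dictionary lookup involves no distance computation at all, which fits with this method having the shortest execution time of the five.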