A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to who in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
who (0) - 1041 freq
whl (1) - 1 freq
whi (1) - 2 freq
whos (1) - 2 freq
wha (1) - 1876 freq
ho (1) - 56 freq
wao (1) - 1 freq
wlo (1) - 1 freq
sho (1) - 19 freq
wh (1) - 10 freq
whoa (1) - 6 freq
whoo (1) - 5 freq
aho (1) - 1 freq
whor (1) - 1 freq
whot (1) - 1 freq
'who (1) - 15 freq
iho (1) - 1 freq
whu (1) - 1 freq
wyo (1) - 1 freq
wo (1) - 2 freq
qho (1) - 1 freq
wdho (1) - 1 freq
whom (1) - 17 freq
why (1) - 804 freq
woo (1) - 18 freq
who (0) - 1041 freq
whoa (1) - 6 freq
whu (1) - 1 freq
wh (1) - 10 freq
why (1) - 804 freq
whoo (1) - 5 freq
whi (1) - 2 freq
wha (1) - 1876 freq
whee (2) - 1 freq
wah (2) - 11 freq
whau (2) - 2 freq
whie (2) - 1 freq
awhe (2) - 2 freq
whye (2) - 4 freq
tho (2) - 1074 freq
awh (2) - 6 freq
whaa (2) - 1 freq
wuh (2) - 1 freq
wahoo (2) - 1 freq
woo (2) - 18 freq
wih (2) - 2 freq
weh (2) - 27 freq
whae (2) - 450 freq
whey (2) - 16 freq
whiy (2) - 7 freq
SoundEx code - W000
wee - 8134 freq
we - 10235 freq
wha - 1876 freq
wi - 20919 freq
wey - 2461 freq
wa - 145 freq
way - 878 freq
who - 1041 freq
whae - 450 freq
why - 804 freq
'wha - 34 freq
'we - 111 freq
wia - 16 freq
'wee - 14 freq
waa - 217 freq
'woe - 1 freq
wye - 1471 freq
wae - 2585 freq
'who - 15 freq
w - 188 freq
wow - 79 freq
wae- - 2 freq
'wae - 3 freq
waw - 155 freq
whee - 1 freq
whoo - 5 freq
wi' - 734 freq
waiy - 98 freq
wa' - 7 freq
wiy - 27 freq
'we' - 1 freq
'why - 22 freq
wei - 78 freq
wee' - 1 freq
wyee - 1 freq
'whae - 3 freq
whew - 2 freq
wi- - 6 freq
whey - 16 freq
'whey - 1 freq
wii - 10 freq
'whoa - 2 freq
whoa - 6 freq
wha' - 3 freq
wauy - 1 freq
woo - 18 freq
wuiy - 15 freq
weh - 27 freq
weih - 1 freq
wooaa - 4 freq
weiy - 23 freq
wha'e - 1 freq
we' - 2 freq
wie - 242 freq
'wi - 8 freq
wah - 11 freq
wh - 10 freq
'w - 2 freq
wye' - 1 freq
www - 176 freq
wwii - 1 freq
wee-' - 1 freq
wei- - 1 freq
wae' - 2 freq
'wh' - 1 freq
way' - 2 freq
wahey - 1 freq
wooooo - 1 freq
woe - 9 freq
'wa - 2 freq
whoohooo - 1 freq
wheyhey - 1 freq
'whoah - 1 freq
wy - 11 freq
waye - 1 freq
weya - 1 freq
'why' - 2 freq
ww - 12 freq
whu - 1 freq
we-' - 2 freq
whie - 1 freq
wow-ee - 1 freq
'wow - 1 freq
waey - 6 freq
'woooooo - 1 freq
wii' - 1 freq
woooo - 1 freq
wooo - 3 freq
wooooooo - 1 freq
wahoo - 1 freq
wooaah - 1 freq
woooooo - 1 freq
we-we - 8 freq
wi'a - 1 freq
wheh - 3 freq
-why - 1 freq
whiy - 7 freq
way- - 1 freq
w' - 1 freq
-wye - 4 freq
wa- - 2 freq
waoa - 1 freq
wou - 7 freq
wyé - 1 freq
wi-wi-wi - 1 freq
'wh - 1 freq
€˜wha - 5 freq
€˜wow-eee - 1 freq
€˜we - 35 freq
€˜wi - 5 freq
€˜waa - 1 freq
€˜whoa - 1 freq
wu - 9 freq
€œwe - 118 freq
€™wa - 2 freq
€œwhy - 7 freq
€˜wee - 6 freq
€˜wey - 1 freq
wí - 1 freq
€”we - 2 freq
why- - 1 freq
'whoa' - 1 freq
€œwhae - 1 freq
€œwi - 4 freq
€œwee - 3 freq
€™we - 10 freq
€˜wow - 2 freq
wee- - 1 freq
€œw - 2 freq
€œwha - 10 freq
€œwae - 7 freq
wwww - 1 freq
whaa - 1 freq
€˜why - 5 freq
€˜wh - 1 freq
€˜who - 22 freq
wuiiie - 1 freq
whye - 4 freq
€œwhye - 1 freq
€œwho - 6 freq
€œwayhay - 1 freq
€œwh - 1 freq
€œwu- - 1 freq
whi - 2 freq
wih - 2 freq
€”wi - 1 freq
€˜wh- - 1 freq
wiye - 2 freq
€™who - 3 freq
€™why - 1 freq
€™wae - 2 freq
wiÂ’ - 10 freq
whau - 2 freq
wwi - 1 freq
weeeeeeee - 1 freq
‘wee’ - 1 freq
wah- - 1 freq
wyo - 1 freq
wue - 1 freq
weyhey - 1 freq
wuhuhuhuhuhu - 1 freq
woohoo - 3 freq
wwiii - 1 freq
whoooo - 5 freq
whoooooo - 5 freq
whooooo - 1 freq
whooooooo - 1 freq
weeeeeeee” - 1 freq
woah - 3 freq
waÂ’ - 2 freq
wowee - 1 freq
wo - 2 freq
wai - 1 freq
wwa - 2 freq
wahahah - 1 freq
wahaha - 1 freq
“why - 1 freq
way” - 1 freq
“who - 1 freq
wuh - 1 freq
-way - 1 freq
wui - 1 freq
wheeoo - 1 freq
waa- - 1 freq
wao - 1 freq
wuu - 1 freq
'who' - 1 freq
'whae' - 1 freq
waahey - 1 freq
wea - 1 freq
MetaPhone code - W
wee - 8134 freq
we - 10235 freq
wha - 1876 freq
wi - 20919 freq
wey - 2461 freq
wa - 145 freq
way - 878 freq
who - 1041 freq
whae - 450 freq
why - 804 freq
'wha - 34 freq
'we - 111 freq
wia - 16 freq
'wee - 14 freq
waa - 217 freq
'woe - 1 freq
wae - 2585 freq
'who - 15 freq
wow - 79 freq
wae- - 2 freq
'wae - 3 freq
waw - 155 freq
whee - 1 freq
whoo - 5 freq
wi' - 734 freq
waiy - 98 freq
wa' - 7 freq
wiy - 27 freq
'we' - 1 freq
'why - 22 freq
wei - 78 freq
wee' - 1 freq
'whae - 3 freq
whew - 2 freq
wi- - 6 freq
whey - 16 freq
'whey - 1 freq
wii - 10 freq
'whoa - 2 freq
whoa - 6 freq
wha' - 3 freq
wauy - 1 freq
woo - 18 freq
wuiy - 15 freq
weh - 27 freq
weih - 1 freq
wooaa - 4 freq
weiy - 23 freq
wha'e - 1 freq
we' - 2 freq
wie - 242 freq
'wi - 8 freq
wah - 11 freq
wh - 10 freq
wee-' - 1 freq
wei- - 1 freq
wae' - 2 freq
'wh' - 1 freq
way' - 2 freq
wooooo - 1 freq
woe - 9 freq
'wa - 2 freq
'whoah - 1 freq
'why' - 2 freq
whu - 1 freq
we-' - 2 freq
whie - 1 freq
wow-ee - 1 freq
'wow - 1 freq
waey - 6 freq
'woooooo - 1 freq
wii' - 1 freq
woooo - 1 freq
wooo - 3 freq
wooooooo - 1 freq
wooaah - 1 freq
woooooo - 1 freq
wi'a - 1 freq
wheh - 3 freq
-why - 1 freq
whiy - 7 freq
way- - 1 freq
wa- - 2 freq
waoa - 1 freq
wou - 7 freq
'wh - 1 freq
€˜wha - 5 freq
€˜wow-eee - 1 freq
€˜we - 35 freq
€˜wi - 5 freq
€˜waa - 1 freq
€˜whoa - 1 freq
wu - 9 freq
€œwe - 118 freq
€™wa - 2 freq
€œwhy - 7 freq
€˜wee - 6 freq
€˜wey - 1 freq
€”we - 2 freq
why- - 1 freq
'whoa' - 1 freq
€œwhae - 1 freq
€œwi - 4 freq
€œwee - 3 freq
€™we - 10 freq
€˜wow - 2 freq
wee- - 1 freq
€œwha - 10 freq
€œwae - 7 freq
whaa - 1 freq
€˜why - 5 freq
€˜wh - 1 freq
€˜who - 22 freq
wuiiie - 1 freq
€œwho - 6 freq
€œwh - 1 freq
€œwu- - 1 freq
whi - 2 freq
wih - 2 freq
€”wi - 1 freq
€˜wh- - 1 freq
€™who - 3 freq
€™why - 1 freq
€™wae - 2 freq
wiÂ’ - 10 freq
whau - 2 freq
weeeeeeee - 1 freq
‘wee’ - 1 freq
wah- - 1 freq
wue - 1 freq
hwou - 1 freq
whoooo - 5 freq
whoooooo - 5 freq
whooooo - 1 freq
whooooooo - 1 freq
weeeeeeee” - 1 freq
woah - 3 freq
waÂ’ - 2 freq
wo - 2 freq
wai - 1 freq
“why - 1 freq
way” - 1 freq
“who - 1 freq
wuh - 1 freq
-way - 1 freq
wui - 1 freq
wheeoo - 1 freq
waa- - 1 freq
ywo - 1 freq
wao - 1 freq
wuu - 1 freq
'who' - 1 freq
'whae' - 1 freq
wea - 1 freq
WHO
who - 1041 freq
whom - 17 freq
foo - 751 freq
fa - 748 freq
fa's - 80 freq
who's - 73 freq
Time to execute Levenshtein function - 0.179971 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.353004 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028217 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037115 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000855 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.