A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to who in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
who (0) - 1084 freq
why (1) - 828 freq
whu (1) - 1 freq
tho (1) - 1083 freq
woo (1) - 19 freq
sho (1) - 20 freq
iho (1) - 1 freq
wyo (1) - 1 freq
whe (1) - 1 freq
aho (1) - 1 freq
wdho (1) - 1 freq
whl (1) - 1 freq
whos (1) - 2 freq
whoo (1) - 5 freq
whot (1) - 1 freq
wha (1) - 1886 freq
wo (1) - 2 freq
wlo (1) - 1 freq
wao (1) - 1 freq
wh (1) - 10 freq
'who (1) - 17 freq
whoa (1) - 6 freq
whom (1) - 17 freq
qho (1) - 1 freq
whor (1) - 1 freq
who (0) - 1084 freq
wha (1) - 1886 freq
whe (1) - 1 freq
wh (1) - 10 freq
whoo (1) - 5 freq
whoa (1) - 6 freq
why (1) - 828 freq
whi (1) - 2 freq
whu (1) - 1 freq
whau (2) - 2 freq
whiy (2) - 7 freq
awhe (2) - 2 freq
whaa (2) - 1 freq
wuh (2) - 1 freq
whye (2) - 4 freq
whey (2) - 16 freq
wah (2) - 11 freq
whae (2) - 450 freq
weh (2) - 27 freq
wahoo (2) - 1 freq
whee (2) - 1 freq
ho (2) - 56 freq
awh (2) - 6 freq
whie (2) - 1 freq
wih (2) - 2 freq
SoundEx code - W000
wee - 8357 freq
we - 10460 freq
wha - 1886 freq
wi - 21330 freq
wey - 2472 freq
wa - 148 freq
way - 900 freq
who - 1084 freq
whae - 450 freq
why - 828 freq
'wha - 34 freq
'we - 112 freq
wia - 16 freq
'wee - 14 freq
waa - 219 freq
'woe - 1 freq
wye - 1475 freq
wae - 2590 freq
'who - 17 freq
w - 190 freq
wow - 81 freq
wae- - 2 freq
'wae - 3 freq
waw - 155 freq
whee - 1 freq
whoo - 5 freq
wi' - 752 freq
waiy - 98 freq
wa' - 11 freq
wiy - 56 freq
'we' - 1 freq
'why - 23 freq
wei - 78 freq
wee' - 1 freq
wyee - 1 freq
'whae - 3 freq
whew - 2 freq
wi- - 6 freq
whey - 16 freq
'whey - 1 freq
wii - 10 freq
'whoa - 2 freq
whoa - 6 freq
wha' - 3 freq
wauy - 1 freq
woo - 19 freq
wuiy - 15 freq
weh - 27 freq
weih - 1 freq
wooaa - 4 freq
weiy - 23 freq
wha'e - 1 freq
we' - 2 freq
waheyyy - 1 freq
weeeee - 1 freq
weee - 1 freq
wooooohooo - 1 freq
'wee' - 2 freq
whe - 1 freq
wie - 242 freq
'wi - 8 freq
wah - 11 freq
wh - 10 freq
'w - 2 freq
wye' - 1 freq
www - 176 freq
wwii - 1 freq
wee-' - 1 freq
wei- - 1 freq
wae' - 2 freq
'wh' - 1 freq
way' - 2 freq
wahey - 1 freq
wooooo - 1 freq
woe - 9 freq
'wa - 2 freq
whoohooo - 1 freq
wheyhey - 1 freq
'whoah - 1 freq
wy - 11 freq
waye - 1 freq
weya - 1 freq
'why' - 2 freq
ww - 12 freq
whu - 1 freq
we-' - 2 freq
whie - 1 freq
wow-ee - 1 freq
'wow - 1 freq
waey - 6 freq
'woooooo - 1 freq
wii' - 1 freq
woooo - 1 freq
wooo - 3 freq
wooooooo - 1 freq
wahoo - 1 freq
wooaah - 1 freq
woooooo - 1 freq
we-we - 8 freq
wi'a - 1 freq
wheh - 3 freq
-why - 1 freq
whiy - 7 freq
way- - 1 freq
w' - 1 freq
-wye - 4 freq
wa- - 2 freq
waoa - 1 freq
wou - 7 freq
wyé - 1 freq
wi-wi-wi - 1 freq
'wh - 1 freq
€˜wha - 5 freq
€˜wow-eee - 1 freq
€˜we - 35 freq
€˜wi - 5 freq
€˜waa - 1 freq
€˜whoa - 1 freq
wu - 9 freq
€œwe - 119 freq
€™wa - 2 freq
€œwhy - 7 freq
€˜wee - 6 freq
€˜wey - 1 freq
wí - 1 freq
€”we - 2 freq
why- - 1 freq
'whoa' - 1 freq
€œwhae - 1 freq
€œwi - 4 freq
€œwee - 3 freq
€™we - 10 freq
€˜wow - 2 freq
wee- - 1 freq
€œw - 2 freq
€œwha - 10 freq
€œwae - 7 freq
wwww - 1 freq
whaa - 1 freq
€˜why - 5 freq
€˜wh - 1 freq
€˜who - 22 freq
wuiiie - 1 freq
whye - 4 freq
€œwhye - 1 freq
€œwho - 6 freq
€œwayhay - 1 freq
€œwh - 1 freq
€œwu- - 1 freq
whi - 2 freq
wih - 2 freq
€”wi - 1 freq
€˜wh- - 1 freq
wiye - 2 freq
€™who - 3 freq
€™why - 1 freq
€™wae - 2 freq
wiÂ’ - 10 freq
whau - 2 freq
wwi - 1 freq
weeeeeeee - 1 freq
‘wee’ - 1 freq
wah- - 1 freq
wyo - 1 freq
wue - 1 freq
weyhey - 1 freq
wuhuhuhuhuhu - 1 freq
woohoo - 3 freq
wwiii - 1 freq
whoooo - 5 freq
whoooooo - 5 freq
whooooo - 1 freq
whooooooo - 1 freq
weeeeeeee” - 1 freq
woah - 3 freq
waÂ’ - 2 freq
wowee - 1 freq
wo - 2 freq
wai - 1 freq
wwa - 2 freq
wahahah - 1 freq
wahaha - 1 freq
“why - 1 freq
way” - 1 freq
“who - 1 freq
wuh - 1 freq
-way - 1 freq
wui - 1 freq
wheeoo - 1 freq
waa- - 1 freq
wao - 1 freq
wuu - 1 freq
'who' - 1 freq
'whae' - 1 freq
waahey - 1 freq
wea - 1 freq
MetaPhone code - W
wee - 8357 freq
we - 10460 freq
wha - 1886 freq
wi - 21330 freq
wey - 2472 freq
wa - 148 freq
way - 900 freq
who - 1084 freq
whae - 450 freq
why - 828 freq
'wha - 34 freq
'we - 112 freq
wia - 16 freq
'wee - 14 freq
waa - 219 freq
'woe - 1 freq
wae - 2590 freq
'who - 17 freq
wow - 81 freq
wae- - 2 freq
'wae - 3 freq
waw - 155 freq
whee - 1 freq
whoo - 5 freq
wi' - 752 freq
waiy - 98 freq
wa' - 11 freq
wiy - 56 freq
'we' - 1 freq
'why - 23 freq
wei - 78 freq
wee' - 1 freq
'whae - 3 freq
whew - 2 freq
wi- - 6 freq
whey - 16 freq
'whey - 1 freq
wii - 10 freq
'whoa - 2 freq
whoa - 6 freq
wha' - 3 freq
wauy - 1 freq
woo - 19 freq
wuiy - 15 freq
weh - 27 freq
weih - 1 freq
wooaa - 4 freq
weiy - 23 freq
wha'e - 1 freq
we' - 2 freq
weeeee - 1 freq
weee - 1 freq
'wee' - 2 freq
whe - 1 freq
wie - 242 freq
'wi - 8 freq
wah - 11 freq
wh - 10 freq
wee-' - 1 freq
wei- - 1 freq
wae' - 2 freq
'wh' - 1 freq
way' - 2 freq
wooooo - 1 freq
woe - 9 freq
'wa - 2 freq
'whoah - 1 freq
'why' - 2 freq
whu - 1 freq
we-' - 2 freq
whie - 1 freq
wow-ee - 1 freq
'wow - 1 freq
waey - 6 freq
'woooooo - 1 freq
wii' - 1 freq
woooo - 1 freq
wooo - 3 freq
wooooooo - 1 freq
wooaah - 1 freq
woooooo - 1 freq
wi'a - 1 freq
wheh - 3 freq
-why - 1 freq
whiy - 7 freq
way- - 1 freq
wa- - 2 freq
waoa - 1 freq
wou - 7 freq
'wh - 1 freq
€˜wha - 5 freq
€˜wow-eee - 1 freq
€˜we - 35 freq
€˜wi - 5 freq
€˜waa - 1 freq
€˜whoa - 1 freq
wu - 9 freq
€œwe - 119 freq
€™wa - 2 freq
€œwhy - 7 freq
€˜wee - 6 freq
€˜wey - 1 freq
€”we - 2 freq
why- - 1 freq
'whoa' - 1 freq
€œwhae - 1 freq
€œwi - 4 freq
€œwee - 3 freq
€™we - 10 freq
€˜wow - 2 freq
wee- - 1 freq
€œwha - 10 freq
€œwae - 7 freq
whaa - 1 freq
€˜why - 5 freq
€˜wh - 1 freq
€˜who - 22 freq
wuiiie - 1 freq
€œwho - 6 freq
€œwh - 1 freq
€œwu- - 1 freq
whi - 2 freq
wih - 2 freq
€”wi - 1 freq
€˜wh- - 1 freq
€™who - 3 freq
€™why - 1 freq
€™wae - 2 freq
wiÂ’ - 10 freq
whau - 2 freq
weeeeeeee - 1 freq
‘wee’ - 1 freq
wah- - 1 freq
wue - 1 freq
hwou - 1 freq
whoooo - 5 freq
whoooooo - 5 freq
whooooo - 1 freq
whooooooo - 1 freq
weeeeeeee” - 1 freq
woah - 3 freq
waÂ’ - 2 freq
wo - 2 freq
wai - 1 freq
“why - 1 freq
way” - 1 freq
“who - 1 freq
wuh - 1 freq
-way - 1 freq
wui - 1 freq
wheeoo - 1 freq
waa- - 1 freq
ywo - 1 freq
wao - 1 freq
wuu - 1 freq
'who' - 1 freq
'whae' - 1 freq
wea - 1 freq
WHO
who - 1084 freq
whom - 17 freq
foo - 753 freq
fa - 751 freq
fa's - 80 freq
who's - 77 freq
whae - 450 freq
Time to execute Levenshtein function - 0.241886 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.486301 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027691 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037498 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000918 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.