A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to wha� in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
whar (3) - 382 freq
whau'd (3) - 1 freq
whackt (3) - 1 freq
wha' (3) - 3 freq
whangs (3) - 5 freq
whaas (3) - 1 freq
whaes (3) - 27 freq
whaur- (3) - 2 freq
whaft (3) - 1 freq
whare (3) - 19 freq
whaup (3) - 15 freq
wha's (3) - 163 freq
whatr (3) - 1 freq
whaps (3) - 3 freq
whaure (3) - 2 freq
whass (3) - 1 freq
whapit (3) - 1 freq
whalp (3) - 11 freq
whale (3) - 32 freq
whase (3) - 142 freq
wharas (3) - 1 freq
whatno (3) - 2 freq
whang (3) - 6 freq
whalps (3) - 3 freq
whar' (3) - 1 freq
whaurs (6) - 9 freq
whash (6) - 1 freq
whaize (6) - 2 freq
whars (6) - 3 freq
whae'd (6) - 9 freq
whaese (6) - 3 freq
whales (6) - 15 freq
whaal (6) - 2 freq
whant (6) - 15 freq
whaups (6) - 21 freq
whaals (6) - 4 freq
whan (6) - 2757 freq
whaen (6) - 3 freq
wha've (6) - 5 freq
whas (6) - 17 freq
whalin (6) - 24 freq
whaar (6) - 180 freq
whap's (6) - 1 freq
whaa (6) - 1 freq
whaim (6) - 1 freq
whae's (6) - 29 freq
whaul (6) - 2 freq
whaisk (6) - 1 freq
whan's (6) - 3 freq
whaap (6) - 3 freq
SoundEx code - W000
wee - 8357 freq
we - 10460 freq
wha - 1886 freq
wi - 21330 freq
wey - 2472 freq
wa - 148 freq
way - 900 freq
who - 1084 freq
whae - 450 freq
why - 828 freq
'wha - 34 freq
'we - 112 freq
wia - 16 freq
'wee - 14 freq
waa - 219 freq
'woe - 1 freq
wye - 1475 freq
wae - 2590 freq
'who - 17 freq
w - 190 freq
wow - 81 freq
wae- - 2 freq
'wae - 3 freq
waw - 155 freq
whee - 1 freq
whoo - 5 freq
wi' - 752 freq
waiy - 98 freq
wa' - 11 freq
wiy - 56 freq
'we' - 1 freq
'why - 23 freq
wei - 78 freq
wee' - 1 freq
wyee - 1 freq
'whae - 3 freq
whew - 2 freq
wi- - 6 freq
whey - 16 freq
'whey - 1 freq
wii - 10 freq
'whoa - 2 freq
whoa - 6 freq
wha' - 3 freq
wauy - 1 freq
woo - 19 freq
wuiy - 15 freq
weh - 27 freq
weih - 1 freq
wooaa - 4 freq
weiy - 23 freq
wha'e - 1 freq
we' - 2 freq
waheyyy - 1 freq
weeeee - 1 freq
weee - 1 freq
wooooohooo - 1 freq
'wee' - 2 freq
whe - 1 freq
wie - 242 freq
'wi - 8 freq
wah - 11 freq
wh - 10 freq
'w - 2 freq
wye' - 1 freq
www - 176 freq
wwii - 1 freq
wee-' - 1 freq
wei- - 1 freq
wae' - 2 freq
'wh' - 1 freq
way' - 2 freq
wahey - 1 freq
wooooo - 1 freq
woe - 9 freq
'wa - 2 freq
whoohooo - 1 freq
wheyhey - 1 freq
'whoah - 1 freq
wy - 11 freq
waye - 1 freq
weya - 1 freq
'why' - 2 freq
ww - 12 freq
whu - 1 freq
we-' - 2 freq
whie - 1 freq
wow-ee - 1 freq
'wow - 1 freq
waey - 6 freq
'woooooo - 1 freq
wii' - 1 freq
woooo - 1 freq
wooo - 3 freq
wooooooo - 1 freq
wahoo - 1 freq
wooaah - 1 freq
woooooo - 1 freq
we-we - 8 freq
wi'a - 1 freq
wheh - 3 freq
-why - 1 freq
whiy - 7 freq
way- - 1 freq
w' - 1 freq
-wye - 4 freq
wa- - 2 freq
waoa - 1 freq
wou - 7 freq
wyé - 1 freq
wi-wi-wi - 1 freq
'wh - 1 freq
€˜wha - 5 freq
€˜wow-eee - 1 freq
€˜we - 35 freq
€˜wi - 5 freq
€˜waa - 1 freq
€˜whoa - 1 freq
wu - 9 freq
€œwe - 119 freq
€™wa - 2 freq
€œwhy - 7 freq
€˜wee - 6 freq
€˜wey - 1 freq
wí - 1 freq
€”we - 2 freq
why- - 1 freq
'whoa' - 1 freq
€œwhae - 1 freq
€œwi - 4 freq
€œwee - 3 freq
€™we - 10 freq
€˜wow - 2 freq
wee- - 1 freq
€œw - 2 freq
€œwha - 10 freq
€œwae - 7 freq
wwww - 1 freq
whaa - 1 freq
€˜why - 5 freq
€˜wh - 1 freq
€˜who - 22 freq
wuiiie - 1 freq
whye - 4 freq
€œwhye - 1 freq
€œwho - 6 freq
€œwayhay - 1 freq
€œwh - 1 freq
€œwu- - 1 freq
whi - 2 freq
wih - 2 freq
€”wi - 1 freq
€˜wh- - 1 freq
wiye - 2 freq
€™who - 3 freq
€™why - 1 freq
€™wae - 2 freq
wiÂ’ - 10 freq
whau - 2 freq
wwi - 1 freq
weeeeeeee - 1 freq
‘wee’ - 1 freq
wah- - 1 freq
wyo - 1 freq
wue - 1 freq
weyhey - 1 freq
wuhuhuhuhuhu - 1 freq
woohoo - 3 freq
wwiii - 1 freq
whoooo - 5 freq
whoooooo - 5 freq
whooooo - 1 freq
whooooooo - 1 freq
weeeeeeee” - 1 freq
woah - 3 freq
waÂ’ - 2 freq
wowee - 1 freq
wo - 2 freq
wai - 1 freq
wwa - 2 freq
wahahah - 1 freq
wahaha - 1 freq
“why - 1 freq
way” - 1 freq
“who - 1 freq
wuh - 1 freq
-way - 1 freq
wui - 1 freq
wheeoo - 1 freq
waa- - 1 freq
wao - 1 freq
wuu - 1 freq
'who' - 1 freq
'whae' - 1 freq
waahey - 1 freq
wea - 1 freq
MetaPhone code - W
wee - 8357 freq
we - 10460 freq
wha - 1886 freq
wi - 21330 freq
wey - 2472 freq
wa - 148 freq
way - 900 freq
who - 1084 freq
whae - 450 freq
why - 828 freq
'wha - 34 freq
'we - 112 freq
wia - 16 freq
'wee - 14 freq
waa - 219 freq
'woe - 1 freq
wae - 2590 freq
'who - 17 freq
wow - 81 freq
wae- - 2 freq
'wae - 3 freq
waw - 155 freq
whee - 1 freq
whoo - 5 freq
wi' - 752 freq
waiy - 98 freq
wa' - 11 freq
wiy - 56 freq
'we' - 1 freq
'why - 23 freq
wei - 78 freq
wee' - 1 freq
'whae - 3 freq
whew - 2 freq
wi- - 6 freq
whey - 16 freq
'whey - 1 freq
wii - 10 freq
'whoa - 2 freq
whoa - 6 freq
wha' - 3 freq
wauy - 1 freq
woo - 19 freq
wuiy - 15 freq
weh - 27 freq
weih - 1 freq
wooaa - 4 freq
weiy - 23 freq
wha'e - 1 freq
we' - 2 freq
weeeee - 1 freq
weee - 1 freq
'wee' - 2 freq
whe - 1 freq
wie - 242 freq
'wi - 8 freq
wah - 11 freq
wh - 10 freq
wee-' - 1 freq
wei- - 1 freq
wae' - 2 freq
'wh' - 1 freq
way' - 2 freq
wooooo - 1 freq
woe - 9 freq
'wa - 2 freq
'whoah - 1 freq
'why' - 2 freq
whu - 1 freq
we-' - 2 freq
whie - 1 freq
wow-ee - 1 freq
'wow - 1 freq
waey - 6 freq
'woooooo - 1 freq
wii' - 1 freq
woooo - 1 freq
wooo - 3 freq
wooooooo - 1 freq
wooaah - 1 freq
woooooo - 1 freq
wi'a - 1 freq
wheh - 3 freq
-why - 1 freq
whiy - 7 freq
way- - 1 freq
wa- - 2 freq
waoa - 1 freq
wou - 7 freq
'wh - 1 freq
€˜wha - 5 freq
€˜wow-eee - 1 freq
€˜we - 35 freq
€˜wi - 5 freq
€˜waa - 1 freq
€˜whoa - 1 freq
wu - 9 freq
€œwe - 119 freq
€™wa - 2 freq
€œwhy - 7 freq
€˜wee - 6 freq
€˜wey - 1 freq
€”we - 2 freq
why- - 1 freq
'whoa' - 1 freq
€œwhae - 1 freq
€œwi - 4 freq
€œwee - 3 freq
€™we - 10 freq
€˜wow - 2 freq
wee- - 1 freq
€œwha - 10 freq
€œwae - 7 freq
whaa - 1 freq
€˜why - 5 freq
€˜wh - 1 freq
€˜who - 22 freq
wuiiie - 1 freq
€œwho - 6 freq
€œwh - 1 freq
€œwu- - 1 freq
whi - 2 freq
wih - 2 freq
€”wi - 1 freq
€˜wh- - 1 freq
€™who - 3 freq
€™why - 1 freq
€™wae - 2 freq
wiÂ’ - 10 freq
whau - 2 freq
weeeeeeee - 1 freq
‘wee’ - 1 freq
wah- - 1 freq
wue - 1 freq
hwou - 1 freq
whoooo - 5 freq
whoooooo - 5 freq
whooooo - 1 freq
whooooooo - 1 freq
weeeeeeee” - 1 freq
woah - 3 freq
waÂ’ - 2 freq
wo - 2 freq
wai - 1 freq
“why - 1 freq
way” - 1 freq
“who - 1 freq
wuh - 1 freq
-way - 1 freq
wui - 1 freq
wheeoo - 1 freq
waa- - 1 freq
ywo - 1 freq
wao - 1 freq
wuu - 1 freq
'who' - 1 freq
'whae' - 1 freq
wea - 1 freq
WHA�
Time to execute Levenshtein function - 0.578974 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.868832 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.068683 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.045654 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.007278 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.