A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to aaaahoo in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
aaaahoo (0) - 1 freq
yaahoo (2) - 3 freq
aaaah (2) - 1 freq
'yaahoo (2) - 1 freq
abaaht (3) - 1 freq
aathoot (3) - 1 freq
wahoo (3) - 1 freq
aaach (3) - 1 freq
aaaa (3) - 2 freq
aaaaagh (3) - 1 freq
aathou (3) - 1 freq
yahoo (3) - 5 freq
aaaargh (3) - 1 freq
anyhoo (3) - 9 freq
aaaaaaa (3) - 1 freq
aaagh (3) - 1 freq
aatho (3) - 2 freq
ataafoi (3) - 1 freq
aaaaah (3) - 3 freq
atchoo (3) - 1 freq
atishoo (3) - 2 freq
yawhoo (3) - 1 freq
aahin (4) - 3 freq
allah (4) - 1 freq
haaal (4) - 1 freq
aaaahoo (0) - 1 freq
aaaah (2) - 1 freq
yaahoo (2) - 3 freq
yahoo (3) - 5 freq
'yaahoo (3) - 1 freq
aaaaah (3) - 3 freq
aatho (4) - 2 freq
ahaa (4) - 1 freq
haaaa (4) - 2 freq
uaahu (4) - 1 freq
yoohoo (4) - 2 freq
aho (4) - 1 freq
aah (4) - 6 freq
aaagh (4) - 1 freq
yawhoo (4) - 1 freq
aaaaaaa (4) - 1 freq
aaach (4) - 1 freq
wahoo (4) - 1 freq
yaho (4) - 1 freq
hoo (4) - 1076 freq
yeehoo (4) - 1 freq
aaaa (4) - 2 freq
aaaaagh (4) - 1 freq
anyhoo (4) - 9 freq
aathou (4) - 1 freq
SoundEx code - A000
a - 92602 freq
awa - 4217 freq
aw - 8237 freq
aye - 6435 freq
'aiya - 1 freq
'aye - 306 freq
away - 766 freq
ae - 5555 freq
aa - 7151 freq
ah - 17377 freq
'awa - 22 freq
'aw - 69 freq
'a - 285 freq
ay - 2481 freq
-aye - 3 freq
awey - 180 freq
aea - 1 freq
a- - 4 freq
ah- - 4 freq
'ah - 382 freq
awee - 5 freq
awaa - 103 freq
'ae - 83 freq
awe - 396 freq
'awww - 2 freq
'ay - 112 freq
'ay' - 7 freq
aawey - 6 freq
- 2 freq
'ahhh - 1 freq
'ahh - 2 freq
a' - 459 freq
awa' - 25 freq
aawey' - 1 freq
'awwwwwwww - 1 freq
'awwwww-hawwwwww - 1 freq
'awwwwww - 1 freq
'awwwwwww - 2 freq
'aww - 2 freq
-a - 2 freq
aa' - 7 freq
'aa - 16 freq
a-a-ah - 1 freq
ah-ah-ah - 1 freq
aa- - 1 freq
ai - 30 freq
'aye' - 12 freq
aawye - 42 freq
au - 16 freq
a-wye - 1 freq
a-wee - 3 freq
awae - 22 freq
awiy - 22 freq
'away - 3 freq
'a' - 23 freq
'aa' - 3 freq
aiy - 4 freq
'aye'' - 1 freq
aaaa - 2 freq
awuiy - 1 freq
ahaa - 1 freq
aua - 1 freq
aha - 8 freq
aye¥ - 1 freq
ae' - 1 freq
ah' - 23 freq
ayyyyyyyyy - 1 freq
ayyyyyy - 1 freq
ahhhhhhhh - 1 freq
ahh - 9 freq
ayyyye - 1 freq
ahhh - 5 freq
ahh' - 1 freq
a-e - 1 freq
awoa - 1 freq
ah'y - 3 freq
aaa - 1 freq
aaaahoo - 1 freq
ah'ii - 2 freq
¢a - 1 freq
ah-ah - 1 freq
ah'ye - 1 freq
awah - 14 freq
aah - 6 freq
-ay - 1 freq
a'y - 1 freq
ah-ha - 3 freq
'aha - 1 freq
awww - 25 freq
ahah - 1 freq
ay¢ - 1 freq
ahahah - 1 freq
ay-ay - 1 freq
aay - 1 freq
ahae - 1 freq
'ah-ha - 2 freq
awhe - 2 freq
awhie - 2 freq
au' - 1 freq
a'hae - 1 freq
away' - 1 freq
awae' - 1 freq
'awa' - 1 freq
awwww - 22 freq
ah'i - 1 freq
aye' - 1 freq
a'h - 2 freq
aho - 1 freq
aw' - 83 freq
'ahhhhhhh - 1 freq
awo - 1 freq
a-ha - 3 freq
'a-aye - 1 freq
--aw - 1 freq
awaw - 72 freq
'ah-h-h-h-h-h - 1 freq
aww - 55 freq
aaaaah - 3 freq
a'aa - 2 freq
aye-aye - 4 freq
awaey - 4 freq
a-hah - 2 freq
a'wie - 1 freq
aaaah - 1 freq
a'i' - 3 freq
aa'wie - 1 freq
aai' - 1 freq
awye - 13 freq
'aaaaw' - 1 freq
aa'wye - 1 freq
awy - 13 freq
aweiy-' - 1 freq
aweiy - 8 freq
a-a - 1 freq
a - 2 freq
a - 1 freq
aye-oh - 1 freq
'ae' - 1 freq
ahie - 1 freq
awwwwww - 3 freq
ahhhhhh - 2 freq
'away' - 1 freq
aye - 85 freq
ah - 240 freq
awa - 2 freq
aw - 19 freq
a - 73 freq
ah - 123 freq
ay - 9 freq
aye - 62 freq
aa - 23 freq
a - 20 freq
a - 185 freq
ay - 91 freq
aa - 7 freq
aw-w-w - 1 freq
a - 6 freq
awà - 12 freq
a'a - 77 freq
awie - 1 freq
a - 1 freq
awaw - 1 freq
awa - 8 freq
aoww - 1 freq
ai - 1 freq
ai - 1 freq
aw - 11 freq
aw - 2 freq
awh - 6 freq
ae - 12 freq
aaahhh - 1 freq
awaiy - 4 freq
away - 5 freq
ahh - 1 freq
aw - 2 freq
ae - 4 freq
ayee - 8 freq
aeway - 1 freq
a - 3 freq
aaww - 1 freq
a'e - 1 freq
ay' - 4 freq
awwwwww - 1 freq
a - 1 freq
ao - 2 freq
ao - 1 freq
ay - 1 freq
ay- - 1 freq
awyo - 1 freq
awww - 1 freq
ay-y-y - 2 freq
ah-hah - 1 freq
aaahh - 1 freq
aw-wey - 2 freq
aye - 3 freq
ah - 5 freq
-ae - 1 freq
ae - 2 freq
awiye - 1 freq
awei - 1 freq
ah - 32 freq
aye - 12 freq
“ah - 4 freq
aeaw - 1 freq
ao - 1 freq
auy - 1 freq
- 13 freq
a'wi - 1 freq
aaue - 1 freq
a’w - 1 freq
“awa - 2 freq
aey - 4 freq
“awww” - 1 freq
“a - 5 freq
awwwwwww - 2 freq
a’i - 1 freq
'aw' - 1 freq
aye” - 1 freq
ayw - 1 freq
ahhhhh - 2 freq
awwwww - 7 freq
awé - 1 freq
awewwwwww - 1 freq
ahhhhhhhhhhh - 1 freq
‘aye - 2 freq
“away” - 1 freq
“aye - 2 freq
“aye” - 1 freq
“aw - 2 freq
ahhhh - 2 freq
awa’ - 1 freq
awi - 1 freq
ýae - 1 freq
ayui - 1 freq
aoa - 1 freq
aaaaaaa - 1 freq
aa’ - 4 freq
away” - 1 freq
‘a’ - 1 freq
MetaPhone code - AH
'awwwww-hawwwwww - 1 freq
ahaa - 1 freq
aha - 8 freq
aaaahoo - 1 freq
ah-ha - 3 freq
'aha - 1 freq
ahah - 1 freq
ahae - 1 freq
'ah-ha - 2 freq
awhe - 2 freq
awhie - 2 freq
a'hae - 1 freq
aho - 1 freq
a-ha - 3 freq
a-hah - 2 freq
ahie - 1 freq
ah-hah - 1 freq
AAAAHOO
Time to execute Levenshtein function - 0.173950 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.338066 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028381 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037401 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000900 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.