A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to ae in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
ae (0) - 5537 freq
pe (1) - 24 freq
aes (1) - 4 freq
rae (1) - 13 freq
ale (1) - 51 freq
ace (1) - 19 freq
gae (1) - 501 freq
axe (1) - 22 freq
ie (1) - 40 freq
e (1) - 1 freq
je (1) - 10 freq
ael (1) - 1 freq
cae (1) - 7 freq
ane (1) - 2115 freq
sae (1) - 4643 freq
az (1) - 1 freq
awe (1) - 396 freq
e (1) - 4627 freq
aet (1) - 102 freq
qe (1) - 5 freq
de (1) - 260 freq
abe (1) - 3 freq
be (1) - 14795 freq
av (1) - 161 freq
aj (1) - 6 freq
ae (0) - 5537 freq
a (1) - 91162 freq
ai (1) - 29 freq
e (1) - 4627 freq
aa (1) - 7091 freq
ue (1) - 2 freq
ye (1) - 20449 freq
iae (1) - 5 freq
ay (1) - 2131 freq
aye (1) - 6376 freq
ee (1) - 1590 freq
ao (1) - 1 freq
aea (1) - 1 freq
yae (1) - 1059 freq
oe (1) - 13 freq
ie (1) - 40 freq
aey (1) - 4 freq
au (1) - 16 freq
eau (2) - 2 freq
yy (2) - 1 freq
oye (2) - 1 freq
oa (2) - 14 freq
eye (2) - 224 freq
ii (2) - 69 freq
y (2) - 154 freq
SoundEx code - A000
a - 91162 freq
awa - 4197 freq
aw - 8032 freq
aye - 6376 freq
'aiya - 1 freq
'aye - 304 freq
away - 732 freq
ae - 5537 freq
aa - 7091 freq
ah - 16973 freq
'awa - 22 freq
'aw - 68 freq
'a - 282 freq
ay - 2131 freq
-aye - 3 freq
awey - 180 freq
aea - 1 freq
a- - 4 freq
ah- - 4 freq
'ah - 377 freq
awee - 5 freq
awaa - 99 freq
'ae - 8 freq
awe - 396 freq
'awww - 2 freq
'ay - 112 freq
'ay' - 7 freq
aawey - 6 freq
- 2 freq
'ahhh - 1 freq
'ahh - 2 freq
a' - 396 freq
awa' - 24 freq
aawey' - 1 freq
'awwwwwwww - 1 freq
'awwwww-hawwwwww - 1 freq
'awwwwww - 1 freq
'awwwwwww - 2 freq
'aww - 1 freq
-a - 2 freq
aa' - 7 freq
'aa - 16 freq
a-a-ah - 1 freq
ah-ah-ah - 1 freq
aa- - 1 freq
ai - 29 freq
'aye' - 12 freq
aawye - 41 freq
au - 16 freq
a-wye - 1 freq
a-wee - 3 freq
awae - 22 freq
awiy - 8 freq
'away - 3 freq
'a' - 23 freq
'aa' - 3 freq
aiy - 4 freq
'aye'' - 1 freq
aaaa - 2 freq
awuiy - 1 freq
ahaa - 1 freq
aua - 1 freq
aha - 8 freq
aye¥ - 1 freq
ae' - 1 freq
ah' - 23 freq
awoa - 1 freq
ah'y - 3 freq
aaa - 1 freq
aaaahoo - 1 freq
ah'ii - 2 freq
¢a - 1 freq
ah-ah - 1 freq
ah'ye - 1 freq
awah - 14 freq
aah - 6 freq
ahhh - 4 freq
-ay - 1 freq
a'y - 1 freq
ah-ha - 3 freq
'aha - 1 freq
awww - 25 freq
ahah - 1 freq
ay¢ - 1 freq
ahahah - 1 freq
ay-ay - 1 freq
aay - 1 freq
ahae - 1 freq
'ah-ha - 2 freq
awhe - 2 freq
awhie - 2 freq
au' - 1 freq
a'hae - 1 freq
away' - 1 freq
awae' - 1 freq
'awa' - 1 freq
awwww - 22 freq
ah'i - 1 freq
aye' - 1 freq
a'h - 2 freq
aho - 1 freq
aw' - 83 freq
ahh - 7 freq
'ahhhhhhh - 1 freq
awo - 1 freq
a-ha - 3 freq
'a-aye - 1 freq
--aw - 1 freq
awaw - 72 freq
'ah-h-h-h-h-h - 1 freq
aww - 55 freq
aaaaah - 3 freq
a'aa - 2 freq
aye-aye - 4 freq
awaey - 4 freq
a-hah - 2 freq
a'wie - 1 freq
aaaah - 1 freq
a'i' - 3 freq
aa'wie - 1 freq
aai' - 1 freq
awye - 13 freq
'aaaaw' - 1 freq
aa'wye - 1 freq
awy - 13 freq
aweiy-' - 1 freq
aweiy - 8 freq
a-a - 1 freq
a - 2 freq
a - 1 freq
aye-oh - 1 freq
'ae' - 1 freq
ahie - 1 freq
awwwwww - 3 freq
ahhhhhh - 2 freq
'away' - 1 freq
aye - 85 freq
ah - 239 freq
awa - 2 freq
aw - 19 freq
a - 73 freq
ah - 123 freq
ay - 9 freq
aye - 54 freq
aa - 23 freq
a - 20 freq
a - 185 freq
ay - 91 freq
aa - 7 freq
aw-w-w - 1 freq
a - 6 freq
awà - 12 freq
a'a - 77 freq
awie - 1 freq
a - 1 freq
awaw - 1 freq
awa - 8 freq
aoww - 1 freq
ai - 1 freq
ai - 1 freq
aw - 8 freq
aw - 2 freq
awh - 6 freq
ae - 12 freq
aaahhh - 1 freq
awaiy - 4 freq
away - 5 freq
ahh - 1 freq
aw - 2 freq
ae - 4 freq
ayee - 8 freq
aeway - 1 freq
a - 3 freq
aaww - 1 freq
a'e - 1 freq
ay' - 4 freq
awwwwww - 1 freq
a - 1 freq
ao - 2 freq
ao - 1 freq
ay - 1 freq
ay- - 1 freq
awyo - 1 freq
awww - 1 freq
ay-y-y - 2 freq
ah-hah - 1 freq
aaahh - 1 freq
aw-wey - 2 freq
aye - 3 freq
ah - 5 freq
-ae - 1 freq
ae - 2 freq
awiye - 1 freq
awei - 1 freq
ah - 32 freq
aye - 12 freq
“ah - 4 freq
aeaw - 1 freq
ao - 1 freq
auy - 1 freq
- 13 freq
a'wi - 1 freq
aaue - 1 freq
a’w - 1 freq
“awa - 2 freq
aey - 4 freq
“awww” - 1 freq
“a - 5 freq
awwwwwww - 2 freq
a’i - 1 freq
'aw' - 1 freq
aye” - 1 freq
ayw - 1 freq
ahhhhh - 2 freq
awwwww - 7 freq
awé - 1 freq
awewwwwww - 1 freq
ahhhhhhhhhhh - 1 freq
‘aye - 2 freq
“away” - 1 freq
“aye - 2 freq
“aye” - 1 freq
“aw - 2 freq
ahhhh - 2 freq
awa’ - 1 freq
awi - 1 freq
ýae - 1 freq
ayui - 1 freq
aoa - 1 freq
aaaaaaa - 1 freq
aa’ - 4 freq
away” - 1 freq
‘a’ - 1 freq
MetaPhone code - E
eh - 1127 freq
ae - 5537 freq
ee - 1590 freq
e - 4627 freq
'ee - 5 freq
ee' - 4 freq
eo - 6 freq
aea - 1 freq
eh- - 1 freq
ehh - 1 freq
ey - 140 freq
'ae - 8 freq
'eh - 19 freq
'e - 61 freq
eaa - 1 freq
'eehh - 1 freq
ew - 5 freq
eih - 51 freq
eee - 3 freq
ae' - 1 freq
e'e - 19 freq
e' - 88 freq
eu - 126 freq
ei - 102 freq
ee-aw - 1 freq
'-e - 1 freq
ée - 1 freq
e- - 1 freq
e - 1 freq
'ee' - 1 freq
þe - 4 freq
'ea' - 1 freq
'ae' - 1 freq
''e - 1 freq
'ee-o' - 1 freq
e - 16 freq
eh - 15 freq
ea - 4 freq
ðe - 4 freq
'ey - 1 freq
e - 12 freq
ee - 1 freq
ei- - 1 freq
ae - 12 freq
eeeeeee - 1 freq
eeeeeeeee - 1 freq
eeeeeeeeee - 1 freq
eeeeeeee - 1 freq
eh - 9 freq
ae - 4 freq
-ae - 1 freq
ae - 2 freq
ee - 1 freq
e - 1 freq
eh - 1 freq
aeaw - 1 freq
‘e - 4 freq
eh' - 1 freq
- 8 freq
euy - 1 freq
aey - 4 freq
“eh - 1 freq
eau - 2 freq
e'' - 1 freq
eeh - 1 freq
ewwwww - 1 freq
eea - 1 freq
ýae - 1 freq
eew - 1 freq
eah - 1 freq
‘e’ - 1 freq
AE
ae - 5537 freq
aye - 6376 freq
always - 408 freq
aawis - freq
alwis - 23 freq
aawys - freq
aaweys - 5 freq
o - 56035 freq
of - 4012 freq
ae - 5537 freq
o' - 2257 freq
Time to execute Levenshtein function - 0.159902 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.316374 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.029846 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.039769 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001475 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.