A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to ýae in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
ýae (0) - 1 freq
'gae (2) - 3 freq
anae (2) - 1 freq
grae (2) - 4 freq
dae (2) - 4565 freq
irae (2) - 1 freq
gae (2) - 503 freq
'ae (2) - 83 freq
soae (2) - 1 freq
umae (2) - 1 freq
thae (2) - 1233 freq
fae (2) - 9131 freq
nae (2) - 9872 freq
sae (2) - 4672 freq
'yae (2) - 2 freq
mae (2) - 361 freq
awae (2) - 22 freq
'fae (2) - 4 freq
trae (2) - 1 freq
chae (2) - 7 freq
adae (2) - 139 freq
plae (2) - 1 freq
brae (2) - 285 freq
frae (2) - 4566 freq
lae (2) - 28 freq
ýae (0) - 1 freq
ée (3) - 1 freq
ðe (3) - 4 freq
þe (3) - 4 freq
tae (4) - 65006 freq
zrae (4) - 1 freq
biae (4) - 1 freq
flae (4) - 2 freq
apae (4) - 4 freq
wae (4) - 2590 freq
'mae (4) - 1 freq
axae (4) - 1 freq
'dae (4) - 55 freq
iae (4) - 5 freq
agae (4) - 2 freq
'tae (4) - 47 freq
ae (4) - 4 freq
ae (4) - 12 freq
ae (4) - 2 freq
stae (4) - 1 freq
jae (4) - 2 freq
shae (4) - 853 freq
d'ae (4) - 1 freq
-sae (4) - 1 freq
½ (4) - 1 freq
SoundEx code - A000
a - 92602 freq
awa - 4217 freq
aw - 8237 freq
aye - 6435 freq
'aiya - 1 freq
'aye - 306 freq
away - 766 freq
ae - 5555 freq
aa - 7151 freq
ah - 17377 freq
'awa - 22 freq
'aw - 69 freq
'a - 285 freq
ay - 2481 freq
-aye - 3 freq
awey - 180 freq
aea - 1 freq
a- - 4 freq
ah- - 4 freq
'ah - 382 freq
awee - 5 freq
awaa - 103 freq
'ae - 83 freq
awe - 396 freq
'awww - 2 freq
'ay - 112 freq
'ay' - 7 freq
aawey - 6 freq
- 2 freq
'ahhh - 1 freq
'ahh - 2 freq
a' - 459 freq
awa' - 25 freq
aawey' - 1 freq
'awwwwwwww - 1 freq
'awwwww-hawwwwww - 1 freq
'awwwwww - 1 freq
'awwwwwww - 2 freq
'aww - 2 freq
-a - 2 freq
aa' - 7 freq
'aa - 16 freq
a-a-ah - 1 freq
ah-ah-ah - 1 freq
aa- - 1 freq
ai - 30 freq
'aye' - 12 freq
aawye - 42 freq
au - 16 freq
a-wye - 1 freq
a-wee - 3 freq
awae - 22 freq
awiy - 22 freq
'away - 3 freq
'a' - 23 freq
'aa' - 3 freq
aiy - 4 freq
'aye'' - 1 freq
aaaa - 2 freq
awuiy - 1 freq
ahaa - 1 freq
aua - 1 freq
aha - 8 freq
aye¥ - 1 freq
ae' - 1 freq
ah' - 23 freq
ayyyyyyyyy - 1 freq
ayyyyyy - 1 freq
ahhhhhhhh - 1 freq
ahh - 9 freq
ayyyye - 1 freq
ahhh - 5 freq
ahh' - 1 freq
a-e - 1 freq
awoa - 1 freq
ah'y - 3 freq
aaa - 1 freq
aaaahoo - 1 freq
ah'ii - 2 freq
¢a - 1 freq
ah-ah - 1 freq
ah'ye - 1 freq
awah - 14 freq
aah - 6 freq
-ay - 1 freq
a'y - 1 freq
ah-ha - 3 freq
'aha - 1 freq
awww - 25 freq
ahah - 1 freq
ay¢ - 1 freq
ahahah - 1 freq
ay-ay - 1 freq
aay - 1 freq
ahae - 1 freq
'ah-ha - 2 freq
awhe - 2 freq
awhie - 2 freq
au' - 1 freq
a'hae - 1 freq
away' - 1 freq
awae' - 1 freq
'awa' - 1 freq
awwww - 22 freq
ah'i - 1 freq
aye' - 1 freq
a'h - 2 freq
aho - 1 freq
aw' - 83 freq
'ahhhhhhh - 1 freq
awo - 1 freq
a-ha - 3 freq
'a-aye - 1 freq
--aw - 1 freq
awaw - 72 freq
'ah-h-h-h-h-h - 1 freq
aww - 55 freq
aaaaah - 3 freq
a'aa - 2 freq
aye-aye - 4 freq
awaey - 4 freq
a-hah - 2 freq
a'wie - 1 freq
aaaah - 1 freq
a'i' - 3 freq
aa'wie - 1 freq
aai' - 1 freq
awye - 13 freq
'aaaaw' - 1 freq
aa'wye - 1 freq
awy - 13 freq
aweiy-' - 1 freq
aweiy - 8 freq
a-a - 1 freq
a - 2 freq
a - 1 freq
aye-oh - 1 freq
'ae' - 1 freq
ahie - 1 freq
awwwwww - 3 freq
ahhhhhh - 2 freq
'away' - 1 freq
aye - 85 freq
ah - 240 freq
awa - 2 freq
aw - 19 freq
a - 73 freq
ah - 123 freq
ay - 9 freq
aye - 62 freq
aa - 23 freq
a - 20 freq
a - 185 freq
ay - 91 freq
aa - 7 freq
aw-w-w - 1 freq
a - 6 freq
awà - 12 freq
a'a - 77 freq
awie - 1 freq
a - 1 freq
awaw - 1 freq
awa - 8 freq
aoww - 1 freq
ai - 1 freq
ai - 1 freq
aw - 11 freq
aw - 2 freq
awh - 6 freq
ae - 12 freq
aaahhh - 1 freq
awaiy - 4 freq
away - 5 freq
ahh - 1 freq
aw - 2 freq
ae - 4 freq
ayee - 8 freq
aeway - 1 freq
a - 3 freq
aaww - 1 freq
a'e - 1 freq
ay' - 4 freq
awwwwww - 1 freq
a - 1 freq
ao - 2 freq
ao - 1 freq
ay - 1 freq
ay- - 1 freq
awyo - 1 freq
awww - 1 freq
ay-y-y - 2 freq
ah-hah - 1 freq
aaahh - 1 freq
aw-wey - 2 freq
aye - 3 freq
ah - 5 freq
-ae - 1 freq
ae - 2 freq
awiye - 1 freq
awei - 1 freq
ah - 32 freq
aye - 12 freq
“ah - 4 freq
aeaw - 1 freq
ao - 1 freq
auy - 1 freq
- 13 freq
a'wi - 1 freq
aaue - 1 freq
a’w - 1 freq
“awa - 2 freq
aey - 4 freq
“awww” - 1 freq
“a - 5 freq
awwwwwww - 2 freq
a’i - 1 freq
'aw' - 1 freq
aye” - 1 freq
ayw - 1 freq
ahhhhh - 2 freq
awwwww - 7 freq
awé - 1 freq
awewwwwww - 1 freq
ahhhhhhhhhhh - 1 freq
‘aye - 2 freq
“away” - 1 freq
“aye - 2 freq
“aye” - 1 freq
“aw - 2 freq
ahhhh - 2 freq
awa’ - 1 freq
awi - 1 freq
ýae - 1 freq
ayui - 1 freq
aoa - 1 freq
aaaaaaa - 1 freq
aa’ - 4 freq
away” - 1 freq
‘a’ - 1 freq
MetaPhone code - E
eh - 1202 freq
ae - 5555 freq
ee - 1592 freq
e - 4634 freq
'ee - 6 freq
ee' - 4 freq
eo - 6 freq
aea - 1 freq
eh- - 1 freq
ehh - 1 freq
ey - 141 freq
'ae - 83 freq
'eh - 21 freq
'e - 63 freq
eaa - 1 freq
'eehh - 1 freq
ew - 5 freq
eih - 51 freq
eee - 3 freq
ae' - 1 freq
e'e - 20 freq
e' - 89 freq
ehhh - 2 freq
eu - 126 freq
ei - 102 freq
ee-aw - 1 freq
'-e - 1 freq
ée - 1 freq
e- - 1 freq
e - 1 freq
'ee' - 1 freq
þe - 4 freq
'ea' - 1 freq
'ae' - 1 freq
''e - 1 freq
'ee-o' - 1 freq
e - 16 freq
eh - 15 freq
ea - 4 freq
ðe - 4 freq
'ey - 1 freq
e - 12 freq
ee - 1 freq
ei- - 1 freq
eh - 10 freq
ae - 12 freq
eeeeeee - 1 freq
eeeeeeeee - 1 freq
eeeeeeeeee - 1 freq
eeeeeeee - 1 freq
ae - 4 freq
-ae - 1 freq
ae - 2 freq
ee - 1 freq
e - 1 freq
eh - 1 freq
aeaw - 1 freq
‘e - 4 freq
eh' - 1 freq
- 8 freq
euy - 1 freq
aey - 4 freq
“eh - 1 freq
eau - 2 freq
e'' - 1 freq
eeh - 1 freq
ewwwww - 1 freq
eea - 1 freq
ýae - 1 freq
eew - 1 freq
eah - 1 freq
‘e’ - 1 freq
ÝAE
Time to execute Levenshtein function - 0.280078 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.523104 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.069535 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.049048 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000878 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.