A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to adae in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
adae (0) - 139 freq
adaes (1) - 1 freq
ade (1) - 3 freq
dae (1) - 4560 freq
anae (1) - 1 freq
agae (1) - 2 freq
ahae (1) - 1 freq
atae (1) - 1 freq
ada (1) - 4 freq
adee (1) - 30 freq
awae (1) - 22 freq
adam (1) - 189 freq
axae (1) - 1 freq
adage (1) - 2 freq
adie (1) - 16 freq
'dae (1) - 55 freq
apae (1) - 4 freq
aday (1) - 4 freq
ape (2) - 5 freq
adz (2) - 1 freq
agam (2) - 1 freq
hae (2) - 8031 freq
ad's (2) - 1 freq
dge (2) - 1 freq
gade (2) - 1 freq
adae (0) - 139 freq
adie (1) - 16 freq
aday (1) - 4 freq
ada (1) - 4 freq
adee (1) - 30 freq
dae (1) - 4560 freq
ade (1) - 3 freq
aedie (2) - 1 freq
due (2) - 177 freq
ad (2) - 126 freq
oda (2) - 5 freq
ado (2) - 4 freq
adieu (2) - 1 freq
adio (2) - 1 freq
da (2) - 9788 freq
dai (2) - 2 freq
eday (2) - 5 freq
die (2) - 122 freq
iday (2) - 19 freq
dee (2) - 1209 freq
aide (2) - 1 freq
daie (2) - 3 freq
ide (2) - 3 freq
dau (2) - 1 freq
ady (2) - 2 freq
SoundEx code - A300
at - 20353 freq
aed - 2 freq
ate - 115 freq
aat - 852 freq
aet - 102 freq
a'd - 168 freq
ah'd - 540 freq
add - 137 freq
at- - 1 freq
aheid - 193 freq
ahd - 15 freq
ahead - 84 freq
'at - 358 freq
await - 7 freq
adae - 139 freq
'a'd - 3 freq
adee - 30 freq
ad - 126 freq
aht - 27 freq
aid - 37 freq
ado - 4 freq
'ah'd - 13 freq
at'd - 4 freq
ada - 4 freq
aat' - 1 freq
aw-day - 2 freq
aa'd - 2 freq
awtho - 5 freq
atho - 3 freq
awthou - 1 freq
adio - 1 freq
ata - 58 freq
ait - 138 freq
aith - 23 freq
atth - 1 freq
aite - 4 freq
aud - 32 freq
ahaud - 5 freq
aty - 2 freq
awta - 1 freq
ayedeea - 1 freq
aydea - 1 freq
'ahd - 1 freq
aheed - 6 freq
ata' - 6 freq
adieu - 1 freq
auid - 1 freq
ahid - 2 freq
ah'da - 2 freq
ati - 16 freq
ah-ta - 1 freq
ahied - 1 freq
ataw - 8 freq
aaid - 2 freq
addi - 3 freq
awtie - 1 freq
aat'd - 5 freq
ahead' - 1 freq
att - 451 freq
att' - 2 freq
'att - 1 freq
aaud - 1 freq
a'da - 9 freq
aa'at - 1 freq
aedie - 1 freq
'aedie - 1 freq
adö - 1 freq
aet' - 1 freq
adie - 16 freq
audio - 29 freq
'add - 1 freq
aathou - 1 freq
'at'd - 1 freq
atthe - 2 freq
awte - 1 freq
atey - 1 freq
a'hæt - 1 freq
ati'aa - 1 freq
awid - 2 freq
ataa - 7 freq
-at - 1 freq
awety - 1 freq
ady - 2 freq
þat - 2 freq
aat- - 1 freq
awed - 3 freq
'ad - 1 freq
ataw' - 1 freq
ahaed - 1 freq
€œat - 7 freq
€˜aat - 1 freq
adaya - 1 freq
€˜at - 44 freq
ae-twae - 1 freq
ðat - 1 freq
€™at - 40 freq
auto - 5 freq
ahoot - 1 freq
attie - 3 freq
ae-day - 1 freq
awday - 2 freq
ayday - 1 freq
audi - 1 freq
€™ad - 1 freq
atae - 1 freq
aatho - 2 freq
addie - 6 freq
€”at - 1 freq
ae-twa - 2 freq
addy - 6 freq
aweet - 1 freq
aide - 1 freq
atwà - 2 freq
aaht - 1 freq
€˜att - 1 freq
aidh - 1 freq
atÂ’a - 1 freq
at' - 1 freq
a't - 1 freq
aÂ’day - 1 freq
ahÂ’d - 9 freq
at” - 1 freq
aday - 4 freq
awud - 1 freq
ahdh - 1 freq
‘at - 6 freq
a'day - 7 freq
a'the - 3 freq
ah't - 11 freq
aÂ’d - 2 freq
awd - 1 freq
ade - 3 freq
“at - 1 freq
aeiot - 1 freq
adda - 4 freq
'ate - 1 freq
MetaPhone code - AT
at - 20353 freq
ate - 115 freq
aat - 852 freq
a'd - 168 freq
ah'd - 540 freq
add - 137 freq
at- - 1 freq
ahd - 15 freq
'at - 358 freq
adae - 139 freq
'a'd - 3 freq
adee - 30 freq
ad - 126 freq
aht - 27 freq
aid - 37 freq
ado - 4 freq
'ah'd - 13 freq
ada - 4 freq
aat' - 1 freq
aw-day - 2 freq
aa'd - 2 freq
adio - 1 freq
ata - 58 freq
ait - 138 freq
atth - 1 freq
aite - 4 freq
aud - 32 freq
aty - 2 freq
awta - 1 freq
aydea - 1 freq
'ahd - 1 freq
ata' - 6 freq
adieu - 1 freq
auid - 1 freq
ah'da - 2 freq
ati - 16 freq
ah-ta - 1 freq
ataw - 8 freq
aaid - 2 freq
addi - 3 freq
awtie - 1 freq
att - 451 freq
att' - 2 freq
'att - 1 freq
aaud - 1 freq
a'da - 9 freq
aa'at - 1 freq
adö - 1 freq
adie - 16 freq
audio - 29 freq
'add - 1 freq
atthe - 2 freq
awte - 1 freq
atey - 1 freq
a'hæt - 1 freq
ati'aa - 1 freq
ataa - 7 freq
-at - 1 freq
ady - 2 freq
þat - 2 freq
aat- - 1 freq
'ad - 1 freq
ataw' - 1 freq
€œat - 7 freq
€˜aat - 1 freq
€˜at - 44 freq
ðat - 1 freq
€™at - 40 freq
auto - 5 freq
attie - 3 freq
awday - 2 freq
ayday - 1 freq
audi - 1 freq
€™ad - 1 freq
atae - 1 freq
addie - 6 freq
€”at - 1 freq
addy - 6 freq
aide - 1 freq
atwà - 2 freq
aaht - 1 freq
€˜att - 1 freq
aidh - 1 freq
atÂ’a - 1 freq
at' - 1 freq
a't - 1 freq
aÂ’day - 1 freq
ahÂ’d - 9 freq
at” - 1 freq
aday - 4 freq
ahdh - 1 freq
‘at - 6 freq
a'day - 7 freq
ah't - 11 freq
aÂ’d - 2 freq
awd - 1 freq
ade - 3 freq
“at - 1 freq
adda - 4 freq
'ate - 1 freq
ADAE
Time to execute Levenshtein function - 0.195753 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.330821 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027967 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.038346 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001140 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.