A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to adieu in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
adieu (0) - 1 freq
adie (1) - 16 freq
dieu (1) - 2 freq
adieus (1) - 1 freq
sadie (2) - 25 freq
'diel (2) - 1 freq
andied (2) - 1 freq
jadie (2) - 4 freq
dien (2) - 1 freq
die' (2) - 1 freq
ahie (2) - 1 freq
abie (2) - 1 freq
abies (2) - 3 freq
ariel (2) - 5 freq
adio (2) - 1 freq
gadie (2) - 5 freq
alie (2) - 1 freq
adaes (2) - 1 freq
adem (2) - 1 freq
ladies (2) - 51 freq
die (2) - 122 freq
awie (2) - 1 freq
diet (2) - 70 freq
addie (2) - 6 freq
dies (2) - 5 freq
adieu (0) - 1 freq
dieu (1) - 2 freq
adie (1) - 16 freq
eadie (2) - 1 freq
ade (2) - 3 freq
deu (2) - 40 freq
adee (2) - 30 freq
aedie (2) - 1 freq
adieus (2) - 1 freq
adae (2) - 139 freq
die (2) - 122 freq
adio (2) - 1 freq
ady (3) - 2 freq
di (3) - 68 freq
doei (3) - 1 freq
adaya (3) - 1 freq
aed (3) - 2 freq
audio (3) - 29 freq
duu (3) - 1 freq
dia (3) - 2 freq
aday (3) - 4 freq
edi (3) - 1 freq
edu (3) - 3 freq
aide (3) - 1 freq
dii (3) - 1 freq
SoundEx code - A300
at - 20079 freq
aed - 2 freq
ate - 115 freq
aat - 852 freq
aet - 102 freq
a'd - 168 freq
ah'd - 508 freq
add - 133 freq
at- - 1 freq
aheid - 189 freq
ahd - 19 freq
ahead - 80 freq
'at - 357 freq
await - 7 freq
adae - 139 freq
'a'd - 3 freq
adee - 30 freq
ad - 126 freq
aht - 27 freq
aid - 37 freq
ado - 4 freq
'ah'd - 13 freq
at'd - 4 freq
ada - 4 freq
aat' - 1 freq
aw-day - 2 freq
aa'd - 2 freq
awtho - 5 freq
atho - 3 freq
awthou - 1 freq
adio - 1 freq
ata - 58 freq
ait - 138 freq
aith - 23 freq
atth - 1 freq
aite - 4 freq
aud - 32 freq
ahaud - 5 freq
aty - 2 freq
awta - 1 freq
ayedeea - 1 freq
aydea - 1 freq
'ahd - 1 freq
aheed - 5 freq
ata' - 6 freq
adieu - 1 freq
ahid - 2 freq
ah'da - 2 freq
ati - 16 freq
ah-ta - 1 freq
ahied - 1 freq
ataw - 7 freq
aaid - 2 freq
addi - 3 freq
awtie - 1 freq
aat'd - 5 freq
ahead' - 1 freq
att - 451 freq
att' - 2 freq
'att - 1 freq
aaud - 1 freq
a'da - 9 freq
aa'at - 1 freq
aedie - 1 freq
'aedie - 1 freq
adö - 1 freq
aet' - 1 freq
adie - 16 freq
audio - 29 freq
'add - 1 freq
aathou - 1 freq
'at'd - 1 freq
atthe - 2 freq
awte - 1 freq
atey - 1 freq
a'hæt - 1 freq
ati'aa - 1 freq
awid - 2 freq
ataa - 7 freq
-at - 1 freq
awety - 1 freq
ady - 2 freq
þat - 2 freq
aat- - 1 freq
awed - 3 freq
'ad - 1 freq
ataw' - 1 freq
ahaed - 1 freq
€œat - 7 freq
€˜aat - 1 freq
adaya - 1 freq
€˜at - 44 freq
ae-twae - 1 freq
ðat - 1 freq
€™at - 40 freq
auto - 5 freq
ahoot - 1 freq
attie - 3 freq
ae-day - 1 freq
awday - 2 freq
ayday - 1 freq
audi - 1 freq
€™ad - 1 freq
atae - 1 freq
aatho - 2 freq
addie - 6 freq
€”at - 1 freq
ae-twa - 2 freq
addy - 6 freq
aweet - 1 freq
aide - 1 freq
atwà - 2 freq
aaht - 1 freq
€˜att - 1 freq
aidh - 1 freq
atÂ’a - 1 freq
at' - 1 freq
a't - 1 freq
aÂ’day - 1 freq
ahÂ’d - 9 freq
at” - 1 freq
aday - 4 freq
awud - 1 freq
ahdh - 1 freq
‘at - 6 freq
a'day - 7 freq
a'the - 3 freq
ah't - 11 freq
aÂ’d - 2 freq
awd - 1 freq
ade - 3 freq
“at - 1 freq
aeiot - 1 freq
adda - 4 freq
'ate - 1 freq
MetaPhone code - AT
at - 20079 freq
ate - 115 freq
aat - 852 freq
a'd - 168 freq
ah'd - 508 freq
add - 133 freq
at- - 1 freq
ahd - 19 freq
'at - 357 freq
adae - 139 freq
'a'd - 3 freq
adee - 30 freq
ad - 126 freq
aht - 27 freq
aid - 37 freq
ado - 4 freq
'ah'd - 13 freq
ada - 4 freq
aat' - 1 freq
aw-day - 2 freq
aa'd - 2 freq
adio - 1 freq
ata - 58 freq
ait - 138 freq
atth - 1 freq
aite - 4 freq
aud - 32 freq
aty - 2 freq
awta - 1 freq
aydea - 1 freq
'ahd - 1 freq
ata' - 6 freq
adieu - 1 freq
ah'da - 2 freq
ati - 16 freq
ah-ta - 1 freq
ataw - 7 freq
aaid - 2 freq
addi - 3 freq
awtie - 1 freq
att - 451 freq
att' - 2 freq
'att - 1 freq
aaud - 1 freq
a'da - 9 freq
aa'at - 1 freq
adö - 1 freq
adie - 16 freq
audio - 29 freq
'add - 1 freq
atthe - 2 freq
awte - 1 freq
atey - 1 freq
a'hæt - 1 freq
ati'aa - 1 freq
ataa - 7 freq
-at - 1 freq
ady - 2 freq
þat - 2 freq
aat- - 1 freq
'ad - 1 freq
ataw' - 1 freq
€œat - 7 freq
€˜aat - 1 freq
€˜at - 44 freq
ðat - 1 freq
€™at - 40 freq
auto - 5 freq
attie - 3 freq
awday - 2 freq
ayday - 1 freq
audi - 1 freq
€™ad - 1 freq
atae - 1 freq
addie - 6 freq
€”at - 1 freq
addy - 6 freq
aide - 1 freq
atwà - 2 freq
aaht - 1 freq
€˜att - 1 freq
aidh - 1 freq
atÂ’a - 1 freq
at' - 1 freq
a't - 1 freq
aÂ’day - 1 freq
ahÂ’d - 9 freq
at” - 1 freq
aday - 4 freq
ahdh - 1 freq
‘at - 6 freq
a'day - 7 freq
ah't - 11 freq
aÂ’d - 2 freq
awd - 1 freq
ade - 3 freq
“at - 1 freq
adda - 4 freq
'ate - 1 freq
ADIEU
Time to execute Levenshtein function - 0.247062 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.384227 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.034388 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.047056 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000820 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.