A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to aat in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
aat (0) - 852 freq
aas (1) - 6 freq
alt (1) - 5 freq
aab (1) - 3 freq
saat (1) - 29 freq
maat (1) - 1 freq
aap (1) - 26 freq
aa (1) - 7129 freq
aah (1) - 6 freq
dat (1) - 1391 freq
aam (1) - 9 freq
aaf (1) - 23 freq
mat (1) - 23 freq
vat (1) - 12 freq
-at (1) - 1 freq
bat (1) - 50 freq
ait (1) - 138 freq
at (1) - 20353 freq
cat (1) - 567 freq
aal (1) - 755 freq
zat (1) - 11 freq
apt (1) - 45 freq
aalt (1) - 2 freq
nat (1) - 21 freq
art (1) - 92 freq
aat (0) - 852 freq
at (1) - 20353 freq
oat (1) - 12 freq
aet (1) - 102 freq
ait (1) - 138 freq
eat (1) - 466 freq
yat (1) - 1 freq
yet (2) - 1015 freq
atae (2) - 1 freq
ati (2) - 16 freq
aft (2) - 68 freq
et (2) - 256 freq
tae (2) - 64799 freq
tay (2) - 186 freq
tau (2) - 2 freq
yit (2) - 500 freq
fat (2) - 296 freq
jat (2) - 1 freq
ant (2) - 8 freq
faat (2) - 15 freq
aite (2) - 4 freq
a't (2) - 1 freq
aa' (2) - 7 freq
aa (2) - 7129 freq
alt (2) - 5 freq
SoundEx code - A300
at - 20353 freq
aed - 2 freq
ate - 115 freq
aat - 852 freq
aet - 102 freq
a'd - 168 freq
ah'd - 540 freq
add - 137 freq
at- - 1 freq
aheid - 193 freq
ahd - 15 freq
ahead - 84 freq
'at - 358 freq
await - 7 freq
adae - 139 freq
'a'd - 3 freq
adee - 30 freq
ad - 126 freq
aht - 27 freq
aid - 37 freq
ado - 4 freq
'ah'd - 13 freq
at'd - 4 freq
ada - 4 freq
aat' - 1 freq
aw-day - 2 freq
aa'd - 2 freq
awtho - 5 freq
atho - 3 freq
awthou - 1 freq
adio - 1 freq
ata - 58 freq
ait - 138 freq
aith - 23 freq
atth - 1 freq
aite - 4 freq
aud - 32 freq
ahaud - 5 freq
aty - 2 freq
awta - 1 freq
ayedeea - 1 freq
aydea - 1 freq
'ahd - 1 freq
aheed - 6 freq
ata' - 6 freq
adieu - 1 freq
auid - 1 freq
ahid - 2 freq
ah'da - 2 freq
ati - 16 freq
ah-ta - 1 freq
ahied - 1 freq
ataw - 8 freq
aaid - 2 freq
addi - 3 freq
awtie - 1 freq
aat'd - 5 freq
ahead' - 1 freq
att - 451 freq
att' - 2 freq
'att - 1 freq
aaud - 1 freq
a'da - 9 freq
aa'at - 1 freq
aedie - 1 freq
'aedie - 1 freq
adö - 1 freq
aet' - 1 freq
adie - 16 freq
audio - 29 freq
'add - 1 freq
aathou - 1 freq
'at'd - 1 freq
atthe - 2 freq
awte - 1 freq
atey - 1 freq
a'hæt - 1 freq
ati'aa - 1 freq
awid - 2 freq
ataa - 7 freq
-at - 1 freq
awety - 1 freq
ady - 2 freq
þat - 2 freq
aat- - 1 freq
awed - 3 freq
'ad - 1 freq
ataw' - 1 freq
ahaed - 1 freq
€œat - 7 freq
€˜aat - 1 freq
adaya - 1 freq
€˜at - 44 freq
ae-twae - 1 freq
ðat - 1 freq
€™at - 40 freq
auto - 5 freq
ahoot - 1 freq
attie - 3 freq
ae-day - 1 freq
awday - 2 freq
ayday - 1 freq
audi - 1 freq
€™ad - 1 freq
atae - 1 freq
aatho - 2 freq
addie - 6 freq
€”at - 1 freq
ae-twa - 2 freq
addy - 6 freq
aweet - 1 freq
aide - 1 freq
atwà - 2 freq
aaht - 1 freq
€˜att - 1 freq
aidh - 1 freq
atÂ’a - 1 freq
at' - 1 freq
a't - 1 freq
aÂ’day - 1 freq
ahÂ’d - 9 freq
at” - 1 freq
aday - 4 freq
awud - 1 freq
ahdh - 1 freq
‘at - 6 freq
a'day - 7 freq
a'the - 3 freq
ah't - 11 freq
aÂ’d - 2 freq
awd - 1 freq
ade - 3 freq
“at - 1 freq
aeiot - 1 freq
adda - 4 freq
'ate - 1 freq
MetaPhone code - AT
at - 20353 freq
ate - 115 freq
aat - 852 freq
a'd - 168 freq
ah'd - 540 freq
add - 137 freq
at- - 1 freq
ahd - 15 freq
'at - 358 freq
adae - 139 freq
'a'd - 3 freq
adee - 30 freq
ad - 126 freq
aht - 27 freq
aid - 37 freq
ado - 4 freq
'ah'd - 13 freq
ada - 4 freq
aat' - 1 freq
aw-day - 2 freq
aa'd - 2 freq
adio - 1 freq
ata - 58 freq
ait - 138 freq
atth - 1 freq
aite - 4 freq
aud - 32 freq
aty - 2 freq
awta - 1 freq
aydea - 1 freq
'ahd - 1 freq
ata' - 6 freq
adieu - 1 freq
auid - 1 freq
ah'da - 2 freq
ati - 16 freq
ah-ta - 1 freq
ataw - 8 freq
aaid - 2 freq
addi - 3 freq
awtie - 1 freq
att - 451 freq
att' - 2 freq
'att - 1 freq
aaud - 1 freq
a'da - 9 freq
aa'at - 1 freq
adö - 1 freq
adie - 16 freq
audio - 29 freq
'add - 1 freq
atthe - 2 freq
awte - 1 freq
atey - 1 freq
a'hæt - 1 freq
ati'aa - 1 freq
ataa - 7 freq
-at - 1 freq
ady - 2 freq
þat - 2 freq
aat- - 1 freq
'ad - 1 freq
ataw' - 1 freq
€œat - 7 freq
€˜aat - 1 freq
€˜at - 44 freq
ðat - 1 freq
€™at - 40 freq
auto - 5 freq
attie - 3 freq
awday - 2 freq
ayday - 1 freq
audi - 1 freq
€™ad - 1 freq
atae - 1 freq
addie - 6 freq
€”at - 1 freq
addy - 6 freq
aide - 1 freq
atwà - 2 freq
aaht - 1 freq
€˜att - 1 freq
aidh - 1 freq
atÂ’a - 1 freq
at' - 1 freq
a't - 1 freq
aÂ’day - 1 freq
ahÂ’d - 9 freq
at” - 1 freq
aday - 4 freq
ahdh - 1 freq
‘at - 6 freq
a'day - 7 freq
ah't - 11 freq
aÂ’d - 2 freq
awd - 1 freq
ade - 3 freq
“at - 1 freq
adda - 4 freq
'ate - 1 freq
AAT
that - 27031 freq
tha - 6295 freq
at - 20353 freq
thats - 200 freq
that's - 2198 freq
dat - 1391 freq
aat - 852 freq
aat's - 173 freq
dat's - 97 freq
att - 451 freq
at's - 423 freq
thit - 568 freq
Time to execute Levenshtein function - 0.181657 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.321331 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027745 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.038322 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001020 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.