A Corpus of 21st Century Scots Texts


Levenshtein Distance



Similar words to ald in Corpus

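Results are listed for each method in turn: Levenshtein, Double Levenshtein, SoundEx, MetaPhone and Manually curated. Figures in brackets are distances from ald; freq is the word's frequency in the corpus.

Levenshtein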
ald (0) - 14 freq
aild (1) - 1 freq
auld (1) - 3449 freq
bald (1) - 16 freq
aly (1) - 5 freq
sld (1) - 9 freq
aid (1) - 37 freq
alf (1) - 7 freq
all (1) - 1278 freq
and (1) - 24890 freq
awd (1) - 1 freq
axd (1) - 1 freq
ayld (1) - 1 freq
a'd (1) - 168 freq
wald (1) - 2 freq
add (1) - 133 freq
ali (1) - 162 freq
ld (1) - 3 freq
aald (1) - 202 freq
al' (1) - 24 freq
bld (1) - 1 freq
nald (1) - 1 freq
aed (1) - 2 freq
awld (1) - 6 freq
ard (1) - 4 freq
Double Levenshtein
ald (0) - 14 freq
aldo (1) - 266 freq
ld (1) - 3 freq
aald (1) - 202 freq
aldi (1) - 9 freq
ayld (1) - 1 freq
eld (1) - 30 freq
old (1) - 178 freq
aild (1) - 1 freq
auld (1) - 3449 freq
lid (2) - 71 freq
laid (2) - 269 freq
lod (2) - 2 freq
eauld (2) - 1 freq
ale (2) - 51 freq
wld (2) - 1 freq
alp (2) - 1 freq
aln (2) - 1 freq
alt (2) - 5 freq
olde (2) - 1 freq
aud (2) - 32 freq
al- (2) - 2 freq
akd (2) - 1 freq
led (2) - 245 freq
laud (2) - 5 freq
SoundEx code - A430
auld - 3449 freq
aloot - 1 freq
alloued - 54 freq
alood - 27 freq
allowed - 126 freq
all-white - 1 freq
alt - 5 freq
aloud - 19 freq
ailed - 2 freq
ailt - 3 freq
allood - 7 freq
allooed - 58 freq
'auld - 11 freq
altho - 38 freq
awald - 3 freq
auld' - 1 freq
alot - 18 freq
altho' - 2 freq
ald - 14 freq
aloat - 8 freq
aloatae - 1 freq
aldo - 266 freq
aldo' - 1 freq
aald - 202 freq
alloot - 4 freq
allouit - 1 freq
aulde - 1 freq
alyth - 2 freq
alloo'd - 1 freq
'aald - 1 freq
allyat - 2 freq
awld - 6 freq
aalt - 2 freq
allied - 3 freq
allout - 2 freq
allou'd - 1 freq
alloeud - 1 freq
‘auld - 3 freq
ayld - 1 freq
alooed - 2 freq
“auld - 4 freq
alloed - 3 freq
aldi - 9 freq
'auld' - 1 freq
altai - 2 freq
aild - 1 freq
“ald - 1 freq
’aldo - 7 freq
allude - 1 freq
alloud - 1 freq
MetaPhone code - ALT
auld - 3449 freq
aloot - 1 freq
alloued - 54 freq
alood - 27 freq
alt - 5 freq
aloud - 19 freq
ailed - 2 freq
ailt - 3 freq
allood - 7 freq
allooed - 58 freq
'auld - 11 freq
auld' - 1 freq
alot - 18 freq
ald - 14 freq
aloat - 8 freq
aloatae - 1 freq
aldo - 266 freq
aldo' - 1 freq
aald - 202 freq
alloot - 4 freq
allouit - 1 freq
aulde - 1 freq
alloo'd - 1 freq
'aald - 1 freq
awld - 6 freq
aalt - 2 freq
allied - 3 freq
allout - 2 freq
allou'd - 1 freq
alloeud - 1 freq
‘auld - 3 freq
ayld - 1 freq
alooed - 2 freq
“auld - 4 freq
alloed - 3 freq
aldi - 9 freq
'auld' - 1 freq
altai - 2 freq
aild - 1 freq
“ald - 1 freq
’aldo - 7 freq
allude - 1 freq
alloud - 1 freq
Manually curated - ALD
Time to execute Levenshtein function - 0.186561 milliseconds
The Levenshtein distance is the minimum number of single-character substitutions, insertions or deletions needed to transform one word into another. It is useful for detecting typos and alternative spellings.
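As a rough illustration of the definition, here is a minimal Python sketch of the standard dynamic-programming calculation. The site's own code is not shown on this page, so the function name `levenshtein` and the row-by-row layout are assumptions made for the example only.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance counting single-character insertions, deletions, substitutions."""
    # Keep the shorter word in the inner loop so each row stays small.
    if len(a) < len(b):
        a, b = b, a
    # previous[j] holds the distance between the first i-1 chars of a
    # and the first j chars of b.
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning i characters into the empty string costs i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # substitute ca with cb (free if equal)
            ))
        previous = current
    return previous[-1]


print(levenshtein("ald", "auld"))  # 1, one inserted letter
print(levenshtein("ald", "and"))   # 1, one substituted letter
```

Both values agree with the distances shown in the Levenshtein listing for ald.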
Time to execute Double Levenshtein function - 0.378282 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
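The idea can be sketched in Python as follows. This is not the site's code. The names `double_levenshtein` and `strip_vowels` are hypothetical, and the vowel set is an assumption: y is counted as a vowel here only because that reproduces the score of 1 shown for ayld in the listing.

```python
VOWELS = set("aeiouy")  # assumption: y treated as a vowel, inferred from the listing


def levenshtein(a: str, b: str) -> int:
    """Plain edit distance (insert, delete, substitute), row by row."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop every vowel, keeping consonants and punctuation in order."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the vowel-stripped words."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("ald", "aldo"))  # 1: full words differ by 1, consonants match
print(double_levenshtein("ald", "ale"))   # 2: 1 on the full words, 1 on "ld" vs "l"
```

A changed vowel costs 1, while a changed consonant costs 2 because it is counted in both passes.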
Time to execute SoundEx function - 0.027579 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
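For illustration, here is a minimal Python sketch of the common American Soundex rules: keep the first letter, map the remaining consonants to digits, collapse adjacent duplicates, and pad to four characters. The function name `soundex` and the handling of non-letters (they are ignored) are assumptions for the example; the site's implementation may differ in detail.

```python
# Consonant groups and their Soundex digits.
_CODES = {}
for _letters, _digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                         ("l", "4"), ("mn", "5"), ("r", "6")):
    for _ch in _letters:
        _CODES[_ch] = _digit


def soundex(word: str) -> str:
    """Return a four-character Soundex code, or '' if the word has no ASCII letters."""
    # Assumption: quotes, hyphens and other non-letters are simply skipped.
    letters = [c for c in word.lower() if c.isascii() and c.isalpha()]
    if not letters:
        return ""
    result = letters[0].upper()
    prev = _CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate two letters with the same code
        code = _CODES.get(ch, "")
        if code and code != prev:
            result += code
            if len(result) == 4:
                break
        prev = code  # a vowel resets prev, so a repeated code after it counts again
    return result.ljust(4, "0")


print(soundex("ald"))      # A430
print(soundex("allowed"))  # A430
```

Both words fall under the same code, A430, which is why allowed appears in the SoundEx listing for ald.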
Time to execute MetaPhone function - 0.037185 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000842 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
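A lookup of this kind can be sketched in Python as a plain dictionary. Everything in this sketch is hypothetical: the names `LEXICON` and `lemma_for`, the entry for ald (assumed from the result shown on this page), and the second, invented entry included only to show how a typo would be stored.

```python
from typing import Optional

# Hypothetical hand-built lexicon mapping a surface form to its lemma.
LEXICON = {
    "ald": "ALD",  # assumed from the curated result shown for ald
    "adl": "ALD",  # invented typo entry, for illustration only
}


def lemma_for(word: str) -> Optional[str]:
    """Return the curated lemma for a word, or None if the word is not covered."""
    return LEXICON.get(word.strip().lower())


print(lemma_for("ald"))     # ALD
print(lemma_for("sonsie"))  # None, because this toy lexicon does not cover it
```

A dictionary lookup does no computation on the word itself, which fits with this being the fastest of the five timings reported.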