A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie


Similar words to altai in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
Levenshtein
altai (0) - 2 freq
altar (1) - 23 freq
dalai (2) - 2 freq
alibi (2) - 2 freq
allan (2) - 91 freq
almac (2) - 1 freq
ata' (2) - 6 freq
alva (2) - 2 freq
ataw (2) - 8 freq
alwais (2) - 1 freq
asti (2) - 1 freq
aldi (2) - 9 freq
ali (2) - 165 freq
alba (2) - 75 freq
aetan (2) - 3 freq
malti (2) - 1 freq
hlai (2) - 1 freq
malta (2) - 2 freq
aloat (2) - 8 freq
alfdi (2) - 1 freq
althar (2) - 5 freq
alsae (2) - 9 freq
alas (2) - 43 freq
aaltar (2) - 2 freq
attar (2) - 1 freq
Double Levenshtein
altai (0) - 2 freq
alt (2) - 5 freq
aloat (2) - 8 freq
altar (2) - 23 freq
alab (3) - 1 freq
ailt (3) - 3 freq
alian (3) - 1 freq
alya (3) - 1 freq
lota (3) - 1 freq
alot (3) - 18 freq
alki (3) - 1 freq
alain (3) - 8 freq
ataa (3) - 8 freq
alter (3) - 15 freq
alsa (3) - 1 freq
galti (3) - 1 freq
aalt (3) - 2 freq
lotae (3) - 1 freq
aloot (3) - 1 freq
aloatae (3) - 1 freq
lat (3) - 555 freq
lt (3) - 5 freq
elt (3) - 2 freq
leat (3) - 1 freq
lait (3) - 4 freq
SoundEx code - A430
auld - 3501 freq
aloot - 1 freq
alloued - 54 freq
alood - 27 freq
allowed - 130 freq
all-white - 1 freq
alt - 5 freq
aloud - 22 freq
ailed - 2 freq
ailt - 3 freq
allood - 7 freq
allooed - 58 freq
'auld - 11 freq
altho - 38 freq
awald - 3 freq
auld' - 1 freq
alot - 18 freq
altho' - 2 freq
allied - 4 freq
ald - 14 freq
aloat - 8 freq
aloatae - 1 freq
aldo - 266 freq
aldo' - 1 freq
aald - 202 freq
alloot - 4 freq
allouit - 1 freq
aulde - 1 freq
alyth - 2 freq
alloo'd - 1 freq
'aald - 1 freq
allyat - 2 freq
awld - 6 freq
aalt - 2 freq
allout - 2 freq
allou'd - 1 freq
alloeud - 1 freq
‘auld - 3 freq
ayld - 1 freq
alooed - 2 freq
“auld - 4 freq
alloed - 3 freq
aldi - 9 freq
'auld' - 1 freq
altai - 2 freq
aild - 1 freq
“ald - 1 freq
’aldo - 7 freq
allude - 1 freq
alloud - 1 freq
MetaPhone code - ALT
auld - 3501 freq
aloot - 1 freq
alloued - 54 freq
alood - 27 freq
alt - 5 freq
aloud - 22 freq
ailed - 2 freq
ailt - 3 freq
allood - 7 freq
allooed - 58 freq
'auld - 11 freq
auld' - 1 freq
alot - 18 freq
allied - 4 freq
ald - 14 freq
aloat - 8 freq
aloatae - 1 freq
aldo - 266 freq
aldo' - 1 freq
aald - 202 freq
alloot - 4 freq
allouit - 1 freq
aulde - 1 freq
alloo'd - 1 freq
'aald - 1 freq
awld - 6 freq
aalt - 2 freq
allout - 2 freq
allou'd - 1 freq
alloeud - 1 freq
‘auld - 3 freq
ayld - 1 freq
alooed - 2 freq
“auld - 4 freq
alloed - 3 freq
aldi - 9 freq
'auld' - 1 freq
altai - 2 freq
aild - 1 freq
“ald - 1 freq
’aldo - 7 freq
allude - 1 freq
alloud - 1 freq
Manually curated - ALTAI
Time to execute Levenshtein function - 0.222082 milliseconds
The Levenshtein distance is the minimum number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
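As a rough illustration of the definition above, here is a minimal Python sketch of the edit distance using the usual two-row dynamic programme. It is not the code this site runs; the function name levenshtein and the unit cost for every operation are assumptions made for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character inserts, deletes and
    substitutions needed to turn a into b (every operation costs 1)."""
    # Keep the shorter word in the inner loop so the rows stay small.
    if len(a) < len(b):
        a, b = b, a
    # previous[j] = distance between the first i-1 chars of a and first j of b.
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]  # turning i characters into the empty prefix takes i deletes
        for j, char_b in enumerate(b, start=1):
            substitution = previous[j - 1] + (0 if char_a == char_b else 1)
            deletion = previous[j] + 1
            insertion = current[j - 1] + 1
            current.append(min(substitution, deletion, insertion))
        previous = current
    return previous[-1]


if __name__ == "__main__":
    for word in ("altai", "altar", "dalai", "alibi"):
        print(word, levenshtein("altai", word))
```

Run against the query word altai, this reproduces the bracketed distances in the Levenshtein list for altar (1), dalai (2) and alibi (2).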
Time to execute Double Levenshtein function - 0.321223 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
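The description above can be sketched in Python as follows. This is an illustration under stated assumptions, not the site's own code: the names double_levenshtein and strip_vowels are hypothetical, and the vowel set is assumed to be a, e, i, o, u (the page does not say how y or accented letters are treated).

```python
VOWELS = set("aeiou")  # assumption: the page does not define its vowel set


def levenshtein(a: str, b: str) -> int:
    """Plain edit distance with unit costs (two-row dynamic programme)."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(previous[j - 1] + cost,
                               previous[j] + 1,
                               current[j - 1] + 1))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Return the word with every assumed vowel removed."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


if __name__ == "__main__":
    for word in ("altar", "alt", "aloat", "lat"):
        print(word, double_levenshtein("altai", word))
```

Under these assumptions the sketch agrees with the Double Levenshtein list for altai: altar scores 1 + 1 = 2, alt scores 2 + 0 = 2, and lat scores 3 + 0 = 3, because words that share the consonant skeleton "lt" pay nothing in the second pass.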
Time to execute SoundEx function - 0.026825 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
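To make the encoding concrete, here is a minimal Python sketch of the standard American Soundex rules (keep the first letter, map the remaining consonants to digits, collapse adjacent repeats, pad to four characters). The function name soundex and the choice to ignore apostrophes and other non-letters are assumptions for the example; the site's own implementation may differ in edge cases.

```python
# Digit assigned to each consonant group; vowels, h, w and y have no digit.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for letter in letters:
        CODES[letter] = digit


def soundex(word: str) -> str:
    """Four-character Soundex code, or an empty string if no ASCII letters."""
    # Assumption: punctuation such as a leading apostrophe is simply dropped.
    letters = [ch for ch in word.lower() if ch.isascii() and ch.isalpha()]
    if not letters:
        return ""
    code = letters[0].upper()
    last_digit = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate two letters with the same digit
        digit = CODES.get(ch, "")
        if digit and digit != last_digit:
            code += digit
        last_digit = digit  # a vowel resets this, so a repeat after it counts
    return (code + "000")[:4]


if __name__ == "__main__":
    for word in ("altai", "auld", "allowed", "altho"):
        print(word, soundex(word))
```

With these rules altai, auld, allowed and altho all encode to A430, which is why they appear together in the SoundEx list.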
Time to execute MetaPhone function - 0.036925 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.000826 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
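A lookup of this kind can be sketched in Python as a plain dictionary. Everything here is hypothetical: the name lookup_lemma, the table LEMMA_TABLE and its entries are made up for illustration (only the altai to ALTAI pairing is taken from the result shown on this page), and the real lexicon is not reproduced.

```python
from typing import Optional

# Hypothetical hand-built lexicon: spelling variant -> lemma.
# Only "altai" -> "ALTAI" comes from the page; the rest are illustrative.
LEMMA_TABLE = {
    "altai": "ALTAI",
    "auld": "AULD",
    "aald": "AULD",
    "awld": "AULD",
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the curated lemma for a word, or None if it is not covered."""
    return LEMMA_TABLE.get(word.strip().lower())


if __name__ == "__main__":
    for word in ("altai", "Aald", "zzz"):
        print(word, lookup_lemma(word))
```

A dictionary lookup does no string comparison across the corpus, which fits the very small execution time reported for this method; the trade-off is that uncovered words simply return nothing.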