A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow


Words similar to "added" in the corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
added (0) - 173 freq
padded (1) - 13 freq
wadded (1) - 1 freq
adden (1) - 1 freq
aided (1) - 2 freq
dadded (1) - 1 freq
addet (1) - 25 freq
adder (1) - 4 freq
adeed (1) - 1 freq
addled (1) - 3 freq
waded (2) - 6 freq
aised (2) - 6 freq
adhd (2) - 5 freq
hauded (2) - 2 freq
awed (2) - 3 freq
asked (2) - 498 freq
arden (2) - 2 freq
ahied (2) - 1 freq
falded (2) - 1 freq
bedded (2) - 8 freq
aeged (2) - 1 freq
noadded (2) - 2 freq
axed (2) - 81 freq
redded (2) - 2 freq
adee (2) - 30 freq
Double Levenshtein
added (0) - 173 freq
adeed (2) - 1 freq
addled (2) - 3 freq
dauded (2) - 1 freq
ddd (2) - 13 freq
adder (2) - 4 freq
padded (2) - 13 freq
adden (2) - 1 freq
addet (2) - 25 freq
wadded (2) - 1 freq
dadded (2) - 1 freq
aided (2) - 2 freq
adead (3) - 1 freq
udder (3) - 1 freq
abided (3) - 1 freq
adda (3) - 4 freq
aidder (3) - 1 freq
lauded (3) - 1 freq
kidded (3) - 1 freq
addy (3) - 6 freq
andoed (3) - 1 freq
andied (3) - 1 freq
add (3) - 133 freq
inded (3) - 1 freq
coded (3) - 2 freq
SoundEx code - A330
athoot - 319 freq
'audit - 1 freq
addit - 97 freq
added - 173 freq
athout - 27 freq
adead - 1 freq
adhd - 5 freq
atotie - 1 freq
awaited - 4 freq
addet - 25 freq
a'daeth - 1 freq
awaitit - 3 freq
ati'da - 18 freq
audit - 12 freq
aathoot - 1 freq
atidda - 7 freq
‘audit - 1 freq
aided - 2 freq
adeed - 1 freq
awytit - 1 freq
aidit - 1 freq
a'thoot - 1 freq
athott - 1 freq
MetaPhone code - ATT
'audit - 1 freq
addit - 97 freq
added - 173 freq
adead - 1 freq
at'd - 4 freq
adhd - 5 freq
atotie - 1 freq
addet - 25 freq
aat'd - 5 freq
'at'd - 1 freq
ati'da - 18 freq
audit - 12 freq
atidda - 7 freq
‘audit - 1 freq
aided - 2 freq
adeed - 1 freq
awytit - 1 freq
aidit - 1 freq
ADDED
Time to execute Levenshtein function - 0.410020 milliseconds
The Levenshtein distance is the minimum number of single-character replacements, insertions or deletions needed to transform one word into another. It's useful for detecting typos and alternative spellings.
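As a rough illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming approach. It is not the code behind this page; the function name `levenshtein` is chosen for the example only.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character insertions, deletions
    or replacements needed to turn a into b."""
    # previous[j] holds the distance between the processed prefix of a
    # and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning a[:i] into "" takes i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (or keep it)
            ))
        previous = current
    return previous[-1]


if __name__ == "__main__":
    for word in ("added", "padded", "waded", "asked"):
        print(word, levenshtein("added", word))
```

Only two rows of the table are kept in memory at a time, which is enough when just the distance (and not the edit sequence) is wanted.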
Time to execute Double Levenshtein function - 0.807128 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
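The idea can be sketched in Python as follows. This is an assumption-laden reconstruction from the description, not the site's own code: the names `double_levenshtein` and `strip_vowels` are invented for the example, and treating only a, e, i, o and u as vowels is a guess.

```python
VOWELS = set("aeiou")  # assumption: y is treated as a consonant


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance (two-row dynamic programming)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop every vowel, leaving the consonant skeleton of the word."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the vowel-free words."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


if __name__ == "__main__":
    for word in ("added", "padded", "ddd", "add"):
        print(word, double_levenshtein("added", word))
```

Under these assumptions a consonant change is counted in both passes while a vowel change is counted only in the first, which is where the double weight comes from.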
Time to execute SoundEx function - 0.059410 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
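For readers who want to see how such a code is built, here is a minimal Python sketch of the common American Soundex rules: keep the first letter, map the remaining consonants to digits, skip repeated codes, and pad to four characters. It is an illustration only; the function name `soundex` is chosen for the example, and Soundex variants differ in how they treat h and w, so a few groupings on this page may not be reproduced exactly.

```python
# Letter groups that share a Soundex digit.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word: str) -> str:
    """American Soundex: first letter plus three digits, zero-padded."""
    # Ignore anything that is not a plain letter (apostrophes, digits, ...).
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    first = letters[0]
    result = first.upper()
    last = CODES.get(first, "")  # code of the most recent letter
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters with the same code
        code = CODES.get(ch, "")  # vowels (and y) have no code
        if code and code != last:
            result += code
        last = code  # a vowel resets this, so a repeated code counts again
    return (result + "000")[:4]


if __name__ == "__main__":
    for word in ("added", "athoot", "audit", "'audit"):
        print(word, soundex(word))
```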
Time to execute MetaPhone function - 0.083752 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001028 milliseconds
Manual Curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
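A lookup of this kind can be sketched as a plain dictionary in Python. The table contents below are hypothetical, made up to show the shape of the data; only the result ADDED for "added" appears on this page, and the names `LEXICON` and `lookup_lemma` are invented for the example.

```python
from typing import Optional

# Hypothetical hand-curated entries: spelling variant -> lemma.
# The real lexicon behind this page is not shown, so these are placeholders.
LEXICON = {
    "added": "ADDED",
    "addit": "ADDED",  # assumed variant, for illustration only
    "addet": "ADDED",  # assumed variant, for illustration only
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the curated lemma for word, or None if it is not covered."""
    return LEXICON.get(word.lower())


if __name__ == "__main__":
    print(lookup_lemma("addit"))
    print(lookup_lemma("ablow"))
```

A dictionary lookup involves no string comparison against the rest of the corpus, which fits with this method having the shortest execution time reported above.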