A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to dvds in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
dvds (0) - 3 freq
dvd (1) - 11 freq
dvd' (1) - 1 freq
dads (1) - 19 freq
duds (1) - 15 freq
dods (1) - 8 freq
eves (2) - 1 freq
dugs (2) - 231 freq
dodo (2) - 79 freq
dives (2) - 7 freq
dpis (2) - 1 freq
dees (2) - 41 freq
daws (2) - 5 freq
avs (2) - 1 freq
vd (2) - 2 freq
dus (2) - 24 freq
didn (2) - 25 freq
dedes (2) - 1 freq
teds (2) - 1 freq
ds (2) - 9 freq
hads (2) - 2 freq
zvs (2) - 1 freq
vdb (2) - 3 freq
dod's (2) - 9 freq
dj's (2) - 1 freq
dvds (0) - 3 freq
devdas (2) - 3 freq
dods (2) - 8 freq
duds (2) - 15 freq
dvd (2) - 11 freq
dvd' (2) - 1 freq
dads (2) - 19 freq
davis (3) - 7 freq
dadds (3) - 1 freq
odds (3) - 91 freq
dudes (3) - 1 freq
dodds (3) - 4 freq
div's (3) - 1 freq
dedis (3) - 1 freq
daves (3) - 2 freq
vids (3) - 2 freq
dodos (3) - 1 freq
deids (3) - 1 freq
dides (3) - 1 freq
daads (3) - 1 freq
vdus (3) - 1 freq
doves (3) - 2 freq
dives (3) - 7 freq
dauds (3) - 26 freq
adds (3) - 23 freq
SoundEx code - D132
david's - 17 freq
daftish - 2 freq
dafties - 17 freq
dafties' - 1 freq
debts - 8 freq
divots - 5 freq
depths - 14 freq
doubts - 5 freq
devotees - 3 freq
divot's - 2 freq
dvds - 3 freq
depth's - 1 freq
diabetes - 6 freq
debt's - 1 freq
dabbities - 1 freq
davidson - 26 freq
daftest - 4 freq
diabetic - 2 freq
davit's - 1 freq
dippitest - 1 freq
devoto's - 1 freq
daavit's - 6 freq
doobts - 1 freq
deepths - 5 freq
daftie's - 10 freq
depts - 3 freq
debates - 11 freq
divides - 4 freq
daavid's - 1 freq
dauvit's - 1 freq
divits - 1 freq
davidson's - 1 freq
davidsons - 3 freq
deputes - 1 freq
depth-charges - 2 freq
devdas - 3 freq
deputyship - 1 freq
dafities - 1 freq
dtptsgmqe - 1 freq
davidjames - 3 freq
davidcameron - 1 freq
davidÂ’s - 1 freq
davidjmadden - 1 freq
dipduckdive - 3 freq
davidccraig - 1 freq
davidjwood - 3 freq
davidjewood - 1 freq
davidsonmagnus - 3 freq
diabetesuk - 1 freq
davidghfrost - 1 freq
duvets - 1 freq
davidschneider - 2 freq
davidwshedden - 1 freq
davidhawker - 1 freq
MetaPhone code - TFTS
david's - 17 freq
dafties - 17 freq
dafties' - 1 freq
divots - 5 freq
devotees - 3 freq
dights - 5 freq
divot's - 2 freq
dvds - 3 freq
tufty's - 3 freq
tuftie's - 1 freq
tuffty's - 1 freq
tights - 15 freq
davit's - 1 freq
devoto's - 1 freq
daavit's - 6 freq
tufts - 2 freq
daftie's - 10 freq
divides - 4 freq
daavid's - 1 freq
dauvit's - 1 freq
divits - 1 freq
€œtights - 1 freq
devdas - 3 freq
dafities - 1 freq
davidÂ’s - 1 freq
duvets - 1 freq
DVDS
Time to execute Levenshtein function - 0.255375 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.451556 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.030033 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.043181 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001122 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.