A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to davis in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
davis (0) - 7 freq
davie (1) - 229 freq
dayis (1) - 4 freq
ravis (1) - 1 freq
daves (1) - 2 freq
davies (1) - 13 freq
mavis (1) - 5 freq
david (1) - 234 freq
davit (1) - 23 freq
savin (2) - 38 freq
damish (2) - 4 freq
devi (2) - 1 freq
daein (2) - 882 freq
dares (2) - 5 freq
dauvit (2) - 95 freq
cavin (2) - 1 freq
taves (2) - 4 freq
datit (2) - 6 freq
xavi (2) - 1 freq
dads (2) - 19 freq
havns (2) - 1 freq
dawin (2) - 32 freq
xav's (2) - 1 freq
davey (2) - 13 freq
dasies (2) - 1 freq
davis (0) - 7 freq
daves (1) - 2 freq
davies (1) - 13 freq
davit (2) - 23 freq
devise (2) - 2 freq
dives (2) - 7 freq
david (2) - 234 freq
doves (2) - 2 freq
dayis (2) - 4 freq
mavis (2) - 5 freq
ravis (2) - 1 freq
davie (2) - 229 freq
dams (3) - 6 freq
days (3) - 1574 freq
dacs (3) - 1 freq
daiys (3) - 1 freq
vais (3) - 2 freq
caves (3) - 17 freq
daavid (3) - 12 freq
devyse (3) - 2 freq
doris (3) - 58 freq
div's (3) - 1 freq
maves (3) - 8 freq
dav (3) - 1 freq
daiss (3) - 2 freq
SoundEx code - D120
dips - 11 freq
dabs - 5 freq
daubs - 1 freq
davis - 7 freq
diffuse - 2 freq
div's - 1 freq
deeps - 7 freq
devious - 4 freq
deep-sea - 4 freq
davies - 13 freq
dives - 7 freq
dowfhike - 1 freq
dabs' - 1 freq
dubs - 84 freq
dobbies - 2 freq
device - 23 freq
daffs - 3 freq
davie's - 36 freq
devise - 2 freq
dybbuk's - 1 freq
depose - 1 freq
divvies - 1 freq
defuse - 2 freq
dubious - 3 freq
doffs - 1 freq
deep-sey - 1 freq
dfs - 2 freq
diffs - 2 freq
davy's - 1 freq
duffy's - 4 freq
deeves - 1 freq
daffik - 2 freq
doupies - 1 freq
deips - 1 freq
daffies - 4 freq
davoo's - 1 freq
dfc - 2 freq
devyse - 2 freq
daffiks - 1 freq
dpis - 1 freq
dowps - 6 freq
daffys - 1 freq
debauch - 1 freq
dpbc - 1 freq
doves - 2 freq
deives - 1 freq
defeck - 1 freq
duffs - 2 freq
dps - 1 freq
defies - 2 freq
dpbz - 1 freq
dpz - 1 freq
dbz - 1 freq
dfpz - 1 freq
dpke - 5 freq
debs - 1 freq
dbyg - 1 freq
daveg - 1 freq
dwbaqh - 1 freq
dfkxa - 1 freq
debbie's - 1 freq
ddaps - 1 freq
dbis - 1 freq
daves - 2 freq
MetaPhone code - TFS
davis - 7 freq
diffuse - 2 freq
div's - 1 freq
devious - 4 freq
davies - 13 freq
dives - 7 freq
device - 23 freq
daffs - 3 freq
davie's - 36 freq
devise - 2 freq
toffees - 3 freq
toffs - 13 freq
divvies - 1 freq
defuse - 2 freq
doffs - 1 freq
dfs - 2 freq
diffs - 2 freq
davy's - 1 freq
duffy's - 4 freq
deeves - 1 freq
tighes - 1 freq
tiefs - 2 freq
daffies - 4 freq
davoo's - 1 freq
devyse - 2 freq
dughoose - 1 freq
taves - 4 freq
'taves' - 1 freq
toves - 1 freq
tief's - 1 freq
tvs - 2 freq
daffys - 1 freq
typhus - 1 freq
doves - 2 freq
deives - 1 freq
duffs - 2 freq
defies - 2 freq
teefs - 1 freq
daves - 2 freq
DAVIS
Time to execute Levenshtein function - 0.513385 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.906901 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.089172 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.110433 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001037 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.