A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to depth in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
depth (0) - 22 freq
deith (1) - 102 freq
deepth (1) - 1 freq
dept (1) - 7 freq
depts (1) - 3 freq
death (1) - 168 freq
deeth (1) - 4 freq
deptht (1) - 4 freq
derth (1) - 2 freq
depths (1) - 13 freq
ept (2) - 1 freq
jenth (2) - 1 freq
herth (2) - 3 freq
denty (2) - 6 freq
adept (2) - 2 freq
meith (2) - 3 freq
cept (2) - 25 freq
neath (2) - 4 freq
deith' (2) - 1 freq
delta (2) - 1 freq
derts (2) - 2 freq
epty (2) - 1 freq
peth (2) - 37 freq
heth (2) - 25 freq
dent (2) - 4 freq
depth (0) - 22 freq
deepth (1) - 1 freq
deptht (2) - 4 freq
depths (2) - 13 freq
deeth (2) - 4 freq
derth (2) - 2 freq
death (2) - 168 freq
deith (2) - 102 freq
dept (2) - 7 freq
depts (2) - 3 freq
peth (3) - 37 freq
deputy (3) - 6 freq
doth (3) - 9 freq
daeth (3) - 17 freq
dearth (3) - 9 freq
daph (3) - 1 freq
dith (3) - 1 freq
deepths (3) - 5 freq
darth (3) - 3 freq
depot (3) - 1 freq
daith (3) - 286 freq
depute (3) - 21 freq
adept (3) - 2 freq
dath (3) - 1 freq
deh (4) - 5 freq
SoundEx code - D130
dippit - 12 freq
daft - 436 freq
doubt - 86 freq
daftie - 23 freq
dauvit - 95 freq
dipped - 23 freq
devoid - 7 freq
dabbed - 9 freq
debt - 44 freq
deaved - 13 freq
divvied - 2 freq
dafty - 29 freq
debate - 81 freq
'daft - 2 freq
david - 230 freq
doft - 1 freq
defeat - 30 freq
divide - 24 freq
divot - 13 freq
daavit - 25 freq
'daavit - 1 freq
defait - 7 freq
doobt - 26 freq
defaut - 20 freq
dubbed - 2 freq
dived - 21 freq
duvet - 20 freq
daffed - 3 freq
daupit - 3 freq
dabbit - 6 freq
dowpit - 26 freq
defied - 5 freq
'dauvit - 2 freq
divid - 10 freq
dvd - 11 freq
dafft - 2 freq
dayvideee - 2 freq
depth - 22 freq
deeved - 6 freq
devout - 4 freq
dabaittie - 2 freq
deputy - 6 freq
daivit - 4 freq
dappit - 1 freq
devide - 1 freq
dobbid - 1 freq
doped - 2 freq
debut - 7 freq
daubed - 3 freq
davit - 23 freq
divïd - 8 freq
'dippit - 1 freq
doffed - 3 freq
debait - 1 freq
'david - 2 freq
debit - 2 freq
deft - 4 freq
depute - 21 freq
deived - 1 freq
dept - 7 freq
divvy-oot - 1 freq
depot - 1 freq
davyth - 1 freq
doupit - 1 freq
€œdauvit - 3 freq
€œdavid - 1 freq
dowped - 1 freq
€˜devout - 1 freq
deavit - 1 freq
daavid - 12 freq
daavd - 1 freq
dawpit - 1 freq
€œdaft - 1 freq
dpd - 1 freq
dbooth - 1 freq
dvd' - 1 freq
deepth - 1 freq
“daft - 1 freq
'dived' - 3 freq
dopyt - 1 freq
MetaPhone code - TP0
depth - 22 freq
deepth - 1 freq
DEPTH
Time to execute Levenshtein function - 0.209034 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.355890 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027470 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.036825 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000891 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.