A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to depth in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
depth (0) - 23 freq
depths (1) - 14 freq
deepth (1) - 1 freq
depts (1) - 3 freq
death (1) - 170 freq
deith (1) - 103 freq
deptht (1) - 4 freq
dept (1) - 7 freq
derth (1) - 2 freq
deeth (1) - 4 freq
deputy (2) - 6 freq
deat (2) - 1 freq
eith (2) - 26 freq
epty (2) - 1 freq
perth (2) - 41 freq
lenth (2) - 83 freq
det (2) - 3 freq
depute (2) - 21 freq
deh (2) - 5 freq
depth's (2) - 1 freq
dath (2) - 1 freq
deish (2) - 6 freq
tecth (2) - 2 freq
feth (2) - 45 freq
seth (2) - 4 freq
depth (0) - 23 freq
deepth (1) - 1 freq
dept (2) - 7 freq
derth (2) - 2 freq
deptht (2) - 4 freq
deeth (2) - 4 freq
deith (2) - 103 freq
depths (2) - 14 freq
depts (2) - 3 freq
death (2) - 170 freq
adept (3) - 2 freq
depot (3) - 1 freq
doth (3) - 9 freq
peth (3) - 37 freq
daeth (3) - 17 freq
dearth (3) - 9 freq
daith (3) - 294 freq
darth (3) - 3 freq
daph (3) - 1 freq
dath (3) - 1 freq
depute (3) - 21 freq
dith (3) - 1 freq
deepths (3) - 5 freq
deputy (3) - 6 freq
deiths (4) - 7 freq
SoundEx code - D130
dippit - 12 freq
daft - 444 freq
doubt - 94 freq
daftie - 23 freq
dauvit - 95 freq
dipped - 23 freq
devoid - 7 freq
dabbed - 9 freq
debt - 45 freq
deaved - 13 freq
divvied - 2 freq
dafty - 29 freq
debate - 81 freq
'daft - 2 freq
david - 234 freq
doft - 1 freq
defeat - 30 freq
divide - 24 freq
divot - 13 freq
daavit - 25 freq
'daavit - 1 freq
defait - 7 freq
doobt - 26 freq
defaut - 20 freq
dubbed - 3 freq
dived - 21 freq
duvet - 20 freq
daffed - 3 freq
daupit - 3 freq
dabbit - 6 freq
dowpit - 26 freq
defied - 5 freq
'dauvit - 2 freq
divid - 10 freq
dvd - 11 freq
dafft - 2 freq
dayvideee - 2 freq
depth - 23 freq
deeved - 6 freq
devout - 4 freq
dabaittie - 2 freq
deputy - 6 freq
daivit - 4 freq
dappit - 1 freq
devide - 1 freq
dobbid - 1 freq
doped - 2 freq
debut - 7 freq
daubed - 3 freq
davit - 23 freq
divïd - 8 freq
'dippit - 1 freq
doffed - 3 freq
debait - 1 freq
'david - 2 freq
debit - 2 freq
deft - 4 freq
depute - 21 freq
deived - 1 freq
dept - 7 freq
divvy-oot - 1 freq
depot - 1 freq
davyth - 1 freq
doupit - 1 freq
€œdauvit - 3 freq
€œdavid - 1 freq
dowped - 1 freq
€˜devout - 1 freq
deavit - 1 freq
daavid - 12 freq
daavd - 1 freq
dawpit - 1 freq
€œdaft - 1 freq
dpd - 1 freq
dbooth - 1 freq
dvd' - 1 freq
deepth - 1 freq
“daft - 1 freq
'dived' - 3 freq
dopyt - 1 freq
MetaPhone code - TP0
depth - 23 freq
deepth - 1 freq
DEPTH
Time to execute Levenshtein function - 0.797023 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 1.583080 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.096934 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.107221 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000876 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.