A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie


Similar words to doubt in Corpus

Levenshtein
doubt (0) - 86 freq
doublt (1) - 2 freq
doobt (1) - 26 freq
dout (1) - 167 freq
doukt (1) - 3 freq
doubts (1) - 5 freq
dopyt (2) - 1 freq
douro (2) - 1 freq
pourt (2) - 2 freq
doukit (2) - 3 freq
durt (2) - 2 freq
ojbt (2) - 1 freq
dob (2) - 1 freq
doupit (2) - 1 freq
nout (2) - 2 freq
doilt (2) - 1 freq
douff (2) - 2 freq
doubtfu (2) - 3 freq
hout (2) - 1 freq
doot (2) - 565 freq
dotit (2) - 1 freq
pout (2) - 1 freq
douts (2) - 18 freq
dubs (2) - 84 freq
doup (2) - 18 freq
Double Levenshtein
doubt (0) - 86 freq
doobt (1) - 26 freq
doubts (2) - 5 freq
debt (2) - 44 freq
doukt (2) - 3 freq
dout (2) - 167 freq
doublt (2) - 2 freq
daub (3) - 3 freq
bout (3) - 14 freq
doft (3) - 1 freq
dowt (3) - 6 freq
doubted (3) - 7 freq
debit (3) - 2 freq
dont (3) - 76 freq
doubly (3) - 4 freq
daunt (3) - 3 freq
doute (3) - 2 freq
donut (3) - 1 freq
doit (3) - 3 freq
dot (3) - 47 freq
oubit (3) - 2 freq
doant (3) - 2 freq
doubtin (3) - 3 freq
dubh (3) - 4 freq
robt (3) - 1 freq
SoundEx code - D130
dippit - 12 freq
daft - 436 freq
doubt - 86 freq
daftie - 23 freq
dauvit - 95 freq
dipped - 23 freq
devoid - 7 freq
dabbed - 9 freq
debt - 44 freq
deaved - 13 freq
divvied - 2 freq
dafty - 29 freq
debate - 81 freq
'daft - 2 freq
david - 230 freq
doft - 1 freq
defeat - 30 freq
divide - 24 freq
divot - 13 freq
daavit - 25 freq
'daavit - 1 freq
defait - 7 freq
doobt - 26 freq
defaut - 20 freq
dubbed - 2 freq
dived - 21 freq
duvet - 20 freq
daffed - 3 freq
daupit - 3 freq
dabbit - 6 freq
dowpit - 26 freq
defied - 5 freq
'dauvit - 2 freq
divid - 10 freq
dvd - 11 freq
dafft - 2 freq
dayvideee - 2 freq
depth - 22 freq
deeved - 6 freq
devout - 4 freq
dabaittie - 2 freq
deputy - 6 freq
daivit - 4 freq
dappit - 1 freq
devide - 1 freq
dobbid - 1 freq
doped - 2 freq
debut - 7 freq
daubed - 3 freq
davit - 23 freq
divïd - 8 freq
'dippit - 1 freq
doffed - 3 freq
debait - 1 freq
'david - 2 freq
debit - 2 freq
deft - 4 freq
depute - 21 freq
deived - 1 freq
dept - 7 freq
divvy-oot - 1 freq
depot - 1 freq
davyth - 1 freq
doupit - 1 freq
“dauvit - 3 freq
“david - 1 freq
dowped - 1 freq
‘devout - 1 freq
deavit - 1 freq
daavid - 12 freq
daavd - 1 freq
dawpit - 1 freq
“daft - 1 freq
dpd - 1 freq
dbooth - 1 freq
dvd' - 1 freq
deepth - 1 freq
“daft - 1 freq
'dived' - 3 freq
dopyt - 1 freq
MetaPhone code - TBT
doubt - 86 freq
dabbed - 9 freq
debt - 44 freq
debate - 81 freq
doobt - 26 freq
dubbed - 2 freq
taibit - 1 freq
dabbit - 6 freq
tibet - 10 freq
dabaittie - 2 freq
dobbid - 1 freq
debut - 7 freq
daubed - 3 freq
debait - 1 freq
debit - 2 freq
“tibet - 1 freq
Manually curated lemma - DOUBT
doot - 565 freq
doubt - 86 freq
doots - 33 freq
doubts - 5 freq
Time to execute Levenshtein function - 0.288989 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
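As a rough illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming method. It is not the code this page runs; the function name `levenshtein` is chosen only for this example.

```python
def levenshtein(a: str, b: str) -> int:
    """Return the minimum number of single-character edits turning a into b."""
    # Row 0: turning the empty string into the first j characters of b
    # takes j insertions.
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        # Turning the first i characters of a into the empty string
        # takes i deletions.
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb, or keep it
            ))
        previous = current
    return previous[-1]


for word in ("doubt", "dout", "doobt", "doot"):
    print(word, levenshtein("doubt", word))
```

For the words printed here, the sketch gives the same distances as the results list: 0 for doubt, 1 for dout and doobt, 2 for doot.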
Time to execute Double Levenshtein function - 0.555955 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
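The idea can be sketched in Python as below. This is an illustration only, not the site's code. The vowel set is an assumption, since the page does not say which letters count as vowels, and the names `strip_vowels` and `double_levenshtein` are made up for this example.

```python
VOWELS = "aeiou"  # assumption: the page does not list its vowel set


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance (insert, delete, replace each cost 1)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str, vowels: str = VOWELS) -> str:
    """Drop every character that is in the (assumed) vowel set."""
    return "".join(ch for ch in word if ch.lower() not in vowels)


def double_levenshtein(a: str, b: str, vowels: str = VOWELS) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    full = levenshtein(a, b)
    consonants_only = levenshtein(strip_vowels(a, vowels),
                                  strip_vowels(b, vowels))
    return full + consonants_only


for word in ("doobt", "debt", "dout", "doubts"):
    print(word, double_levenshtein("doubt", word))
```

With this vowel set, doobt scores 1 because its consonant skeleton (dbt) is identical to that of doubt, while dout scores 2 because losing the b is counted in both passes.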
Time to execute SoundEx function - 0.040555 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
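To show how different spellings end up with one code, here is a small Python sketch of the commonly described American Soundex rules. It is not necessarily the exact routine used on this page, and the function name `soundex` is chosen for this example.

```python
def soundex(word: str) -> str:
    """Return a four-character Soundex code: first letter plus three digits."""
    # Consonant groups that share a digit; vowels, h, w and y have no digit.
    codes = {}
    for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                           ("l", "4"), ("mn", "5"), ("r", "6")):
        for ch in letters:
            codes[ch] = digit

    # Keep only plain a-z letters (quotes, hyphens and accents are skipped).
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""

    result = letters[0].upper()
    previous = codes.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters with the same digit
        digit = codes.get(ch, "")
        if digit and digit != previous:
            result += digit
        previous = digit  # a vowel resets this, so a repeat is coded again
        if len(result) == 4:
            break
    return (result + "000")[:4]  # pad short codes with zeros


for word in ("doubt", "daft", "david", "debt"):
    print(word, soundex(word))
```

All four words printed here come out as D130, the code shown in the results for doubt, which is why such a loose assortment of words lands in the same list.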
Time to execute MetaPhone function - 0.037647 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.000815 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
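A lookup of this kind might be sketched in Python as follows. The table here is a hypothetical stand-in that contains only the four forms listed under DOUBT in the results; the real lexicon, its storage format and the names `LEXICON`, `lemma_of` and `related_forms` are assumptions made for this example.

```python
# Hypothetical lexicon: word form -> lemma. Only the DOUBT entries
# from the results are included here.
LEXICON = {
    "doot": "DOUBT",
    "doubt": "DOUBT",
    "doots": "DOUBT",
    "doubts": "DOUBT",
}


def lemma_of(word: str):
    """Return the lemma for a word, or None if the word is not covered."""
    return LEXICON.get(word.lower())


def related_forms(word: str) -> list:
    """Return every form in the lexicon that shares the word's lemma."""
    lemma = lemma_of(word)
    if lemma is None:
        return []  # not all words are covered
    return sorted(form for form, lem in LEXICON.items() if lem == lemma)


print(lemma_of("doot"))
print(related_forms("doubt"))
print(related_forms("sonsie"))
```

Because this is a direct table lookup with no distance calculation, it is by far the cheapest of the methods, but it only helps for words someone has already entered by hand.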