A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to debait in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
debait (0) - 1 freq
decait (1) - 2 freq
debit (1) - 2 freq
defait (1) - 7 freq
debatit (1) - 9 freq
detail (2) - 42 freq
demit (2) - 2 freq
debate (2) - 81 freq
bait (2) - 32 freq
devrait (2) - 1 freq
deavit (2) - 1 freq
decairt (2) - 1 freq
ebbit (2) - 1 freq
debt (2) - 44 freq
defaut (2) - 20 freq
dealt (2) - 33 freq
depart (2) - 7 freq
dubai (2) - 1 freq
deeit (2) - 1 freq
deat (2) - 1 freq
dernit (2) - 4 freq
deit (2) - 9 freq
debaiten (2) - 1 freq
debbie (2) - 2 freq
dentit (2) - 3 freq
debait (0) - 1 freq
debit (1) - 2 freq
debate (2) - 81 freq
debt (2) - 44 freq
debatit (2) - 9 freq
decait (2) - 2 freq
defait (2) - 7 freq
debut (2) - 7 freq
deemit (3) - 2 freq
deit (3) - 9 freq
debatin (3) - 3 freq
debaiten (3) - 1 freq
debbie (3) - 2 freq
dabbit (3) - 6 freq
deceit (3) - 5 freq
u-bait (3) - 1 freq
rebat (3) - 5 freq
deat (3) - 1 freq
decayit (3) - 2 freq
dubai (3) - 1 freq
defaut (3) - 20 freq
deeit (3) - 1 freq
ebbit (3) - 1 freq
deavit (3) - 1 freq
bait (3) - 32 freq
SoundEx code - D130
dippit - 12 freq
daft - 436 freq
doubt - 86 freq
daftie - 23 freq
dauvit - 95 freq
dipped - 23 freq
devoid - 7 freq
dabbed - 9 freq
debt - 44 freq
deaved - 13 freq
divvied - 2 freq
dafty - 29 freq
debate - 81 freq
'daft - 2 freq
david - 230 freq
doft - 1 freq
defeat - 30 freq
divide - 24 freq
divot - 13 freq
daavit - 25 freq
'daavit - 1 freq
defait - 7 freq
doobt - 26 freq
defaut - 20 freq
dubbed - 2 freq
dived - 21 freq
duvet - 20 freq
daffed - 3 freq
daupit - 3 freq
dabbit - 6 freq
dowpit - 26 freq
defied - 5 freq
'dauvit - 2 freq
divid - 10 freq
dvd - 11 freq
dafft - 2 freq
dayvideee - 2 freq
depth - 22 freq
deeved - 6 freq
devout - 4 freq
dabaittie - 2 freq
deputy - 6 freq
daivit - 4 freq
dappit - 1 freq
devide - 1 freq
dobbid - 1 freq
doped - 2 freq
debut - 7 freq
daubed - 3 freq
davit - 23 freq
divïd - 8 freq
'dippit - 1 freq
doffed - 3 freq
debait - 1 freq
'david - 2 freq
debit - 2 freq
deft - 4 freq
depute - 21 freq
deived - 1 freq
dept - 7 freq
divvy-oot - 1 freq
depot - 1 freq
davyth - 1 freq
doupit - 1 freq
€œdauvit - 3 freq
€œdavid - 1 freq
dowped - 1 freq
€˜devout - 1 freq
deavit - 1 freq
daavid - 12 freq
daavd - 1 freq
dawpit - 1 freq
€œdaft - 1 freq
dpd - 1 freq
dbooth - 1 freq
dvd' - 1 freq
deepth - 1 freq
“daft - 1 freq
'dived' - 3 freq
dopyt - 1 freq
MetaPhone code - TBT
doubt - 86 freq
dabbed - 9 freq
debt - 44 freq
debate - 81 freq
doobt - 26 freq
dubbed - 2 freq
taibit - 1 freq
dabbit - 6 freq
tibet - 10 freq
dabaittie - 2 freq
dobbid - 1 freq
debut - 7 freq
daubed - 3 freq
debait - 1 freq
debit - 2 freq
€œtibet - 1 freq
DEBAIT
Time to execute Levenshtein function - 0.229427 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.329170 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027662 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037059 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000859 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.