A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to debts in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
debts (0) - 8 freq
debs (1) - 1 freq
depts (1) - 3 freq
derts (1) - 2 freq
debt (1) - 44 freq
debt's (1) - 1 freq
der's (2) - 31 freq
delyts (2) - 1 freq
meets (2) - 33 freq
perts (2) - 6 freq
deans (2) - 3 freq
nests (2) - 27 freq
deis (2) - 1 freq
dedis (2) - 1 freq
delta (2) - 1 freq
diets (2) - 4 freq
des (2) - 23 freq
odets (2) - 1 freq
beuts (2) - 6 freq
bents (2) - 11 freq
dusts (2) - 2 freq
dee's (2) - 2 freq
gebs (2) - 2 freq
yeats (2) - 1 freq
dept (2) - 7 freq
debts (0) - 8 freq
debs (2) - 1 freq
doubts (2) - 5 freq
doobts (2) - 1 freq
debt's (2) - 1 freq
debates (2) - 11 freq
debt (2) - 44 freq
depts (2) - 3 freq
derts (2) - 2 freq
beets (3) - 49 freq
debit (3) - 2 freq
dabs (3) - 5 freq
dats (3) - 21 freq
debate (3) - 81 freq
dunts (3) - 22 freq
ducts (3) - 4 freq
dists (3) - 1 freq
dirts (3) - 1 freq
dints (3) - 2 freq
dbis (3) - 1 freq
dorts (3) - 8 freq
dwts (3) - 1 freq
debris (3) - 10 freq
douts (3) - 18 freq
doots (3) - 33 freq
SoundEx code - D132
david's - 17 freq
daftish - 2 freq
dafties - 17 freq
dafties' - 1 freq
debts - 8 freq
divots - 5 freq
depths - 13 freq
doubts - 5 freq
devotees - 3 freq
divot's - 2 freq
dvds - 3 freq
depth's - 1 freq
diabetes - 6 freq
debt's - 1 freq
dabbities - 1 freq
davidson - 26 freq
daftest - 4 freq
diabetic - 2 freq
davit's - 1 freq
dippitest - 1 freq
devoto's - 1 freq
daavit's - 6 freq
doobts - 1 freq
deepths - 5 freq
daftie's - 10 freq
depts - 3 freq
debates - 11 freq
divides - 4 freq
daavid's - 1 freq
dauvit's - 1 freq
divits - 1 freq
davidson's - 1 freq
davidsons - 3 freq
deputes - 1 freq
depth-charges - 2 freq
devdas - 3 freq
deputyship - 1 freq
dafities - 1 freq
dtptsgmqe - 1 freq
davidjames - 3 freq
davidcameron - 1 freq
david’s - 1 freq
davidjmadden - 1 freq
dipduckdive - 3 freq
davidccraig - 1 freq
davidjwood - 3 freq
davidjewood - 1 freq
davidsonmagnus - 3 freq
diabetesuk - 1 freq
davidghfrost - 1 freq
duvets - 1 freq
davidschneider - 2 freq
davidwshedden - 1 freq
davidhawker - 1 freq
MetaPhone code - TBTS
debts - 8 freq
doubts - 5 freq
diabetes - 6 freq
debt's - 1 freq
dabbities - 1 freq
doobts - 1 freq
debates - 11 freq
DEBTS
Time to execute Levenshtein function - 0.190262 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.368457 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028673 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.045093 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001005 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.