A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to debates in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
debates (0) - 11 freq
debated (1) - 3 freq
debate (1) - 81 freq
debater (1) - 1 freq
relates (2) - 4 freq
debatin (2) - 3 freq
debatit (2) - 9 freq
dates (2) - 36 freq
deputes (2) - 1 freq
debt's (2) - 1 freq
delytes (2) - 1 freq
debts (2) - 8 freq
debased (2) - 1 freq
debaiten (2) - 1 freq
bates (2) - 5 freq
deaths (2) - 8 freq
negates (2) - 1 freq
decades (2) - 49 freq
deflated (3) - 9 freq
elated (3) - 3 freq
relaxes (3) - 2 freq
eats (3) - 30 freq
skates (3) - 6 freq
elbaes (3) - 1 freq
decanted (3) - 1 freq
debates (0) - 11 freq
debts (2) - 8 freq
debated (2) - 3 freq
debater (2) - 1 freq
debate (2) - 81 freq
debaiten (3) - 1 freq
delytes (3) - 1 freq
diabetes (3) - 6 freq
bates (3) - 5 freq
debt's (3) - 1 freq
debatin (3) - 3 freq
dates (3) - 36 freq
debatit (3) - 9 freq
deputes (3) - 1 freq
butes (4) - 2 freq
devotees (4) - 3 freq
debait (4) - 1 freq
delyts (4) - 1 freq
deities (4) - 2 freq
debut (4) - 7 freq
debtor (4) - 1 freq
debris (4) - 10 freq
deebaiten (4) - 1 freq
dotes (4) - 1 freq
deinties (4) - 2 freq
SoundEx code - D132
david's - 17 freq
daftish - 2 freq
dafties - 17 freq
dafties' - 1 freq
debts - 8 freq
divots - 5 freq
depths - 14 freq
doubts - 5 freq
devotees - 3 freq
divot's - 2 freq
dvds - 3 freq
depth's - 1 freq
diabetes - 6 freq
debt's - 1 freq
dabbities - 1 freq
davidson - 26 freq
daftest - 4 freq
diabetic - 2 freq
davit's - 1 freq
dippitest - 1 freq
devoto's - 1 freq
daavit's - 6 freq
doobts - 1 freq
deepths - 5 freq
daftie's - 10 freq
depts - 3 freq
debates - 11 freq
divides - 4 freq
daavid's - 1 freq
dauvit's - 1 freq
divits - 1 freq
davidson's - 1 freq
davidsons - 3 freq
deputes - 1 freq
depth-charges - 2 freq
devdas - 3 freq
deputyship - 1 freq
dafities - 1 freq
dtptsgmqe - 1 freq
davidjames - 3 freq
davidcameron - 1 freq
david’s - 1 freq
davidjmadden - 1 freq
dipduckdive - 3 freq
davidccraig - 1 freq
davidjwood - 3 freq
davidjewood - 1 freq
davidsonmagnus - 3 freq
diabetesuk - 1 freq
davidghfrost - 1 freq
duvets - 1 freq
davidschneider - 2 freq
davidwshedden - 1 freq
davidhawker - 1 freq
MetaPhone code - TBTS
debts - 8 freq
doubts - 5 freq
diabetes - 6 freq
debt's - 1 freq
dabbities - 1 freq
doobts - 1 freq
debates - 11 freq
DEBATES
Time to execute Levenshtein function - 0.328212 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.762948 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027702 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037012 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000866 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.