A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to divits in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
divits (0) - 1 freq
digits (1) - 3 freq
divots (1) - 5 freq
dirts (2) - 1 freq
davis (2) - 7 freq
deivils (2) - 1 freq
divot's (2) - 2 freq
divi (2) - 2 freq
daivit (2) - 4 freq
duvets (2) - 1 freq
dists (2) - 1 freq
rivets (2) - 1 freq
divine (2) - 29 freq
limits (2) - 15 freq
divid (2) - 10 freq
divil's (2) - 1 freq
divil (2) - 17 freq
divines (2) - 1 freq
div's (2) - 1 freq
dives (2) - 7 freq
rivit (2) - 2 freq
visits (2) - 24 freq
diverts (2) - 2 freq
diets (2) - 5 freq
divvies (2) - 1 freq
divits (0) - 1 freq
divots (1) - 5 freq
duvets (2) - 1 freq
digits (2) - 3 freq
divvies (3) - 1 freq
diets (3) - 5 freq
dives (3) - 7 freq
div's (3) - 1 freq
dints (3) - 2 freq
diverts (3) - 2 freq
davit (3) - 23 freq
davies (3) - 13 freq
divers (3) - 10 freq
divides (3) - 4 freq
divot (3) - 13 freq
divines (3) - 1 freq
davit's (3) - 1 freq
devils (3) - 1 freq
daivit (3) - 4 freq
dists (3) - 1 freq
divot's (3) - 2 freq
deivils (3) - 1 freq
dirts (3) - 1 freq
rivets (3) - 1 freq
davis (3) - 7 freq
SoundEx code - D132
david's - 17 freq
daftish - 2 freq
dafties - 17 freq
dafties' - 1 freq
debts - 8 freq
divots - 5 freq
depths - 14 freq
doubts - 5 freq
devotees - 3 freq
divot's - 2 freq
dvds - 3 freq
depth's - 1 freq
diabetes - 6 freq
debt's - 1 freq
dabbities - 1 freq
davidson - 26 freq
daftest - 4 freq
diabetic - 2 freq
davit's - 1 freq
dippitest - 1 freq
devoto's - 1 freq
daavit's - 6 freq
doobts - 1 freq
deepths - 5 freq
daftie's - 10 freq
depts - 3 freq
debates - 11 freq
divides - 4 freq
daavid's - 1 freq
dauvit's - 1 freq
divits - 1 freq
davidson's - 1 freq
davidsons - 3 freq
deputes - 1 freq
depth-charges - 2 freq
devdas - 3 freq
deputyship - 1 freq
dafities - 1 freq
dtptsgmqe - 1 freq
davidjames - 3 freq
davidcameron - 1 freq
davidÂ’s - 1 freq
davidjmadden - 1 freq
dipduckdive - 3 freq
davidccraig - 1 freq
davidjwood - 3 freq
davidjewood - 1 freq
davidsonmagnus - 3 freq
diabetesuk - 1 freq
davidghfrost - 1 freq
duvets - 1 freq
davidschneider - 2 freq
davidwshedden - 1 freq
davidhawker - 1 freq
MetaPhone code - TFTS
david's - 17 freq
dafties - 17 freq
dafties' - 1 freq
divots - 5 freq
devotees - 3 freq
dights - 5 freq
divot's - 2 freq
dvds - 3 freq
tufty's - 3 freq
tuftie's - 1 freq
tuffty's - 1 freq
tights - 15 freq
davit's - 1 freq
devoto's - 1 freq
daavit's - 6 freq
tufts - 2 freq
daftie's - 10 freq
divides - 4 freq
daavid's - 1 freq
dauvit's - 1 freq
divits - 1 freq
€œtights - 1 freq
devdas - 3 freq
dafities - 1 freq
davidÂ’s - 1 freq
duvets - 1 freq
DIVITS
Time to execute Levenshtein function - 0.232593 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.361269 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027804 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037264 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000834 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.