A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to distinguish in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
distinguish (0) - 6 freq
distinguisht (1) - 1 freq
distinguishes (2) - 2 freq
distinguished (2) - 3 freq
distinguishin (2) - 3 freq
disteinguishes (3) - 1 freq
distinguishing (3) - 1 freq
distingwished (3) - 1 freq
distinction (4) - 26 freq
diminish (4) - 3 freq
distinctive (4) - 16 freq
disguise (4) - 14 freq
diminisht (5) - 1 freq
stings (5) - 9 freq
disteinction (5) - 2 freq
destination (5) - 20 freq
distincitve (5) - 1 freq
distancing (5) - 14 freq
listing (5) - 4 freq
distense (5) - 1 freq
risings (5) - 2 freq
datings (5) - 1 freq
disteinctive (5) - 2 freq
destinies (5) - 1 freq
dusting (5) - 1 freq
distinguish (0) - 6 freq
distinguisht (2) - 1 freq
distinguishin (3) - 3 freq
distinguished (3) - 3 freq
distinguishes (3) - 2 freq
disteinguishes (4) - 1 freq
distingwished (5) - 1 freq
distinguishing (5) - 1 freq
desings (7) - 1 freq
distends (7) - 1 freq
distense (7) - 1 freq
hustings (7) - 7 freq
dusting (7) - 1 freq
datings (7) - 1 freq
stonefish (7) - 24 freq
disguise (7) - 14 freq
diminish (7) - 3 freq
distinction (7) - 26 freq
distinctive (7) - 16 freq
destinies (7) - 1 freq
dustin's (7) - 1 freq
stings (7) - 9 freq
distances (7) - 10 freq
distrust (8) - 3 freq
distinct (8) - 34 freq
SoundEx code - D235
distance - 173 freq
distinguishing - 1 freq
distance-bit - 1 freq
distant - 54 freq
decidin - 17 freq
distinct - 34 freq
destination - 20 freq
dichtin - 40 freq
dochtna - 2 freq
distinguish - 6 freq
destiny - 12 freq
distance-but - 3 freq
distancepit - 1 freq
daikitin - 1 freq
dustin - 12 freq
distinckt - 1 freq
distinguished - 3 freq
diction - 9 freq
distantly - 2 freq
dictionary - 124 freq
disdain - 10 freq
distempered - 3 freq
disdainit - 1 freq
disdainin - 1 freq
dictionars - 46 freq
distense - 1 freq
dichtin' - 1 freq
distain - 1 freq
distanced - 4 freq
destinies - 1 freq
destined - 12 freq
distinctive - 16 freq
distinctly - 4 freq
destinie - 3 freq
dissydents - 4 freq
dissydent - 1 freq
decidein - 1 freq
distinctively - 1 freq
distincitve - 1 freq
distinction - 26 freq
destinations - 3 freq
distanee - 2 freq
dictionaries - 40 freq
dictioun - 1 freq
dictionar - 95 freq
disdainfully - 1 freq
decadence - 2 freq
distinguisht - 1 freq
distances - 10 freq
dictionary' - 5 freq
'dictionary - 2 freq
'dictionar' - 1 freq
destinautioun - 4 freq
dightan - 1 freq
diastimeter - 1 freq
duggidniss - 1 freq
distancin - 10 freq
dyghtin - 1 freq
disteinction - 2 freq
distante - 2 freq
distinctless - 1 freq
distinguishes - 2 freq
distinguishin - 3 freq
distinctions - 2 freq
dictionairs - 1 freq
dictionaries' - 1 freq
dictum - 1 freq
'dictionary' - 1 freq
disteinguishes - 1 freq
distinction's - 1 freq
desydin - 1 freq
douchtna - 1 freq
disteenction - 3 freq
disteinctive - 2 freq
disteenctive - 1 freq
dictionar-dredgin - 1 freq
dusting - 1 freq
dictionar-traalin - 1 freq
deciding - 4 freq
dustin's - 1 freq
distemper - 1 freq
dichotomy - 1 freq
distends - 1 freq
dichotomies - 1 freq
dixie-deenies - 1 freq
€˜destiny - 1 freq
€œdestination - 1 freq
deistin - 3 freq
disteenctions - 1 freq
dissidents - 2 freq
distinctiveness - 1 freq
distinkshins - 1 freq
dcdndbfj - 1 freq
distant' - 1 freq
degwdm - 1 freq
distince - 1 freq
dichting - 1 freq
distancing - 14 freq
dougiedonnelly - 1 freq
distingwished - 1 freq
dgstandard - 2 freq
dustmopp - 1 freq
disstunt - 1 freq
MetaPhone code - TSTNKX
distinguish - 6 freq
DISTINGUISH
Time to execute Levenshtein function - 0.241281 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.420226 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.030057 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.038315 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000839 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.