A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to davidschneider in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
davidschneider (0) - 2 freq
daveshnedders (5) - 1 freq
davidwshedden (5) - 1 freq
davidalexander (6) - 1 freq
davidmcandrew (6) - 10 freq
davidlinden (6) - 2 freq
davidhawker (6) - 1 freq
davidofficer (6) - 1 freq
maidenheid (7) - 1 freq
lavished (7) - 1 freq
daichie (7) - 1 freq
davidmcintosh (7) - 1 freq
davidjmadden (7) - 1 freq
schreuder (7) - 14 freq
reid-heided (7) - 3 freq
davidpreece (7) - 1 freq
devihighlander (7) - 1 freq
heid-snedder (7) - 5 freq
davidtannertv (7) - 2 freq
davidson (7) - 26 freq
bald-heided (7) - 1 freq
baldy-heided (7) - 1 freq
davidson's (7) - 1 freq
davidccraig (7) - 1 freq
davidbowie (7) - 1 freq
davidschneider (0) - 2 freq
davidmcandrew (9) - 10 freq
davidalexander (9) - 1 freq
davidwshedden (9) - 1 freq
daveshnedders (9) - 1 freq
davidhawker (10) - 1 freq
davidlinden (10) - 2 freq
devihighlander (11) - 1 freq
davidson (11) - 26 freq
davidsons (11) - 3 freq
schreuder (11) - 14 freq
davidson's (11) - 1 freq
davidofficer (11) - 1 freq
davidleemedia (12) - 1 freq
deid-centre (12) - 1 freq
discoveded (12) - 1 freq
descended (12) - 6 freq
davidjmadden (12) - 1 freq
katerschnurr (12) - 1 freq
davidmunro (12) - 1 freq
dividends (12) - 1 freq
vicschoen (12) - 1 freq
dagidder (12) - 3 freq
devonshire (12) - 1 freq
davidcameron (12) - 1 freq
SoundEx code - D132
david's - 17 freq
daftish - 2 freq
dafties - 17 freq
dafties' - 1 freq
debts - 8 freq
divots - 5 freq
depths - 14 freq
doubts - 5 freq
devotees - 3 freq
divot's - 2 freq
dvds - 3 freq
depth's - 1 freq
diabetes - 6 freq
debt's - 1 freq
dabbities - 1 freq
davidson - 26 freq
daftest - 4 freq
diabetic - 2 freq
davit's - 1 freq
dippitest - 1 freq
devoto's - 1 freq
daavit's - 6 freq
doobts - 1 freq
deepths - 5 freq
daftie's - 10 freq
depts - 3 freq
debates - 11 freq
divides - 4 freq
daavid's - 1 freq
dauvit's - 1 freq
divits - 1 freq
davidson's - 1 freq
davidsons - 3 freq
deputes - 1 freq
depth-charges - 2 freq
devdas - 3 freq
deputyship - 1 freq
dafities - 1 freq
dtptsgmqe - 1 freq
davidjames - 3 freq
davidcameron - 1 freq
david’s - 1 freq
davidjmadden - 1 freq
dipduckdive - 3 freq
davidccraig - 1 freq
davidjwood - 3 freq
davidjewood - 1 freq
davidsonmagnus - 3 freq
diabetesuk - 1 freq
davidghfrost - 1 freq
duvets - 1 freq
davidschneider - 2 freq
davidwshedden - 1 freq
davidhawker - 1 freq
MetaPhone code - TFTSXNTR
davidschneider - 2 freq
DAVIDSCHNEIDER
Time to execute Levenshtein function - 0.447995 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.523614 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.029879 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037808 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000864 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.