A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie


Similar words to drug-lairds in Corpus

Methods: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated
Levenshtein distance
drug-lairds (0) - 1 freq
regairds (4) - 15 freq
dug-eared (5) - 1 freq
rig-heids (5) - 1 freq
rewairds (5) - 1 freq
saufgairds (5) - 1 freq
graveyairds (5) - 4 freq
drug-ring (5) - 1 freq
douglass (5) - 4 freq
druggies (5) - 1 freq
drumblade (5) - 1 freq
drumlanrig (5) - 1 freq
guairds (5) - 21 freq
landlairds (5) - 1 freq
bruntlands (5) - 4 freq
drunkarts (5) - 1 freq
dee-haird (5) - 1 freq
baudelaires (5) - 14 freq
dreglins (5) - 1 freq
gairds (5) - 16 freq
drumlins (5) - 5 freq
dounlaid (5) - 1 freq
dounwards (5) - 1 freq
douglas's (5) - 10 freq
rugbyaid (5) - 1 freq
Double Levenshtein distance
drug-lairds (0) - 1 freq
regairds (7) - 15 freq
regulars (8) - 16 freq
ower-lords (8) - 1 freq
daurg-days (8) - 1 freq
diregaird (8) - 1 freq
drug-ring (8) - 1 freq
rig-heids (8) - 1 freq
dug-eared (8) - 1 freq
regards (8) - 11 freq
dreglins (8) - 1 freq
dugalds (8) - 1 freq
dug-chains (9) - 2 freq
druids (9) - 5 freq
grave-birds (9) - 1 freq
douglases (9) - 5 freq
drum-baets (9) - 1 freq
douglas (9) - 270 freq
regaird (9) - 39 freq
derk-haired (9) - 1 freq
dreggled (9) - 2 freq
air-raids (9) - 2 freq
daurk-haired (9) - 2 freq
duty-boards (9) - 1 freq
draiglins (9) - 1 freq
SoundEx code - D624
draiglins - 1 freq
droukelt - 2 freq
daurklins - 3 freq
derklins - 2 freq
dreglins - 1 freq
dreezle - 1 freq
draiglt - 1 freq
drizzle - 13 freq
daurklin - 2 freq
darklins - 1 freq
darkly - 3 freq
dracula - 4 freq
darkle - 1 freq
drochle - 5 freq
darjeeling - 2 freq
dark-sweelin - 1 freq
dreechle's - 1 freq
dairkly - 1 freq
darkley - 7 freq
direckly - 9 freq
dreichly - 3 freq
dreggled - 2 freq
draggelt - 2 freq
dreezlin - 1 freq
drochles - 1 freq
drookled - 3 freq
dry-cleaned - 1 freq
dorsal - 1 freq
direk-lik - 1 freq
dreich-lookin - 1 freq
dreichly-dressed - 1 freq
drug-lairds - 1 freq
derkly - 1 freq
draigelt - 2 freq
draigless - 1 freq
drizzil - 1 freq
draigglin - 1 freq
draiggled - 2 freq
draiglety - 2 freq
draggled - 1 freq
drookle - 2 freq
drookleen - 1 freq
drooklin - 1 freq
drochlin - 2 freq
draigled - 1 freq
drochlinest - 1 freq
draiglty - 1 freq
derklie - 1 freq
derklin - 1 freq
daurkly - 1 freq
direcklie - 1 freq
dreezil - 2 freq
draigle - 1 freq
darklin - 2 freq
dursley - 48 freq
dursleys - 5 freq
drizzled - 2 freq
draiglin - 1 freq
drowsily - 1 freq
MetaPhone code - TRKLRTS
drug-lairds - 1 freq
Manually curated - DRUG-LAIRDS
Time to execute Levenshtein function - 0.230491 milliseconds
The Levenshtein distance is the number of single-character replacements, insertions or deletions needed to transform one word into another. It's useful for detecting typos and alternative spellings.
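As a rough illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming calculation. It is not the code this site runs (which is not shown); the function names `levenshtein` and `nearest` are made up for the example, and the word list is just a few entries copied from the results above.

```python
def levenshtein(a: str, b: str) -> int:
    """Number of single-character edits needed to turn a into b."""
    # prev[j] holds the distance between the first i-1 chars of a
    # and the first j chars of b.
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]  # distance from a[:i] to the empty string
        for j, cb in enumerate(b, 1):
            cur.append(min(
                prev[j] + 1,               # delete ca
                cur[j - 1] + 1,            # insert cb
                prev[j - 1] + (ca != cb),  # replace (free if equal)
            ))
        prev = cur
    return prev[-1]


def nearest(query: str, vocabulary: list[str], limit: int = 5) -> list[str]:
    """Hypothetical helper: vocabulary sorted by distance, ties alphabetical."""
    return sorted(vocabulary, key=lambda w: (levenshtein(query, w), w))[:limit]


sample = ["druids", "regairds", "drug-lairds"]  # words taken from the list above
print(nearest("drug-lairds", sample))
```

For drug-lairds and regairds this gives 4, which agrees with the figure in the Levenshtein list.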
Time to execute Double Levenshtein function - 0.423841 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
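The idea can be sketched in Python as below. This is a guess at the approach rather than the site's own code: it assumes "vowels" means a, e, i, o and u only (y is kept), and the names `strip_vowels` and `double_levenshtein` are invented for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Plain edit distance (replace, insert, delete all cost 1)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1, cur[j - 1] + 1, prev[j - 1] + (ca != cb)))
        prev = cur
    return prev[-1]


VOWELS = set("aeiou")  # assumption: y is treated as a consonant


def strip_vowels(word: str) -> str:
    """Drop vowels, keeping consonants, hyphens and apostrophes."""
    return "".join(c for c in word if c.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Full-word distance plus the distance between the vowel-less forms."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("drug-lairds", "regairds"))
```

Under these assumptions drug-lairds against regairds scores 4 + 3 = 7, and against druids 5 + 4 = 9, which agrees with the Double Levenshtein list above.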
Time to execute SoundEx function - 0.027880 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
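To show how very different spellings end up under one code, here is a small Python sketch of the common American Soundex rules (keep the first letter, code the following consonants as digits, skip vowels, collapse repeated codes, pad to four characters). It is an illustration only; the site most likely calls a built-in function, and edge cases may differ. Non-letters such as the hyphen are simply ignored, which is an assumption.

```python
# Consonant groups and their Soundex digits.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word: str) -> str:
    """Four-character Soundex code, e.g. a letter followed by three digits."""
    letters = [c for c in word.lower() if c.isascii() and c.isalpha()]
    if not letters:
        return ""
    result = letters[0].upper()
    prev = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate two equal codes
        code = CODES.get(ch, "")
        if code and code != prev:
            result += code
            if len(result) == 4:
                break
        prev = code  # a vowel resets prev, so a repeated code counts again
    return result.ljust(4, "0")


for w in ("drug-lairds", "draiglins", "dorsal"):
    print(w, soundex(w))
```

All three words come out as D624, which is why the SoundEx list above mixes drug-lairds with words like dorsal and drizzle: only the first letter and the first three coded consonants are compared.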
Time to execute MetaPhone function - 0.037767 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000916 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
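A lookup of this kind could be sketched in Python as a plain dictionary. The table below is a stand-in, not the real lexicon: the first entry follows the output shown above for drug-lairds, the second is a hypothetical spelling variant, and the names `LEXICON` and `curated_lemma` are invented for the example.

```python
from typing import Optional

# Stand-in lexicon: word form -> lemma.
LEXICON = {
    "drug-lairds": "DRUG-LAIRDS",  # as shown in the results above
    "drug-lards": "DRUG-LAIRDS",   # hypothetical typo entry, for illustration
}


def curated_lemma(word: str, lexicon: dict = LEXICON) -> Optional[str]:
    """Return the curated lemma, or None when the word is not covered."""
    return lexicon.get(word.lower())


print(curated_lemma("drug-lairds"))
print(curated_lemma("regairds"))  # not in the stand-in table, so None
```

A dictionary lookup does no computation beyond hashing the word, which fits with this being the quickest of the five timings reported above.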