A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to derg in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
derg (0) - 3 freq
dern (1) - 21 freq
derf (1) - 2 freq
deg (1) - 2 freq
der (1) - 302 freq
derk (1) - 99 freq
dere (1) - 207 freq
derd (1) - 1 freq
ferg (1) - 1 freq
berg (1) - 1 freq
darg (1) - 185 freq
deek (2) - 43 freq
beig (2) - 6 freq
seg (2) - 1 freq
yerk (2) - 3 freq
deks (2) - 1 freq
serr (2) - 9 freq
drf (2) - 1 freq
dugg (2) - 3 freq
kert (2) - 8 freq
ger- (2) - 1 freq
merr (2) - 6 freq
borg (2) - 1 freq
mert (2) - 1 freq
devo (2) - 2 freq
derg (0) - 3 freq
darg (1) - 185 freq
dreg (2) - 4 freq
dairg (2) - 1 freq
dirge (2) - 5 freq
drog (2) - 3 freq
drag (2) - 43 freq
daurg (2) - 3 freq
berg (2) - 1 freq
drug (2) - 23 freq
der (2) - 302 freq
ferg (2) - 1 freq
derf (2) - 2 freq
dern (2) - 21 freq
derk (2) - 99 freq
deg (2) - 2 freq
derd (2) - 1 freq
dere (2) - 207 freq
deyr (3) - 1 freq
deere (3) - 3 freq
doag (3) - 53 freq
dir (3) - 400 freq
verge (3) - 9 freq
dong (3) - 5 freq
draig (3) - 2 freq
SoundEx code - D620
daurk - 195 freq
derk - 99 freq
dark - 378 freq
dreich - 158 freq
doric - 479 freq
direck - 30 freq
darg - 185 freq
doors - 205 freq
draigs - 1 freq
drush - 7 freq
draws - 29 freq
dairk - 67 freq
drees - 6 freq
drag - 43 freq
dregs - 12 freq
'drugs - 1 freq
dears - 12 freq
dress - 97 freq
drugs - 56 freq
dark' - 5 freq
daursay - 4 freq
dirk - 29 freq
darkie - 2 freq
driech - 13 freq
dreck - 7 freq
dross - 18 freq
diaries - 4 freq
derek - 40 freq
dores - 1 freq
dresse - 1 freq
dirge - 5 freq
dares - 5 freq
dreg - 4 freq
dures - 3 freq
drags - 8 freq
daurs - 2 freq
dairg - 1 freq
darcy - 4 freq
drog - 3 freq
derrick - 18 freq
dorik - 5 freq
dowers - 1 freq
derg - 3 freq
drachy - 1 freq
drug - 23 freq
door's - 10 freq
doorweys - 2 freq
daurg - 3 freq
dorrs - 7 freq
doric's - 5 freq
'doric' - 1 freq
daark - 11 freq
dargs - 4 freq
'drag - 1 freq
dora's - 1 freq
dork - 5 freq
doris - 58 freq
dure's - 1 freq
dreech - 2 freq
dries - 8 freq
daurk' - 1 freq
druggy - 1 freq
dere's - 3 freq
dirs - 28 freq
'der's - 3 freq
der's - 31 freq
der''s - 1 freq
draas - 12 freq
daresay - 3 freq
draig - 2 freq
deer's - 3 freq
direk - 2 freq
duirs - 5 freq
doars - 3 freq
dors - 1 freq
doris's - 3 freq
'dar's - 1 freq
daars - 1 freq
dær's - 1 freq
dhraws - 1 freq
droosy - 1 freq
drouk - 1 freq
drousy - 1 freq
dairies - 3 freq
'dark' - 1 freq
dirks - 4 freq
dark's - 1 freq
drake - 6 freq
dreiche - 1 freq
drousie - 1 freq
daarsay - 1 freq
droich - 4 freq
dairy's - 1 freq
dewar's - 1 freq
dareasay - 1 freq
drogs - 2 freq
droozie - 3 freq
drushy - 1 freq
'droich - 1 freq
derk' - 1 freq
doorways - 2 freq
€˜doric - 3 freq
€œderek - 3 freq
derek's - 1 freq
derkie - 3 freq
durk - 1 freq
drougs - 8 freq
droug - 1 freq
deiriss - 1 freq
Údarás - 1 freq
deorc - 2 freq
droosie - 1 freq
derick - 1 freq
€œdoris - 1 freq
doricy - 1 freq
druggie - 1 freq
€˜derek - 1 freq
drook - 1 freq
drouks - 1 freq
doorkey - 1 freq
duress - 4 freq
dawrk - 1 freq
doirc - 1 freq
€˜dark - 2 freq
'dark - 1 freq
drowsy - 1 freq
dorics - 2 freq
dtthrrjq - 1 freq
dreich” - 1 freq
dirÂ’s - 2 freq
dir's - 6 freq
drqs - 1 freq
doras - 1 freq
droayios - 1 freq
dreic - 1 freq
MetaPhone code - TRK
track - 127 freq
daurk - 195 freq
derk - 99 freq
dark - 378 freq
traik - 10 freq
trock - 12 freq
doric - 479 freq
treck - 7 freq
direck - 30 freq
darg - 185 freq
turkey - 41 freq
dairk - 67 freq
drag - 43 freq
troke - 17 freq
trig - 41 freq
turk - 6 freq
tracky - 12 freq
trick - 68 freq
trek - 13 freq
dark' - 5 freq
dirk - 29 freq
darkie - 2 freq
truck - 48 freq
trackie - 9 freq
dreck - 7 freq
derek - 40 freq
dreg - 4 freq
taerag - 1 freq
trickie - 5 freq
dairg - 1 freq
drog - 3 freq
derrick - 18 freq
dorik - 5 freq
treik - 1 freq
trike - 2 freq
derg - 3 freq
drug - 23 freq
tricky - 9 freq
torag - 1 freq
daurg - 3 freq
'doric' - 1 freq
daark - 11 freq
'drag - 1 freq
dork - 5 freq
daurk' - 1 freq
druggy - 1 freq
draig - 2 freq
tirrick - 6 freq
direk - 2 freq
treki - 1 freq
drouk - 1 freq
'dark' - 1 freq
drake - 6 freq
targ - 2 freq
'troke' - 1 freq
derk' - 1 freq
€˜doric - 3 freq
€œderek - 3 freq
derkie - 3 freq
durk - 1 freq
trak - 3 freq
droug - 1 freq
deorc - 2 freq
derick - 1 freq
€œturkey - 2 freq
'tirrick' - 1 freq
terk - 1 freq
€˜trick - 1 freq
druggie - 1 freq
€˜derek - 1 freq
drook - 1 freq
doorkey - 1 freq
dawrk - 1 freq
doirc - 1 freq
€˜dark - 2 freq
'dark - 1 freq
torryq - 5 freq
terreg - 1 freq
ttruq - 1 freq
'turkey' - 1 freq
dreic - 1 freq
DERG
Time to execute Levenshtein function - 0.291102 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.566254 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028741 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.068274 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000810 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.