A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to dang in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
dang (0) - 12 freq
dann (1) - 9 freq
dawg (1) - 2 freq
gang (1) - 1098 freq
ding (1) - 87 freq
dong (1) - 5 freq
fang (1) - 12 freq
wang (1) - 6 freq
yang (1) - 3 freq
dane (1) - 55 freq
pang (1) - 3 freq
ang (1) - 8 freq
mang (1) - 17 freq
tang (1) - 50 freq
daing (1) - 1 freq
jang (1) - 3 freq
bang (1) - 97 freq
iang (1) - 1 freq
kang (1) - 1 freq
hang (1) - 48 freq
dag (1) - 2 freq
dandg (1) - 5 freq
rang (1) - 82 freq
lang (1) - 3210 freq
dana (1) - 1 freq
dang (0) - 12 freq
daing (1) - 1 freq
dung (1) - 26 freq
ding (1) - 87 freq
dong (1) - 5 freq
dandg (2) - 5 freq
dans (2) - 3 freq
rang (2) - 82 freq
sang (2) - 611 freq
lang (2) - 3210 freq
dana (2) - 1 freq
dan (2) - 472 freq
darg (2) - 185 freq
dingo (2) - 1 freq
doing (2) - 82 freq
daeing (2) - 2 freq
dying (2) - 15 freq
dag (2) - 2 freq
dant (2) - 1 freq
dingy (2) - 1 freq
dank (2) - 11 freq
dawg (2) - 2 freq
yang (2) - 3 freq
dane (2) - 55 freq
fang (2) - 12 freq
SoundEx code - D520
deems - 6 freq
ding - 87 freq
dying - 15 freq
dinah's - 5 freq
damage - 45 freq
dance - 376 freq
denies - 12 freq
dwynes - 10 freq
dwaums - 5 freq
damies - 1 freq
dennis's - 1 freq
dings - 16 freq
dwams - 17 freq
dung - 26 freq
dens - 20 freq
dams - 6 freq
dam-s - 1 freq
daunce - 42 freq
demise - 12 freq
danny's - 7 freq
dimaggio - 1 freq
doing - 82 freq
dunkey - 19 freq
dunkie - 5 freq
deen's - 3 freq
donk - 1 freq
dawn's - 2 freq
dinghy - 3 freq
dims - 3 freq
denish' - 1 freq
dionysia - 1 freq
douns - 2 freq
dwines - 6 freq
dons - 31 freq
dunce - 27 freq
dennis - 11 freq
dunes - 6 freq
donkey - 16 freq
donsie - 2 freq
daeins - 125 freq
doons - 10 freq
dames - 4 freq
dang - 12 freq
dense - 12 freq
domes - 2 freq
danes - 2 freq
damayge - 1 freq
dammeyge - 1 freq
danns - 1 freq
damege - 1 freq
don's - 8 freq
dunse - 1 freq
daein's - 1 freq
dunk - 7 freq
dinnis - 1 freq
danse - 4 freq
donks - 5 freq
doensae - 2 freq
doms - 3 freq
deemies - 1 freq
dems - 6 freq
duns - 2 freq
dmnk - 1 freq
'daunce - 1 freq
doyens - 1 freq
dunch - 27 freq
danish - 19 freq
do-eeng - 1 freq
do-ing - 1 freq
damask - 2 freq
dimewise - 1 freq
denys - 1 freq
dinng - 1 freq
daince - 1 freq
dunky - 1 freq
dina's - 3 freq
deeins - 7 freq
dank - 11 freq
dinkie - 3 freq
denis - 11 freq
damish - 4 freq
danss - 2 freq
daunss - 1 freq
domms - 2 freq
dwangs - 3 freq
'ding - 4 freq
dong - 5 freq
daeneesh - 1 freq
daeings - 3 freq
deans - 3 freq
deanies - 1 freq
damns - 2 freq
dwyns - 1 freq
dwymes - 1 freq
doonways - 1 freq
dwance - 1 freq
dawins - 1 freq
diine's - 1 freq
dummies - 4 freq
'dance - 1 freq
deemikie - 2 freq
dinks - 1 freq
donne's - 1 freq
donnachie - 1 freq
deames' - 1 freq
dywnes - 1 freq
deimos - 1 freq
dinnaes - 1 freq
dawns - 5 freq
dons' - 1 freq
dunsh - 1 freq
dooeeng - 1 freq
domsie - 1 freq
dink - 2 freq
dans - 3 freq
dingey - 1 freq
deeing - 5 freq
diniz - 1 freq
daimige - 1 freq
dame's' - 1 freq
'daeing' - 1 freq
daing - 1 freq
dowiness - 1 freq
dooms - 1 freq
demos- - 2 freq
demos - 2 freq
dansk - 1 freq
dannsa - 1 freq
daeing - 2 freq
deans's - 1 freq
deuing - 1 freq
dingy - 1 freq
dinky - 1 freq
dms - 1 freq
dynesy - 1 freq
dingo - 1 freq
dunskey - 5 freq
dmgjei - 1 freq
donsÂ’ - 2 freq
dunns - 1 freq
dunc - 4 freq
dnycq - 1 freq
dmjco - 1 freq
dmxyo - 1 freq
dimmock - 1 freq
damij - 1 freq
dancey - 1 freq
dhmck - 1 freq
dunak's - 1 freq
dunnocks - 1 freq
donna's - 1 freq
denise - 1 freq
demoss - 1 freq
dumsch - 1 freq
dunsy - 2 freq
dunnock - 1 freq
MetaPhone code - TNK
tang - 50 freq
tongue - 426 freq
ding - 87 freq
tung - 403 freq
dung - 26 freq
tonic - 13 freq
tank - 25 freq
doing - 82 freq
dunkey - 19 freq
dunkie - 5 freq
donk - 1 freq
donkey - 16 freq
ting - 49 freq
dang - 12 freq
tink - 195 freq
taing - 10 freq
tunic - 20 freq
tinkie - 5 freq
tng - 1 freq
tunk - 6 freq
toung - 7 freq
tango - 10 freq
dunk - 7 freq
tongue' - 7 freq
tankie - 1 freq
do-eeng - 1 freq
do-ing - 1 freq
dinng - 1 freq
tangue - 1 freq
'tonic - 1 freq
dunky - 1 freq
dank - 11 freq
ting' - 1 freq
dinkie - 3 freq
tong - 6 freq
'ding - 4 freq
dong - 5 freq
tank- - 1 freq
tanga - 1 freq
tonka - 2 freq
dooeeng - 1 freq
dink - 2 freq
tinky - 2 freq
deeing - 5 freq
'daeing' - 1 freq
tonga - 1 freq
daing - 1 freq
€˜ting - 1 freq
daeing - 2 freq
teng - 1 freq
deuing - 1 freq
tink' - 1 freq
dinky - 1 freq
dingo - 1 freq
dunc - 4 freq
ydiunc - 1 freq
dunnock - 1 freq
DANG
Time to execute Levenshtein function - 0.260444 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.398952 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.039050 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.052034 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001203 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.