A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

Similar words to dung in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
Levenshtein (distance in parentheses)
dung (0) - 26 freq
bung (1) - 2 freq
duns (1) - 2 freq
jung (1) - 3 freq
ung (1) - 1 freq
dug (1) - 575 freq
tung (1) - 403 freq
dunt (1) - 92 freq
rung (1) - 25 freq
ding (1) - 87 freq
fung (1) - 2 freq
dunn (1) - 8 freq
-ung (1) - 1 freq
dun' (1) - 2 freq
yung (1) - 72 freq
dang (1) - 12 freq
dunc (1) - 4 freq
dunk (1) - 7 freq
dong (1) - 5 freq
dun (1) - 36 freq
dune (1) - 103 freq
dugg (1) - 3 freq
sung (1) - 68 freq
lung (1) - 13 freq
hung (1) - 158 freq
Double Levenshtein (summed distance in parentheses)
dung (0) - 26 freq
dang (1) - 12 freq
ding (1) - 87 freq
dong (1) - 5 freq
sung (2) - 68 freq
dugg (2) - 3 freq
doing (2) - 82 freq
dune (2) - 103 freq
hung (2) - 158 freq
deuing (2) - 1 freq
daing (2) - 1 freq
dingo (2) - 1 freq
dying (2) - 15 freq
dunk (2) - 7 freq
dingy (2) - 1 freq
lung (2) - 13 freq
dun (2) - 36 freq
dunc (2) - 4 freq
tung (2) - 403 freq
dug (2) - 575 freq
duns (2) - 2 freq
ung (2) - 1 freq
jung (2) - 3 freq
bung (2) - 2 freq
dunt (2) - 92 freq
SoundEx code - D520
deems - 6 freq
ding - 87 freq
dying - 15 freq
dinah's - 5 freq
damage - 45 freq
dance - 376 freq
denies - 12 freq
dwynes - 10 freq
dwaums - 5 freq
damies - 1 freq
dennis's - 1 freq
dings - 16 freq
dwams - 17 freq
dung - 26 freq
dens - 20 freq
dams - 6 freq
dam-s - 1 freq
daunce - 42 freq
demise - 12 freq
danny's - 7 freq
dimaggio - 1 freq
doing - 82 freq
dunkey - 19 freq
dunkie - 5 freq
deen's - 3 freq
donk - 1 freq
dawn's - 2 freq
dinghy - 3 freq
dims - 3 freq
denish' - 1 freq
dionysia - 1 freq
douns - 2 freq
dwines - 6 freq
dons - 31 freq
dunce - 27 freq
dennis - 11 freq
dunes - 6 freq
donkey - 16 freq
donsie - 2 freq
daeins - 125 freq
doons - 10 freq
dames - 4 freq
dang - 12 freq
dense - 12 freq
domes - 2 freq
danes - 2 freq
damayge - 1 freq
dammeyge - 1 freq
danns - 1 freq
damege - 1 freq
don's - 8 freq
dunse - 1 freq
daein's - 1 freq
dunk - 7 freq
dinnis - 1 freq
danse - 4 freq
donks - 5 freq
doensae - 2 freq
doms - 3 freq
deemies - 1 freq
dems - 6 freq
duns - 2 freq
dmnk - 1 freq
'daunce - 1 freq
doyens - 1 freq
dunch - 27 freq
danish - 19 freq
do-eeng - 1 freq
do-ing - 1 freq
damask - 2 freq
dimewise - 1 freq
denys - 1 freq
dinng - 1 freq
daince - 1 freq
dunky - 1 freq
dina's - 3 freq
deeins - 7 freq
dank - 11 freq
dinkie - 3 freq
denis - 11 freq
damish - 4 freq
danss - 2 freq
daunss - 1 freq
domms - 2 freq
dwangs - 3 freq
'ding - 4 freq
dong - 5 freq
daeneesh - 1 freq
daeings - 3 freq
deans - 3 freq
deanies - 1 freq
damns - 2 freq
dwyns - 1 freq
dwymes - 1 freq
doonways - 1 freq
dwance - 1 freq
dawins - 1 freq
diine's - 1 freq
dummies - 4 freq
'dance - 1 freq
deemikie - 2 freq
dinks - 1 freq
donne's - 1 freq
donnachie - 1 freq
deames' - 1 freq
dywnes - 1 freq
deimos - 1 freq
dinnaes - 1 freq
dawns - 5 freq
dons' - 1 freq
dunsh - 1 freq
dooeeng - 1 freq
domsie - 1 freq
dink - 2 freq
dans - 3 freq
dingey - 1 freq
deeing - 5 freq
diniz - 1 freq
daimige - 1 freq
dame's' - 1 freq
'daeing' - 1 freq
daing - 1 freq
dowiness - 1 freq
dooms - 1 freq
demos- - 2 freq
demos - 2 freq
dansk - 1 freq
dannsa - 1 freq
daeing - 2 freq
deans's - 1 freq
deuing - 1 freq
dingy - 1 freq
dinky - 1 freq
dms - 1 freq
dynesy - 1 freq
dingo - 1 freq
dunskey - 5 freq
dmgjei - 1 freq
dons’ - 2 freq
dunns - 1 freq
dunc - 4 freq
dnycq - 1 freq
dmjco - 1 freq
dmxyo - 1 freq
dimmock - 1 freq
damij - 1 freq
dancey - 1 freq
dhmck - 1 freq
dunak's - 1 freq
dunnocks - 1 freq
donna's - 1 freq
denise - 1 freq
demoss - 1 freq
dumsch - 1 freq
dunsy - 2 freq
dunnock - 1 freq
MetaPhone code - TNK
tang - 50 freq
tongue - 426 freq
ding - 87 freq
tung - 403 freq
dung - 26 freq
tonic - 13 freq
tank - 25 freq
doing - 82 freq
dunkey - 19 freq
dunkie - 5 freq
donk - 1 freq
donkey - 16 freq
ting - 49 freq
dang - 12 freq
tink - 195 freq
taing - 10 freq
tunic - 20 freq
tinkie - 5 freq
tng - 1 freq
tunk - 6 freq
toung - 7 freq
tango - 10 freq
dunk - 7 freq
tongue' - 7 freq
tankie - 1 freq
do-eeng - 1 freq
do-ing - 1 freq
dinng - 1 freq
tangue - 1 freq
'tonic - 1 freq
dunky - 1 freq
dank - 11 freq
ting' - 1 freq
dinkie - 3 freq
tong - 6 freq
'ding - 4 freq
dong - 5 freq
tank- - 1 freq
tanga - 1 freq
tonka - 2 freq
dooeeng - 1 freq
dink - 2 freq
tinky - 2 freq
deeing - 5 freq
'daeing' - 1 freq
tonga - 1 freq
daing - 1 freq
‘ting - 1 freq
daeing - 2 freq
teng - 1 freq
deuing - 1 freq
tink' - 1 freq
dinky - 1 freq
dingo - 1 freq
dunc - 4 freq
ydiunc - 1 freq
dunnock - 1 freq
Manually curated - DUNG
Time to execute Levenshtein function - 0.530819 milliseconds
The Levenshtein distance is the minimum number of characters you have to replace, insert or delete to transform one word into another. It is useful for detecting typos and alternative spellings.
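As a rough illustration of that definition, here is a minimal Python sketch of the standard dynamic-programming calculation. It is not the code behind this page; the function name `levenshtein` is chosen for the example only.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits needed to turn a into b."""
    # previous[j] holds the distance between the processed prefix of a
    # and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning i characters of a into an empty string
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (or keep it)
            ))
        previous = current
    return previous[-1]


print(levenshtein("dung", "ding"))   # -> 1
print(levenshtein("dung", "doing"))  # -> 2
```

Only two rows of the table are kept at a time, so memory use grows with the length of the second word rather than with the product of both lengths.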
Time to execute Double Levenshtein function - 1.075548 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
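The idea can be sketched in Python as follows. This is an approximation under stated assumptions, not the site's own code: the names `strip_vowels` and `double_levenshtein` are invented for the example, and treating `y` as a vowel is a guess, made because it reproduces the scores listed above for "dying" and "dingy".

```python
# Assumption: the page does not say which letters count as vowels.
VOWELS = set("aeiouy")


def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits needed to turn a into b."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop every vowel, leaving the consonant skeleton of the word."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Full-word distance plus the distance between consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("dung", "ding"))  # -> 1 (vowel change only)
print(double_levenshtein("dung", "sung"))  # -> 2 (consonant change counts twice)
```

A vowel-only difference such as dung/ding costs 1, while a consonant difference such as dung/sung costs 2, which matches the ordering in the Double Levenshtein list above.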
Time to execute SoundEx function - 0.093469 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
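For illustration, here is a small Python sketch of the commonly described American Soundex rules: keep the first letter, map the remaining consonants to digits, skip vowels, collapse adjacent repeats, and pad to four characters. It is written for this example and may differ in edge cases from the function the site actually calls; the names `CODES` and `soundex` are assumptions.

```python
# Consonant groups and their Soundex digits.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word: str) -> str:
    """Return a four-character Soundex code, or "" if word has no letters."""
    letters = [ch for ch in word.lower() if ch.isalpha()]
    if not letters:
        return ""
    result = letters[0].upper()
    previous = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters with the same code
        code = CODES.get(ch, "")  # vowels (and y) map to ""
        if code and code != previous:
            result += code
        previous = code  # a vowel resets the repeat check
        if len(result) == 4:
            break
    return (result + "000")[:4]


print(soundex("dung"))    # -> D520
print(soundex("donkey"))  # -> D520
```

Because only the first three coded consonants survive, very different words such as "dung", "dance" and "dimaggio" share the code D520, which is why the SoundEx list above is so long.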
Time to execute MetaPhone function - 0.102044 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001022 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
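A lookup of this kind can be sketched in Python with a plain dictionary. Everything here is hypothetical apart from the dung/DUNG pair shown in the results above: the table contents, the name `LEXICON` and the function `lookup_lemma` are invented for the example and say nothing about how the real lexicon is stored.

```python
from typing import Optional

# Hypothetical lexicon: surface form -> lemma.
LEXICON = {
    "dung": "DUNG",   # pair taken from the results on this page
    "dungg": "DUNG",  # invented typo, for illustration only
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the curated lemma for word, or None if it is not covered."""
    return LEXICON.get(word.lower())


print(lookup_lemma("dung"))   # -> DUNG
print(lookup_lemma("ahint"))  # -> None (not in this toy table)
```

A dictionary lookup takes the same time regardless of table size, which fits with this method having by far the shortest execution time of the five.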