A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to doing in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
doing (0) - 85 freq
dong (1) - 5 freq
going (1) - 248 freq
doin (1) - 75 freq
ding (1) - 87 freq
doin' (1) - 1 freq
daing (1) - 1 freq
do-ing (1) - 1 freq
coing (1) - 1 freq
dring (1) - 1 freq
dying (1) - 15 freq
doirc (2) - 1 freq
doit (2) - 3 freq
donna (2) - 51 freq
dining (2) - 6 freq
swing (2) - 70 freq
tong (2) - 7 freq
sling (2) - 7 freq
dossing (2) - 1 freq
moving (2) - 24 freq
donnt (2) - 1 freq
daoming (2) - 1 freq
joint (2) - 52 freq
domino (2) - 10 freq
rging (2) - 1 freq
doing (0) - 85 freq
daing (1) - 1 freq
dong (1) - 5 freq
ding (1) - 87 freq
dying (1) - 15 freq
dingy (2) - 1 freq
daeing (2) - 2 freq
dang (2) - 12 freq
deeing (2) - 5 freq
dingo (2) - 1 freq
dung (2) - 29 freq
doin' (2) - 1 freq
going (2) - 248 freq
do-ing (2) - 1 freq
doin (2) - 75 freq
coing (2) - 1 freq
dring (2) - 1 freq
deuing (2) - 1 freq
hong (3) - 5 freq
kong (3) - 7 freq
sing (3) - 350 freq
young (3) - 1106 freq
geing (3) - 1 freq
using (3) - 35 freq
don' (3) - 1 freq
SoundEx code - D520
deems - 6 freq
ding - 87 freq
dying - 15 freq
dinah's - 5 freq
damage - 45 freq
dance - 380 freq
denies - 12 freq
dwynes - 10 freq
dwaums - 5 freq
damies - 1 freq
dennis's - 1 freq
dings - 16 freq
dwams - 18 freq
dung - 29 freq
dens - 21 freq
dams - 6 freq
dam-s - 1 freq
daunce - 42 freq
demise - 12 freq
danny's - 7 freq
dimaggio - 1 freq
doing - 85 freq
dunkey - 19 freq
dunkie - 5 freq
deen's - 3 freq
donk - 1 freq
dawn's - 2 freq
dinghy - 3 freq
dims - 3 freq
denish' - 1 freq
dionysia - 1 freq
douns - 2 freq
dwines - 6 freq
dons - 31 freq
dunce - 27 freq
dennis - 11 freq
dunes - 6 freq
donkey - 16 freq
donsie - 2 freq
daeins - 125 freq
doons - 11 freq
dames - 4 freq
dang - 12 freq
dense - 12 freq
domes - 2 freq
danes - 2 freq
damayge - 1 freq
dammeyge - 1 freq
danns - 1 freq
damege - 1 freq
don's - 8 freq
dunse - 1 freq
daein's - 1 freq
dunk - 7 freq
dam's - 1 freq
dinnis - 1 freq
danse - 4 freq
donks - 5 freq
doensae - 2 freq
doms - 3 freq
deemies - 1 freq
dems - 6 freq
duns - 2 freq
dmnk - 1 freq
'daunce - 1 freq
doyens - 1 freq
dunch - 27 freq
danish - 19 freq
do-eeng - 1 freq
do-ing - 1 freq
damask - 2 freq
dimewise - 1 freq
denys - 1 freq
dinng - 1 freq
daince - 1 freq
dunky - 1 freq
dina's - 3 freq
deeins - 7 freq
dank - 11 freq
dinkie - 3 freq
denis - 11 freq
damish - 4 freq
danss - 2 freq
daunss - 1 freq
domms - 2 freq
dwangs - 3 freq
'ding - 4 freq
dong - 5 freq
daeneesh - 1 freq
daeings - 3 freq
deans - 3 freq
deanies - 1 freq
damns - 2 freq
dwyns - 1 freq
dwymes - 1 freq
doonways - 1 freq
dwance - 1 freq
dawins - 1 freq
diine's - 1 freq
dummies - 4 freq
'dance - 1 freq
deemikie - 2 freq
dinks - 1 freq
donne's - 1 freq
donnachie - 1 freq
deames' - 1 freq
dywnes - 1 freq
deimos - 1 freq
dinnaes - 1 freq
dawns - 5 freq
dons' - 1 freq
dunsh - 1 freq
dooeeng - 1 freq
domsie - 1 freq
dink - 2 freq
dans - 3 freq
dingey - 1 freq
deeing - 5 freq
diniz - 1 freq
daimige - 1 freq
dame's' - 1 freq
'daeing' - 1 freq
daing - 1 freq
dowiness - 1 freq
dooms - 1 freq
demos- - 2 freq
demos - 2 freq
dansk - 1 freq
dannsa - 1 freq
daeing - 2 freq
deans's - 1 freq
deuing - 1 freq
dingy - 1 freq
dinky - 1 freq
dms - 1 freq
dynesy - 1 freq
dingo - 1 freq
dunskey - 5 freq
dmgjei - 1 freq
donsÂ’ - 2 freq
dunns - 1 freq
dunc - 4 freq
dnycq - 1 freq
dmjco - 1 freq
dmxyo - 1 freq
dimmock - 1 freq
damij - 1 freq
dancey - 1 freq
dhmck - 1 freq
dunak's - 1 freq
dunnocks - 1 freq
donna's - 1 freq
denise - 1 freq
demoss - 1 freq
dumsch - 1 freq
dunsy - 2 freq
dunnock - 1 freq
MetaPhone code - TNK
tang - 50 freq
tongue - 434 freq
ding - 87 freq
tung - 403 freq
dung - 29 freq
tonic - 13 freq
tank - 27 freq
doing - 85 freq
dunkey - 19 freq
dunkie - 5 freq
donk - 1 freq
donkey - 16 freq
ting - 49 freq
dang - 12 freq
tink - 195 freq
taing - 10 freq
tunic - 20 freq
tinkie - 5 freq
tng - 1 freq
tunk - 6 freq
toung - 7 freq
tango - 10 freq
dunk - 7 freq
tong - 7 freq
tungue - 2 freq
tongue' - 7 freq
tankie - 1 freq
do-eeng - 1 freq
do-ing - 1 freq
dinng - 1 freq
tangue - 1 freq
'tonic - 1 freq
dunky - 1 freq
dank - 11 freq
ting' - 1 freq
dinkie - 3 freq
'ding - 4 freq
dong - 5 freq
tank- - 1 freq
tanga - 1 freq
tonka - 2 freq
dooeeng - 1 freq
dink - 2 freq
tinky - 2 freq
deeing - 5 freq
'daeing' - 1 freq
tonga - 1 freq
daing - 1 freq
€˜ting - 1 freq
daeing - 2 freq
teng - 1 freq
deuing - 1 freq
tink' - 1 freq
dinky - 1 freq
dingo - 1 freq
dunc - 4 freq
ydiunc - 1 freq
dunnock - 1 freq
DOING
dae - 4565 freq
do - 861 freq
does - 385 freq
did - 2859 freq
doin - 75 freq
daein - 882 freq
div - 506 freq
done - 821 freq
dee - 1212 freq
don't - 605 freq
didn't - 39 freq
dinna - 1825 freq
dinnae - 1942 freq
didnae - 1693 freq
didna - 1636 freq
daenae - 4 freq
disnae - 586 freq
disna - 401 freq
doesnae - 176 freq
doesna - 90 freq
duin - 393 freq
doing - 85 freq
dain - 144 freq
dane - 55 freq
Time to execute Levenshtein function - 0.208204 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.360027 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027876 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037521 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000988 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.