A Corpus of 21st Century Scots Texts


Levenshtein Distance



Similar words to dousin’ in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
Levenshtein distance
dousin’ (0) - 1 freq
dousin (2) - 1 freq
durin’ (2) - 1 freq
cousin’s (2) - 1 freq
honkin’ (3) - 1 freq
cousin's (3) - 15 freq
cousin (3) - 100 freq
cousins (3) - 46 freq
dowsing (3) - 1 freq
doesn’t (3) - 7 freq
cousinly (3) - 1 freq
dossin (3) - 3 freq
doukin (3) - 3 freq
lookin’ (3) - 2 freq
housing (3) - 4 freq
housin (3) - 5 freq
movin’ (3) - 1 freq
dousie's (3) - 1 freq
goin’ (3) - 2 freq
rousin (3) - 5 freq
dowsin (3) - 2 freq
dossing (3) - 1 freq
doutin (3) - 1 freq
dons’ (3) - 2 freq
dousit (3) - 1 freq
Double Levenshtein distance
dousin’ (0) - 1 freq
durin’ (3) - 1 freq
doesn’t (4) - 7 freq
cousin’s (4) - 1 freq
dousin (4) - 1 freq
goin’ (5) - 2 freq
movin’ (5) - 1 freq
days’ (5) - 1 freq
doosin (5) - 2 freq
lookin’ (5) - 2 freq
dumpin’ (5) - 1 freq
dons’ (5) - 2 freq
sayin’ (5) - 1 freq
doesna (6) - 90 freq
linin’ (6) - 1 freq
readin’ (6) - 1 freq
bein’ (6) - 2 freq
desing (6) - 2 freq
ain’ (6) - 1 freq
desine (6) - 1 freq
amazin’ (6) - 1 freq
dusney (6) - 1 freq
deusna (6) - 1 freq
bakin’ (6) - 1 freq
aroun’ (6) - 1 freq
SoundEx code - D250
dookin - 20 freq
disna - 397 freq
doesna - 90 freq
dozin - 8 freq
diggin - 43 freq
disnae - 577 freq
daikin - 1 freq
deekin - 2 freq
daesna - 71 freq
duisna - 13 freq
disown - 4 freq
doesnae - 166 freq
doggin - 10 freq
doukin - 3 freq
docken - 30 freq
duckin - 3 freq
dacin - 15 freq
dyke-an - 1 freq
dossin - 3 freq
dickson - 12 freq
dashin - 7 freq
dismay - 15 freq
dozen - 38 freq
daesnae - 29 freq
dizzen - 85 freq
diznae - 3 freq
deseen - 1 freq
dousin - 1 freq
deign - 1 freq
deacon - 5 freq
decin - 2 freq
dcein - 1 freq
''dassin'' - 1 freq
dassin - 23 freq
dassen - 28 freq
dizin - 3 freq
dismae - 1 freq
disney - 19 freq
dishin - 12 freq
dizzna - 9 freq
disna¢ - 1 freq
dozin' - 1 freq
dosena - 1 freq
dcemie - 1 freq
dooshin - 1 freq
doosin - 2 freq
dsin - 1 freq
dusna - 18 freq
deusna - 1 freq
dashan - 1 freq
dösna - 1 freq
disno - 35 freq
deckin - 1 freq
doesnae-' - 1 freq
'disney - 1 freq
doesn - 11 freq
dookan - 2 freq
diggan - 6 freq
diggen - 1 freq
dusnae - 12 freq
day-suin - 1 freq
dowiesome - 2 freq
doosno - 1 freq
dozan - 1 freq
dowsin - 2 freq
døsna - 1 freq
dizzin - 1 freq
disjune - 2 freq
dokken - 1 freq
‘disnae - 1 freq
dizna - 1 freq
dockin - 2 freq
dizen - 7 freq
deusno - 3 freq
disny - 5 freq
decayin - 1 freq
doesne - 1 freq
daesno - 1 freq
disni - 7 freq
dosan - 1 freq
deism - 1 freq
“disna - 2 freq
“dsien - 1 freq
dyshin - 1 freq
dushin - 1 freq
diggin' - 1 freq
dookin' - 1 freq
dousin’ - 1 freq
dyson - 1 freq
duzni - 1 freq
deken - 4 freq
dizni - 3 freq
doesni - 1 freq
dsimmie - 1 freq
dickin - 3 freq
disna' - 1 freq
dyjmh - 1 freq
deekin' - 1 freq
dqm - 1 freq
dequinn - 3 freq
dusney - 1 freq
duisnae - 1 freq
duojum - 1 freq
desine - 1 freq
dysony - 1 freq
“disnae - 1 freq
'diagon - 1 freq
'dookin - 1 freq
dixon - 1 freq
MetaPhone code - TSN
disna - 397 freq
doesna - 90 freq
dozin - 8 freq
disnae - 577 freq
tossin - 18 freq
daesna - 71 freq
teasin - 9 freq
duisna - 13 freq
disown - 4 freq
doesnae - 166 freq
dacin - 15 freq
dossin - 3 freq
teasin' - 1 freq
dozen - 38 freq
daesnae - 29 freq
dizzen - 85 freq
diznae - 3 freq
deseen - 1 freq
tisen - 1 freq
taison - 1 freq
dousin - 1 freq
decin - 2 freq
dcein - 1 freq
''dassin'' - 1 freq
dassin - 23 freq
dassen - 28 freq
dizin - 3 freq
design - 36 freq
design' - 1 freq
disney - 19 freq
dizzna - 9 freq
ticino - 1 freq
disna¢ - 1 freq
teason - 2 freq
dozin' - 1 freq
dosena - 1 freq
tizin - 3 freq
doosin - 2 freq
dsin - 1 freq
dusna - 18 freq
deusna - 1 freq
dösna - 1 freq
disno - 35 freq
tisan - 2 freq
doesnae-' - 1 freq
'disney - 1 freq
doesn - 11 freq
tossan - 4 freq
taisin - 1 freq
dusnae - 12 freq
day-suin - 1 freq
doosno - 1 freq
dozan - 1 freq
dowsin - 2 freq
døsna - 1 freq
dizzin - 1 freq
‘disnae - 1 freq
dizna - 1 freq
dizen - 7 freq
deusno - 3 freq
disny - 5 freq
doesne - 1 freq
daesno - 1 freq
disni - 7 freq
dosan - 1 freq
“disna - 2 freq
“dsien - 1 freq
dousin’ - 1 freq
dyson - 1 freq
duzni - 1 freq
dizni - 3 freq
doesni - 1 freq
disna' - 1 freq
dusney - 1 freq
duisnae - 1 freq
desine - 1 freq
dysony - 1 freq
“disnae - 1 freq
DOUSIN’
Time to execute Levenshtein function - 0.226083 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
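As a rough illustration of the definition above, here is a minimal Python sketch of the classic dynamic-programming calculation. The function name `levenshtein` is our own choice, and this is not the code that runs behind this page. It counts whole characters; the distances listed on this page appear to count the apostrophe in dousin’ as more than one unit (dousin scores 2 rather than 1), which would be consistent with a byte-based comparison, though that is only an inference from the numbers.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character inserts, deletes or replacements."""
    # Keep the shorter word in the inner loop so the rows stay small.
    if len(a) < len(b):
        a, b = b, a
    # previous[j] = distance between the first i-1 characters of a
    # and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb, or keep a match
            ))
        previous = current
    return previous[-1]


print(levenshtein("dousin", "cousin"))   # one replacement
print(levenshtein("dousin", "housing"))  # one replacement plus one insertion
```

Only two rows of the table are kept at a time, which is enough when just the distance (and not the edit script) is wanted.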
Time to execute Double Levenshtein function - 0.374447 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the words as written and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
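The double pass can be sketched in Python as follows. This is a guess at the idea rather than the site's own code: the names `strip_vowels` and `double_levenshtein` are hypothetical, and treating only a, e, i, o and u as vowels is an assumption.

```python
VOWELS = set("aeiou")  # assumption: the page does not say which letters count


def levenshtein(a: str, b: str) -> int:
    """Plain character-level Levenshtein distance."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop vowels so that only the consonant skeleton remains."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the vowel-free words."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


# A vowel-only difference is counted once, a consonant difference twice.
print(double_levenshtein("doosin", "dousin"))
print(double_levenshtein("durin", "dousin"))
```

Because a vowel change disappears in the second pass while a consonant change is counted again, spelling variants that differ only in their vowels rank closer than words with different consonants.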
Time to execute SoundEx function - 0.028117 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
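For readers who want to see how such a code is derived, the following Python sketch implements the standard American Soundex rules (keep the first letter, map the remaining consonants to digits, collapse repeats, pad to four characters). It is offered as an illustration under that assumption and may differ in edge cases from the function this page actually calls; the name `soundex` and the letter table are written out here by hand.

```python
# Consonant groups used by standard American Soundex.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word: str) -> str:
    """Return the four-character Soundex code, or '' if there are no letters."""
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    code = letters[0].upper()
    previous = CODES.get(letters[0], "")
    for ch in letters[1:]:
        digit = CODES.get(ch, "")
        # Emit a digit only when it differs from the one just before it.
        if digit and digit != previous:
            code += digit
        # h and w do not separate repeated codes; vowels do.
        if ch not in "hw":
            previous = digit
    return (code + "000")[:4]


for w in ("dousin", "disnae", "dozen"):
    print(w, soundex(w))
```

With these rules dousin, disnae and dozen all reduce to D250, the code shown for the SoundEx list on this page.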
Time to execute MetaPhone function - 0.037184 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000857 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
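A lookup table of this kind can be pictured as a simple dictionary. The sketch below is illustrative only: the entries and the name `lemma_for` are hypothetical and are not taken from the site's lexicon.

```python
from typing import Optional

# Hypothetical entries for illustration; the real lexicon is not shown here.
LEXICON = {
    "disnae": "disna",
    "doesnae": "disna",
    "diznae": "disna",
    "housin": "housing",
}


def lemma_for(word: str) -> Optional[str]:
    """Return the curated lemma, or None when the word is not covered."""
    return LEXICON.get(word.lower())


print(lemma_for("Disnae"))  # found in the table
print(lemma_for("dousin"))  # not covered, so None
```

A dictionary lookup takes roughly constant time on average, which would fit with this method having the shortest timing of the five.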