A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to duimster in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
duimster (0) - 2 freq
duster (2) - 12 freq
haimster (2) - 2 freq
diaster (2) - 1 freq
dustir (3) - 1 freq
cuisten (3) - 2 freq
hubster (3) - 1 freq
sister (3) - 449 freq
duimed (3) - 2 freq
glister (3) - 8 freq
maister (3) - 216 freq
duist (3) - 1 freq
adminster (3) - 1 freq
disters (3) - 1 freq
munster (3) - 7 freq
dusters (3) - 3 freq
twister (3) - 2 freq
disted (3) - 1 freq
blister (3) - 3 freq
plister (3) - 1 freq
mister (3) - 84 freq
tipster (3) - 1 freq
waister (3) - 1 freq
dempster (3) - 11 freq
haister (3) - 2 freq
duimster (0) - 2 freq
duster (3) - 12 freq
diaster (3) - 1 freq
haimster (3) - 2 freq
muster (4) - 7 freq
adminster (4) - 1 freq
maister (4) - 216 freq
hamster (4) - 6 freq
mister (4) - 84 freq
deemsters (4) - 1 freq
diameter (4) - 1 freq
dustir (4) - 1 freq
moister (4) - 7 freq
dempster (4) - 11 freq
disaster (4) - 45 freq
deester (4) - 2 freq
mistery (5) - 1 freq
administer (5) - 1 freq
dusted (5) - 6 freq
haimsters (5) - 1 freq
dunter (5) - 2 freq
buster (5) - 2 freq
leister (5) - 2 freq
mester (5) - 12 freq
aister (5) - 2 freq
SoundEx code - D523
daunced - 18 freq
dynasty - 10 freq
danced - 57 freq
doonsittin - 2 freq
doon-sittin - 1 freq
damaged - 15 freq
dingit - 13 freq
dinkit - 1 freq
doonsit - 1 freq
duncht - 1 freq
doonstair - 10 freq
dunced - 10 freq
'domesticated - 1 freq
doonstairs - 41 freq
dinged - 18 freq
donside - 4 freq
dunched - 8 freq
duimster - 2 freq
dauncit - 2 freq
dumstruik - 1 freq
downstairs - 2 freq
dammeyged - 1 freq
density - 4 freq
doonside - 4 freq
domestic - 20 freq
dunked - 2 freq
doon-stairs - 1 freq
dunstan - 1 freq
dynastic - 1 freq
ding-dong - 2 freq
doonstairs' - 1 freq
demi-gods - 1 freq
ding't - 2 freq
doonstream - 1 freq
densities - 1 freq
ding-dang - 6 freq
dunch't - 1 freq
dongting - 2 freq
donnchadh - 1 freq
dounsets - 1 freq
domestics - 1 freq
deemsters - 1 freq
doun-sittin - 1 freq
doonistair - 1 freq
dang-doun - 1 freq
damnest - 1 freq
doomnstairs - 1 freq
dunstane - 1 freq
dangit - 1 freq
domestication - 1 freq
domesticatit - 3 freq
domesticate - 1 freq
domesticates - 1 freq
dinkie-dies - 1 freq
doungate - 1 freq
doomsday - 1 freq
donaghadee - 5 freq
dinghyed - 1 freq
domesticated - 1 freq
dunecht - 1 freq
danight - 9 freq
dingyed - 1 freq
dingied - 3 freq
dimished - 1 freq
danicht - 4 freq
donstalk - 5 freq
dnjtk - 1 freq
donnchadhol - 2 freq
donsdaiiy - 6 freq
downside - 2 freq
dwanged - 1 freq
douneside - 4 freq
danceathon - 1 freq
dinsdale - 1 freq
dtammcd - 1 freq
densitie - 1 freq
dunsterhouseltd - 1 freq
domesticity - 1 freq
donagha-dreich - 1 freq
daneside - 1 freq
dunged - 1 freq
MetaPhone code - TMSTR
duimster - 2 freq
'teamster' - 1 freq
DUIMSTER
Time to execute Levenshtein function - 0.209135 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.395787 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.030347 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.041901 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001063 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.