A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to cut-doon in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
cut-doon (0) - 2 freq
lat-doon (2) - 1 freq
run-doon (2) - 2 freq
let-doon (2) - 1 freq
shut-doon (2) - 1 freq
cast-doun (3) - 1 freq
cardoon (3) - 1 freq
rin-doon (3) - 1 freq
sundoon (3) - 2 freq
muldoon (3) - 7 freq
pit-doun (3) - 1 freq
shutdoon (3) - 2 freq
judoon (3) - 1 freq
curdoo (3) - 1 freq
put-on (3) - 1 freq
nut-broon (3) - 2 freq
pit-doons (3) - 2 freq
put-upon (3) - 1 freq
sit-doun (3) - 1 freq
outdoor (3) - 7 freq
calmdoon (3) - 1 freq
pitdoon (3) - 1 freq
cuddlan (4) - 1 freq
sundoun (4) - 11 freq
clampdoon (4) - 1 freq
cut-doon (0) - 2 freq
let-doon (3) - 1 freq
lat-doon (3) - 1 freq
pit-doun (4) - 1 freq
cast-doun (4) - 1 freq
sit-doun (4) - 1 freq
run-doon (4) - 2 freq
shut-doon (4) - 1 freq
put-upon (5) - 1 freq
pit-doons (5) - 2 freq
courit-doun (5) - 1 freq
cardoon (5) - 1 freq
calmdoon (5) - 1 freq
put-on (5) - 1 freq
pitdoon (5) - 1 freq
rin-doon (5) - 1 freq
heid-doon (6) - 1 freq
cuttan (6) - 4 freq
sit-douns (6) - 1 freq
crawdoun (6) - 1 freq
cordon (6) - 2 freq
pit-on (6) - 7 freq
outdone (6) - 1 freq
but-in (6) - 1 freq
cast-iron (6) - 1 freq
SoundEx code - C350
cuttin - 74 freq
cut-doon - 2 freq
cuidnae - 135 freq
caution - 10 freq
coudna - 47 freq
cotton - 25 freq
cuidna - 101 freq
cudnae - 144 freq
cudna - 165 freq
chattin - 22 freq
cheatin - 4 freq
cotton-woo - 2 freq
cidna - 2 freq
cidnae - 2 freq
cwidna - 47 freq
chidin - 1 freq
cuttin' - 1 freq
coodna - 71 freq
coudnae - 20 freq
coddin - 2 freq
chaitin - 3 freq
cud'nae - 1 freq
'cudna - 1 freq
cuttan - 4 freq
chaetin - 1 freq
cheatan - 1 freq
coudno - 3 freq
cadona - 6 freq
coodnae - 52 freq
cuddie-an - 1 freq
chatham - 1 freq
cydonia - 2 freq
cweedna - 2 freq
cidni - 1 freq
cottown - 1 freq
chattan - 1 freq
€œcudna - 1 freq
coatin - 1 freq
citin - 1 freq
chutney - 4 freq
cowden - 7 freq
codeine - 1 freq
ctyem - 1 freq
cudnea - 2 freq
MetaPhone code - KTTN
cut-doon - 2 freq
quotidian - 1 freq
göd-döin - 1 freq
gododdin - 2 freq
CUT-DOON
Time to execute Levenshtein function - 0.528427 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 1.077884 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027994 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.098917 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000950 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.