A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to chaetin in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
chaetin (0) - 1 freq
chastin (1) - 1 freq
chattin (1) - 22 freq
chantin (1) - 28 freq
chaitin (1) - 3 freq
chackin (2) - 3 freq
chawvin (2) - 1 freq
chantan (2) - 4 freq
hantin (2) - 6 freq
'haein (2) - 1 freq
cheatin (2) - 4 freq
cheetie (2) - 23 freq
cantin (2) - 4 freq
baetin (2) - 4 freq
chanting (2) - 3 freq
chagrin (2) - 1 freq
cheepin (2) - 6 freq
hatin (2) - 2 freq
castin (2) - 48 freq
chakkin (2) - 1 freq
cratin (2) - 1 freq
chawin (2) - 42 freq
chirtin (2) - 1 freq
clautin (2) - 1 freq
claenin (2) - 1 freq
chaetin (0) - 1 freq
chaitin (1) - 3 freq
cheatin (2) - 4 freq
chastin (2) - 1 freq
chattin (2) - 22 freq
chantin (2) - 28 freq
coastin (3) - 1 freq
cheatan (3) - 1 freq
chain (3) - 62 freq
chattie (3) - 1 freq
chaotic (3) - 3 freq
coatin (3) - 1 freq
sheetin (3) - 6 freq
cheerin (3) - 25 freq
chauvin (3) - 4 freq
cretin (3) - 3 freq
chaavin (3) - 2 freq
chewin (3) - 16 freq
chaain (3) - 7 freq
chauntin (3) - 3 freq
haitin (3) - 7 freq
cartin (3) - 1 freq
chavin (3) - 4 freq
chafin (3) - 1 freq
chattan (3) - 1 freq
SoundEx code - C350
cuttin - 74 freq
cut-doon - 2 freq
cuidnae - 135 freq
caution - 10 freq
coudna - 47 freq
cotton - 25 freq
cuidna - 101 freq
cudnae - 144 freq
cudna - 165 freq
chattin - 22 freq
cheatin - 4 freq
cotton-woo - 2 freq
cidna - 2 freq
cidnae - 2 freq
cwidna - 47 freq
chidin - 1 freq
cuttin' - 1 freq
coodna - 71 freq
coudnae - 20 freq
coddin - 2 freq
chaitin - 3 freq
cud'nae - 1 freq
'cudna - 1 freq
cuttan - 4 freq
chaetin - 1 freq
cheatan - 1 freq
coudno - 3 freq
cadona - 6 freq
coodnae - 52 freq
cuddie-an - 1 freq
chatham - 1 freq
cydonia - 2 freq
cweedna - 2 freq
cidni - 1 freq
cottown - 1 freq
chattan - 1 freq
€œcudna - 1 freq
coatin - 1 freq
citin - 1 freq
chutney - 4 freq
cowden - 7 freq
codeine - 1 freq
ctyem - 1 freq
cudnea - 2 freq
MetaPhone code - XTN
shoutin - 137 freq
shuttin - 39 freq
shootin - 43 freq
shoudna - 6 freq
shitin - 4 freq
chattin - 22 freq
shoutin' - 2 freq
cheatin - 4 freq
shuidnae - 11 freq
showdin - 2 freq
sheetin - 6 freq
shoutan - 5 freq
shudnae - 30 freq
chidin - 1 freq
sioatin' - 1 freq
shotten - 4 freq
shouten - 2 freq
sheeten - 1 freq
shidnae - 1 freq
shutt'n - 1 freq
showdoon - 1 freq
shittin - 4 freq
shidna - 9 freq
sheddin - 5 freq
chaitin - 3 freq
shutten - 2 freq
shuttan - 3 freq
chaetin - 1 freq
shoodna - 3 freq
shuidna - 16 freq
cheatan - 1 freq
shouteen - 1 freq
shaidin - 28 freq
shuitin - 4 freq
shuitten - 2 freq
shoudno - 2 freq
shiten - 1 freq
sheddan - 1 freq
shootan - 5 freq
shudna - 4 freq
shitein - 2 freq
shadno - 1 freq
chattan - 1 freq
shadin - 1 freq
shoudnae - 3 freq
chutney - 4 freq
shitin' - 1 freq
shoud'nae - 1 freq
shoodnae - 1 freq
CHAETIN
Time to execute Levenshtein function - 0.222034 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.382423 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028737 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.041066 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000947 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.