A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

Similar words to katsh in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
Levenshtein distance
katsh (0) - 1 freq
kath (1) - 3 freq
kat's (2) - 15 freq
kat (2) - 156 freq
fatsu (2) - 1 freq
oath (2) - 13 freq
aish (2) - 2 freq
cash (2) - 84 freq
ktsc (2) - 1 freq
rats' (2) - 1 freq
kyth (2) - 3 freq
katt (2) - 1 freq
marsh (2) - 8 freq
kat' (2) - 1 freq
hash (2) - 25 freq
nash (2) - 2 freq
fatst (2) - 1 freq
tash (2) - 3 freq
pash (2) - 1 freq
lats (2) - 26 freq
bats (2) - 34 freq
kach (2) - 1 freq
waash (2) - 32 freq
watch (2) - 670 freq
kush (2) - 1 freq
Double Levenshtein distance
katsh (0) - 1 freq
kath (2) - 3 freq
kuts (3) - 1 freq
ketch (3) - 9 freq
tash (3) - 3 freq
kush (3) - 1 freq
kesh (3) - 1 freq
kits (3) - 3 freq
kyth (3) - 3 freq
kitth (3) - 1 freq
kith (3) - 13 freq
kathy (3) - 4 freq
ktsc (3) - 1 freq
kitsch (3) - 4 freq
ramsh (4) - 2 freq
mash (4) - 14 freq
tats (4) - 1 freq
hath (4) - 3 freq
harsh (4) - 20 freq
bash (4) - 20 freq
pats (4) - 14 freq
lateh (4) - 1 freq
rash (4) - 11 freq
ash (4) - 31 freq
nats (4) - 2 freq
SoundEx code - K320
kites - 6 freq
ketch - 9 freq
kittiwake - 2 freq
kate's - 26 freq
kythes - 64 freq
kiddies - 2 freq
kids - 87 freq
kitchie - 71 freq
katja - 6 freq
kitty's - 1 freq
ket's - 1 freq
kidz - 1 freq
kits - 3 freq
kïsts - 1 freq
kytes - 2 freq
kat's - 15 freq
'kat's - 1 freq
kittag - 1 freq
kuts - 1 freq
katsh - 1 freq
'katze - 1 freq
kyths - 5 freq
kitsch - 4 freq
kidds - 2 freq
kudos - 2 freq
kathak - 1 freq
kodak - 1 freq
kszydj - 1 freq
katiec - 18 freq
kdaz - 1 freq
keithc - 1 freq
kdz - 1 freq
kdeyg - 1 freq
kid's - 1 freq
ketts - 1 freq
kxdkh - 1 freq
ktsc - 1 freq
ktze - 1 freq
kdjya - 1 freq
kdiouwg - 1 freq
kawwhtaxw - 1 freq
MetaPhone code - KTX
katsh - 1 freq
'quidditch' - 1 freq
Manually curated
KATSH
Time to execute Levenshtein function - 0.213928 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It is useful for detecting typos and alternative spellings.
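As a rough illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming approach. It is not the code this page runs; the function name `levenshtein` is chosen here for illustration only.

```python
def levenshtein(a: str, b: str) -> int:
    """Count the single-character replacements, insertions and deletions
    needed to turn word a into word b."""
    # Keep the shorter word in the inner loop so each row stays small.
    if len(a) < len(b):
        a, b = b, a
    # previous[j] = distance between the empty prefix of a and b[:j]
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to the empty string
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (or keep it)
            ))
        previous = current
    return previous[-1]


print(levenshtein("katsh", "kath"))   # one deletion
print(levenshtein("katsh", "watch"))  # two replacements
```

The two printed values correspond to the scores listed for kath (1) and watch (2) in the Levenshtein results.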
Time to execute Double Levenshtein function - 0.380000 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
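The idea can be sketched in Python as below. This is an illustrative reconstruction, not the site's own code. The names `double_levenshtein` and `strip_vowels` are made up here, and the vowel set is an assumption: `y` is included because the listed scores for kyth (3) and kathy (3) only come out that way if `y` is dropped along with a, e, i, o and u.

```python
VOWELS = set("aeiouy")  # assumption: y is treated as a vowel


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance (replace, insert, delete all cost 1)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Return the word with every vowel removed, keeping consonant order."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on their consonant skeletons."""
    full = levenshtein(a, b)
    consonants_only = levenshtein(strip_vowels(a), strip_vowels(b))
    return full + consonants_only


print(double_levenshtein("katsh", "kath"))  # 1 (full) + 1 (ktsh vs kth)
print(double_levenshtein("katsh", "kuts"))  # 2 (full) + 1 (ktsh vs kts)
```

A vowel swap such as a for u costs one point in the first pass and nothing in the second, while a consonant change is counted in both passes.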
Time to execute SoundEx function - 0.028962 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
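For readers who want to see how such a code is built, here is a minimal Python sketch of the common American Soundex rules: keep the first letter, map the following consonants to digits, skip repeats, then pad to four characters. It is a sketch under the assumption that the page uses this standard variant. The function name `soundex` is chosen here, and characters outside a-z are simply ignored.

```python
# Consonant groups that share a Soundex digit.
SOUNDEX_CODES = {
    **dict.fromkeys("bfpv", "1"),
    **dict.fromkeys("cgjkqsxz", "2"),
    **dict.fromkeys("dt", "3"),
    "l": "4",
    **dict.fromkeys("mn", "5"),
    "r": "6",
}


def soundex(word: str) -> str:
    """Return a four-character Soundex code, or "" if the word has no a-z letters."""
    # Assumption: apostrophes, accents and other non a-z characters are dropped.
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    code = letters[0].upper()
    previous = SOUNDEX_CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters with the same digit
        digit = SOUNDEX_CODES.get(ch, "")
        if digit and digit != previous:
            code += digit
        previous = digit  # a vowel resets this, so a repeated digit counts again
        if len(code) == 4:
            break
    return code.ljust(4, "0")


print(soundex("katsh"))      # K320
print(soundex("kittiwake"))  # K320
```

Both words reduce to K, then 3 for t, then 2 for s or k, which is why such different-looking words share the K320 list.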
Time to execute MetaPhone function - 0.038695 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000768 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
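A lookup of this kind can be sketched in Python as a plain dictionary. The entries below are invented for illustration and are not taken from the corpus lexicon; the names `LEXICON` and `lookup_lemma` are likewise hypothetical.

```python
from typing import Optional

# Hypothetical hand-made lexicon: surface form -> lemma.
# These entries are illustrative only, not the corpus's real table.
LEXICON = {
    "hoose": "house",
    "hooses": "house",
    "housse": "house",  # a typo mapped to the same lemma
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the curated lemma for a word, or None if the word is not covered."""
    return LEXICON.get(word.lower())


print(lookup_lemma("Hooses"))  # house
print(lookup_lemma("katsh"))   # None, because this toy table does not cover it
```

A dictionary lookup does no distance calculation at all, which fits the very short execution time reported for this method, and returning `None` reflects the fact that not all words are covered.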