A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to katie in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
katie (0) - 34 freq
gatie (1) - 1 freq
katiec (1) - 18 freq
kate (1) - 176 freq
kathie (1) - 2 freq
kartie (1) - 1 freq
santie (2) - 14 freq
satin (2) - 12 freq
untie (2) - 2 freq
ratier (2) - 1 freq
late (2) - 491 freq
rate (2) - 101 freq
kwitie (2) - 11 freq
ettie (2) - 4 freq
lathie (2) - 1 freq
dutie (2) - 1 freq
pathie (2) - 9 freq
hastie (2) - 1 freq
hatit (2) - 15 freq
baatie (2) - 1 freq
native (2) - 93 freq
kaye (2) - 4 freq
daatie (2) - 6 freq
narie (2) - 1 freq
gate (2) - 293 freq
katie (0) - 34 freq
kate (1) - 176 freq
kat (2) - 156 freq
kyte (2) - 15 freq
kite (2) - 19 freq
katy (2) - 18 freq
kte (2) - 1 freq
katiec (2) - 18 freq
kathie (2) - 2 freq
kartie (2) - 1 freq
gatie (2) - 1 freq
cate (3) - 2 freq
oantie (3) - 1 freq
paetie (3) - 2 freq
notie (3) - 8 freq
sautie (3) - 4 freq
tie (3) - 88 freq
kaif (3) - 1 freq
totie (3) - 18 freq
kath (3) - 3 freq
'kate (3) - 1 freq
tate (3) - 12 freq
bate (3) - 66 freq
kaip (3) - 2 freq
ati (3) - 16 freq
SoundEx code - K300
kyte - 15 freq
kythe - 65 freq
kid - 103 freq
kit - 30 freq
kow-tow - 1 freq
kate - 176 freq
kith - 13 freq
kiddie - 1 freq
kathie - 2 freq
kwite - 1 freq
kite - 19 freq
kathy - 4 freq
keith - 15 freq
kithy - 2 freq
kitth - 1 freq
kid' - 2 freq
katie - 34 freq
kïsst - 2 freq
kïst - 1 freq
kat - 156 freq
'kat - 3 freq
kat' - 1 freq
keyed - 4 freq
'kate - 1 freq
kood - 5 freq
kott - 1 freq
ket - 1 freq
kudu - 1 freq
kyth - 3 freq
kywte - 1 freq
katy - 18 freq
kidd - 7 freq
key'd - 1 freq
kd - 2 freq
kitty - 13 freq
katt - 1 freq
kittie - 3 freq
€˜kssst - 1 freq
€˜katie - 3 freq
€˜kat - 2 freq
kath - 3 freq
keit - 1 freq
kydd - 1 freq
 koda - 2 freq
kita - 1 freq
kad - 1 freq
kiddy - 1 freq
kt - 3 freq
kd- - 1 freq
kwitie - 11 freq
‘keyed - 1 freq
kaat - 1 freq
kdu - 1 freq
kwaaet - 1 freq
kdy - 1 freq
kedy - 1 freq
kdyhu - 1 freq
kteu - 2 freq
kweyad - 1 freq
kgd - 2 freq
kte - 1 freq
kjkcyd - 1 freq
kstew - 1 freq
kiddo - 1 freq
MetaPhone code - KT
cuddy - 55 freq
gaed - 1526 freq
got - 3871 freq
quiet - 239 freq
cut - 455 freq
quite - 475 freq
guid - 3650 freq
cuid - 803 freq
gowd - 223 freq
quate - 161 freq
gate - 293 freq
caa'd - 64 freq
goad - 105 freq
kyte - 15 freq
cuttie - 30 freq
coat - 159 freq
good - 1020 freq
god - 941 freq
queat - 11 freq
gadie - 5 freq
gat - 367 freq
gait - 137 freq
cawd - 33 freq
goat - 706 freq
guide - 91 freq
cat - 557 freq
kid - 103 freq
cute - 41 freq
gaid - 5 freq
--good - 1 freq
coud - 151 freq
quit - 28 freq
quaiet - 4 freq
cuddie - 43 freq
ca'd - 33 freq
coda - 3 freq
'coud - 1 freq
gut - 23 freq
kit - 30 freq
quid - 81 freq
cd - 48 freq
gîte - 1 freq
kow-tow - 1 freq
gad - 2 freq
kate - 176 freq
quait - 131 freq
quat - 31 freq
'cuid - 7 freq
'guid - 51 freq
'cud - 2 freq
cud - 955 freq
caad - 306 freq
cad - 40 freq
kiddie - 1 freq
'good - 6 freq
guttie - 5 freq
quad - 18 freq
gud - 43 freq
'quate - 1 freq
caad- - 1 freq
gute - 1 freq
'goad - 3 freq
cutty - 32 freq
gaudie - 1 freq
cott - 4 freq
cuitie - 1 freq
god' - 6 freq
'quite - 3 freq
cot - 44 freq
catty - 30 freq
cae'd - 1 freq
gude - 82 freq
'got - 12 freq
gottae - 7 freq
kite - 19 freq
gout - 10 freq
ca'ed - 76 freq
cattie - 2 freq
quaet - 26 freq
queet - 2 freq
gaad - 4 freq
cod - 20 freq
ca-ed - 1 freq
gguid - 1 freq
'god - 11 freq
gt - 9 freq
caaed - 95 freq
gtow - 1 freq
kitth - 1 freq
cood - 257 freq
cooda - 3 freq
ct - 8 freq
'good' - 2 freq
code - 38 freq
caud - 7 freq
gota - 3 freq
gowdie - 8 freq
goatae - 4 freq
good' - 2 freq
kid' - 2 freq
goate - 1 freq
katie - 34 freq
quote - 50 freq
goatie - 1 freq
'cut - 5 freq
queeit - 2 freq
gatie - 1 freq
ga'ed - 3 freq
ca'ad - 11 freq
gued - 1 freq
gotta - 13 freq
quet - 3 freq
cut' - 2 freq
caa't - 4 freq
caa'ed - 5 freq
couttie - 12 freq
couid - 1 freq
goatee - 2 freq
cutte - 1 freq
queued - 4 freq
go-d - 1 freq
gaudy - 4 freq
cïtie - 6 freq
wïckit - 5 freq
quït - 3 freq
góat - 1 freq
cudda - 2 freq
cóat - 1 freq
gowdea - 1 freq
ca't - 1 freq
cat' - 1 freq
guid' - 1 freq
cato - 1 freq
got' - 3 freq
kat - 156 freq
'kat - 3 freq
kat' - 1 freq
cwyte - 3 freq
gote - 43 freq
gaet - 22 freq
göd - 254 freq
gdd - 2 freq
'cood - 1 freq
gd - 12 freq
göd' - 1 freq
'gat - 1 freq
'got' - 1 freq
guide' - 1 freq
coot - 2 freq
'quiet - 2 freq
gti - 1 freq
'cutty - 1 freq
qt - 5 freq
gaud - 5 freq
coit - 7 freq
cutt - 3 freq
codd - 7 freq
gode - 5 freq
quod - 1 freq
quyt - 3 freq
'kate - 1 freq
'cut' - 1 freq
goit - 3 freq
kood - 5 freq
gott - 3 freq
kott - 1 freq
goodo - 1 freq
goodie' - 1 freq
coatie - 5 freq
goed - 3 freq
ket - 1 freq
gød - 5 freq
kudu - 1 freq
cuida - 2 freq
caat - 9 freq
quhyt - 5 freq
kywte - 1 freq
cøt - 1 freq
güd - 3 freq
'gate' - 1 freq
goattie - 1 freq
gaw'd - 1 freq
gait- - 1 freq
'quit - 1 freq
katy - 18 freq
kidd - 7 freq
key'd - 1 freq
caed - 4 freq
gae't - 2 freq
gíed - 1 freq
cowt - 2 freq
kd - 2 freq
kitty - 13 freq
'gowd' - 1 freq
€˜göd - 1 freq
€žcuddy - 1 freq
quota - 1 freq
goud - 3 freq
€œgoad - 1 freq
cootie - 1 freq
€™-cat - 1 freq
cuit - 2 freq
quiat - 2 freq
cude - 1 freq
caddie - 2 freq
katt - 1 freq
kittie - 3 freq
€œgod - 5 freq
€˜cat - 1 freq
cote - 4 freq
€˜goad - 1 freq
€œcuid - 2 freq
€œgood - 6 freq
€˜gaed - 1 freq
€œguid - 21 freq
cata - 1 freq
wkd - 4 freq
€˜guid - 2 freq
€˜katie - 3 freq
€˜good - 8 freq
€˜goat - 2 freq
quitie - 2 freq
€˜kat - 2 freq
€˜god - 3 freq
gawd - 2 freq
€œcatty - 1 freq
€˜catty - 2 freq
€œcat - 2 freq
€œcut - 1 freq
€œgaddy - 1 freq
quaeit - 1 freq
keit - 1 freq
caddy - 2 freq
€˜cut - 1 freq
€œcud - 2 freq
€œgud - 1 freq
€œgottae - 1 freq
quietie - 1 freq
kydd - 1 freq
qaeda - 1 freq
cody - 1 freq
€œcode - 1 freq
 koda - 2 freq
quot - 15 freq
kita - 1 freq
'guid' - 1 freq
kad - 1 freq
gøtu - 1 freq
kiddy - 1 freq
€™goat - 1 freq
gtt - 1 freq
kt - 3 freq
cto - 1 freq
kd- - 1 freq
qtt - 2 freq
qto - 1 freq
goodie - 3 freq
kaat - 1 freq
kdu - 1 freq
godÂ’ - 1 freq
caÂ’d - 2 freq
ygt - 1 freq
gdh - 2 freq
ggtth - 1 freq
“guid - 1 freq
kdy - 1 freq
gade - 1 freq
gatt - 1 freq
kedy - 1 freq
good” - 1 freq
hqt - 1 freq
cutie - 4 freq
ctau - 1 freq
guid” - 1 freq
ctw - 1 freq
couddae - 1 freq
cate - 2 freq
cait - 2 freq
kteu - 2 freq
qd - 1 freq
kte - 1 freq
wqt - 1 freq
gowdd - 1 freq
gdiou - 1 freq
qoute - 1 freq
gtto - 3 freq
cowdie - 2 freq
kiddo - 1 freq
qht - 38 freq
qdiy - 1 freq
gwd - 1 freq
‘guid - 1 freq
cutey - 1 freq
KATIE
Time to execute Levenshtein function - 0.496828 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.637121 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.062690 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037468 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000800 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.