A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint


Similar words to goth in Corpus

Methods (results listed below in this order): Levenshtein | Double Levenshtein | SoundEx | MetaPhone | Manually curated
goth (0) - 10 freq
got (1) - 3893 freq
gote (1) - 43 freq
gott (1) - 3 freq
roth (1) - 2 freq
both (1) - 198 freq
got' (1) - 3 freq
moth (1) - 7 freq
gogh (1) - 1 freq
goch (1) - 1 freq
gosh (1) - 2 freq
woth (1) - 1 freq
gota (1) - 3 freq
doth (1) - 9 freq
noth (1) - 5 freq
troth (2) - 18 freq
borth (2) - 73 freq
toch (2) - 1 freq
vott (2) - 2 freq
gtc (2) - 1 freq
fowth (2) - 5 freq
gelh (2) - 1 freq
gate (2) - 294 freq
ooto (2) - 85 freq
vot (2) - 1 freq
goth (0) - 10 freq
gosh (2) - 2 freq
goch (2) - 1 freq
woth (2) - 1 freq
noth (2) - 5 freq
goethe (2) - 8 freq
gogh (2) - 1 freq
gota (2) - 3 freq
doth (2) - 9 freq
moth (2) - 7 freq
got (2) - 3893 freq
gote (2) - 43 freq
gott (2) - 3 freq
got' (2) - 3 freq
both (2) - 198 freq
roth (2) - 2 freq
gsoh (3) - 2 freq
gtt (3) - 1 freq
sth (3) - 1 freq
gothic (3) - 5 freq
gat (3) - 399 freq
getn (3) - 1 freq
gite (3) - 2 freq
gough (3) - 1 freq
sooth (3) - 335 freq
SoundEx code - G300
gaed - 1528 freq
get - 5225 freq
gied - 1359 freq
got - 3893 freq
guid - 3717 freq
gowd - 224 freq
gate - 294 freq
goad - 105 freq
gweed - 482 freq
good - 1025 freq
god - 969 freq
gadie - 5 freq
gat - 399 freq
gait - 137 freq
goat - 732 freq
git - 1244 freq
guide - 91 freq
gaid - 5 freq
git' - 3 freq
--good - 1 freq
gid - 302 freq
gyte - 89 freq
gut - 27 freq
gîte - 1 freq
gad - 2 freq
'gweed - 6 freq
'guid - 51 freq
'get - 30 freq
geed - 69 freq
'git - 5 freq
'good - 7 freq
gi'ed - 2 freq
geet - 10 freq
guttie - 5 freq
gud - 43 freq
gute - 1 freq
get' - 6 freq
'goad - 3 freq
gead - 2 freq
gaudie - 1 freq
god' - 6 freq
gude - 82 freq
'got - 12 freq
gottae - 7 freq
'gied - 1 freq
goth - 10 freq
geid - 53 freq
gout - 10 freq
gateway - 8 freq
gaad - 4 freq
gyaad - 2 freq
giddy - 4 freq
gyad - 3 freq
gie't - 12 freq
gguid - 1 freq
'god - 11 freq
gt - 9 freq
gtow - 1 freq
gceid - 1 freq
gcid - 4 freq
gct - 1 freq
'gid - 1 freq
'good' - 2 freq
gota - 3 freq
g't - 1 freq
goood - 3 freq
ght - 1 freq
gie'd - 4 freq
goodie - 4 freq
gata - 1 freq
gowdie - 8 freq
goatae - 4 freq
good' - 2 freq
gid' - 1 freq
goate - 1 freq
goatie - 1 freq
gatie - 1 freq
ga'ed - 3 freq
gued - 1 freq
gotta - 13 freq
getta - 2 freq
goatee - 2 freq
ge°d- - 1 freq
go-d - 1 freq
ghetto - 2 freq
gaudy - 4 freq
gett - 11 freq
góat - 1 freq
gowdea - 1 freq
geta - 1 freq
geit - 3 freq
guid' - 1 freq
got' - 3 freq
'gweed' - 1 freq
gote - 43 freq
gaet - 22 freq
göd - 254 freq
gdd - 2 freq
gd - 12 freq
göd' - 1 freq
'gat - 1 freq
'got' - 1 freq
guide' - 1 freq
gti - 1 freq
ged - 18 freq
gaud - 5 freq
gode - 5 freq
gyed - 6 freq
geud - 7 freq
gyde - 3 freq
gyet - 1 freq
goit - 3 freq
gott - 3 freq
gaetwaey - 1 freq
goodo - 1 freq
gite - 2 freq
goodie' - 1 freq
goed - 3 freq
gød - 5 freq
guyed - 1 freq
güd - 3 freq
'gate' - 1 freq
goattie - 1 freq
g-g-gowd - 1 freq
gaw'd - 1 freq
gait- - 1 freq
gytie - 1 freq
gießt - 1 freq
gieit - 1 freq
gae't - 2 freq
gíed - 1 freq
ghoti - 1 freq
'gowd' - 1 freq
‘göd - 1 freq
gjit - 2 freq
goud - 3 freq
“goad - 1 freq
gide - 1 freq
‘gweed - 2 freq
“git - 3 freq
geodie - 1 freq
gyat - 1 freq
“god - 5 freq
gyid - 1 freq
‘goad - 1 freq
“good - 6 freq
‘gaed - 1 freq
“get - 8 freq
“guid - 21 freq
gyit - 2 freq
‘guid - 2 freq
‘get - 11 freq
‘good - 8 freq
‘ged - 2 freq
‘goat - 2 freq
‘god - 3 freq
gawd - 2 freq
“gweed - 2 freq
“gaddy - 1 freq
goethe - 8 freq
giid - 2 freq
“gud - 1 freq
“gottae - 1 freq
‘git - 1 freq
gyt - 1 freq
'guid' - 1 freq
gyd - 1 freq
gøtu - 1 freq
’goat - 1 freq
gie’d - 2 freq
gtt - 1 freq
gxzqhud - 1 freq
gyihdh - 1 freq
gcscot - 1 freq
god’ - 1 freq
gdh - 2 freq
ggtth - 1 freq
gqt - 1 freq
gshd - 1 freq
“guid - 1 freq
gade - 1 freq
gatt - 1 freq
‘get - 1 freq
good” - 1 freq
guid” - 1 freq
ghaoth - 1 freq
'get' - 1 freq
gzet - 1 freq
gowdd - 1 freq
gdiou - 1 freq
gtto - 3 freq
geeat - 1 freq
gwd - 1 freq
‘guid - 1 freq
ghud - 4 freq
MetaPhone code - K0
couthy - 23 freq
cathy - 146 freq
kythe - 65 freq
kith - 13 freq
cou-the - 1 freq
couthie - 71 freq
kathie - 2 freq
goth - 10 freq
kathy - 4 freq
coothie - 4 freq
keith - 21 freq
kithy - 2 freq
quoth - 6 freq
cathai - 1 freq
'cathy - 13 freq
cath - 4 freq
caeth - 1 freq
kyth - 3 freq
cathie - 1 freq
couth - 1 freq
cathay - 1 freq
kath - 3 freq
‘cauthe - 1 freq
goethe - 8 freq
GOTH
Time to execute Levenshtein function - 0.169951 milliseconds
The Levenshtein distance is the minimum number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
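As an illustration of the definition above, here is a minimal Python sketch of the textbook dynamic-programming calculation. It is not the code this site runs; the function name `levenshtein` is just a label chosen for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits turning a into b."""
    # previous[j] holds the distance between the processed prefix of a
    # and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (or keep it)
            ))
        previous = current
    return previous[-1]


print(levenshtein("goth", "got"))    # 1
print(levenshtein("goth", "troth"))  # 2
```

Both values agree with the bracketed distances shown for got and troth in the Levenshtein list.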
Time to execute Double Levenshtein function - 0.323351 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the whole word and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
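A rough Python sketch of that idea follows. The vowel set (a, e, i, o, u) and the helper names are assumptions made for the example, since the page does not say exactly which letters are stripped.

```python
VOWELS = set("aeiou")  # assumption: the site's exact vowel set is not stated


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance (insert, delete, replace each cost 1)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop every vowel, leaving the consonant skeleton."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Whole-word distance plus distance between the vowel-less forms."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("goth", "goethe"))  # 2
print(double_levenshtein("goth", "gosh"))    # 2
print(double_levenshtein("goth", "gothic"))  # 3
```

Under these assumptions the sketch gives the same scores as the Double Levenshtein list for goethe, gosh and gothic: goethe differs from goth only in vowels, so its second pass adds nothing.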
Time to execute SoundEx function - 0.028066 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
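To show how different spellings end up with one code, here is a small Python sketch of the classic Soundex rules (first letter kept, consonants mapped to digits, vowels dropped, padded to four characters). It only handles the letters a to z, which is an assumption of the example; how the site treats accented characters is not stated.

```python
# Consonant groups of classic Soundex; vowels, h, w and y have no digit.
CODES = {}
for digit, letters in (("1", "bfpv"), ("2", "cgjkqsxz"), ("3", "dt"),
                       ("4", "l"), ("5", "mn"), ("6", "r")):
    for letter in letters:
        CODES[letter] = digit


def soundex(word: str) -> str:
    """Return the four-character Soundex code, or "" if no a-z letters."""
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    result = letters[0].upper()
    last = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate two letters with the same digit
        code = CODES.get(ch, "")
        if code and code != last:
            result += code
        last = code  # a vowel resets this, so a repeated digit after it counts
    return (result + "000")[:4]


for word in ("goth", "guid", "gateway", "ghetto"):
    print(word, soundex(word))  # each prints G300
```

All four words appear in the SoundEx list above, which is why such a broad set of spellings shares the code G300.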
Time to execute MetaPhone function - 0.037174 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.000909 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
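The lookup idea can be sketched in a few lines of Python. The table entries below are hypothetical placeholders, not rows from the corpus's real lexicon, and the function names are invented for the example.

```python
# Hypothetical entries for illustration only; the real hand-built table
# is not shown on this page.
LEMMAS = {
    "goth": "GOTH",
    "goths": "GOTH",
    "gothe": "GOTH",  # imagined spelling variant
}


def curated_lemma(word: str):
    """Return the lemma for a word, or None if the table does not cover it."""
    return LEMMAS.get(word.lower())


def curated_neighbours(word: str) -> list:
    """List every table entry that shares the word's lemma."""
    lemma = curated_lemma(word)
    if lemma is None:
        return []  # not all words are covered
    return sorted(w for w, l in LEMMAS.items() if l == lemma)


print(curated_lemma("goth"))       # GOTH
print(curated_neighbours("goth"))  # ['goth', 'gothe', 'goths']
print(curated_lemma("ahint"))      # None (not in this toy table)
```

A dictionary lookup does no string comparison across the vocabulary, which fits this method having the shortest execution time reported above.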