A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to goch in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
goch (0) - 1 freq
gosh (1) - 2 freq
'och (1) - 118 freq
roch (1) - 121 freq
gooch (1) - 2 freq
noch (1) - 2 freq
gogh (1) - 1 freq
loch (1) - 272 freq
gouch (1) - 1 freq
moch (1) - 8 freq
goach (1) - 1 freq
hoch (1) - 10 freq
och (1) - 701 freq
soch (1) - 1 freq
goth (1) - 8 freq
yoch (1) - 1 freq
foc' (2) - 1 freq
such (2) - 278 freq
etch (2) - 2 freq
goit (2) - 3 freq
goon (2) - 33 freq
oche (2) - 1 freq
greh (2) - 1 freq
€œch (2) - 5 freq
logh (2) - 1 freq
goch (0) - 1 freq
gouch (1) - 1 freq
goach (1) - 1 freq
gooch (1) - 2 freq
yoch (2) - 1 freq
soch (2) - 1 freq
hoch (2) - 10 freq
och (2) - 701 freq
moch (2) - 8 freq
goth (2) - 8 freq
gosh (2) - 2 freq
loch (2) - 272 freq
roch (2) - 121 freq
'och (2) - 118 freq
gogh (2) - 1 freq
noch (2) - 2 freq
'ouch (3) - 2 freq
ich (3) - 4 freq
ych (3) - 1 freq
gluh (3) - 1 freq
mooch (3) - 25 freq
vouch (3) - 4 freq
guh (3) - 1 freq
gxh (3) - 1 freq
arch (3) - 13 freq
SoundEx code - G200
gauze - 7 freq
goes - 319 freq
gowk - 47 freq
gaze - 43 freq
gough - 1 freq
gies - 501 freq
guess - 144 freq
goose - 28 freq
gie's - 76 freq
geese - 42 freq
gausie - 3 freq
gas - 85 freq
guys - 464 freq
gig - 36 freq
guy's - 12 freq
guckie - 1 freq
gash - 14 freq
guig - 1 freq
gees - 41 freq
'gees - 1 freq
gaig - 1 freq
'gie's - 8 freq
gesh - 1 freq
gizz - 19 freq
gaes - 173 freq
gawkie - 1 freq
'guess' - 1 freq
giez - 1 freq
guse - 1 freq
'gies - 2 freq
gous - 1 freq
guys' - 1 freq
gus - 19 freq
gauge - 4 freq
gis - 1 freq
gieq - 1 freq
gask - 2 freq
gawks - 6 freq
gowks - 11 freq
gaga - 1 freq
gags - 2 freq
gayge - 1 freq
gass - 4 freq
geis - 8 freq
guiy's - 1 freq
guise - 17 freq
goach - 1 freq
ga-ga - 1 freq
geez - 20 freq
giza - 1 freq
geeky - 1 freq
geek - 3 freq
geeks - 1 freq
geggie - 17 freq
gouch - 1 freq
geg - 15 freq
goochee' - 1 freq
gawk - 3 freq
'gig - 1 freq
gigs - 5 freq
giess - 4 freq
gok - 1 freq
gic - 2 freq
gause - 4 freq
geggy - 2 freq
goog - 1 freq
gouge - 1 freq
giz - 1 freq
goss - 1 freq
gays - 1 freq
gioco - 1 freq
gaeg - 1 freq
gaawk - 1 freq
gawsie - 4 freq
gaius - 1 freq
guik - 1 freq
'gosh - 1 freq
gogh - 1 freq
gos - 1 freq
gucci - 2 freq
ga's - 1 freq
gauzy - 1 freq
gess - 1 freq
geck - 6 freq
giy's - 1 freq
gyos - 1 freq
guga - 1 freq
gouk's - 1 freq
gog - 1 freq
gowk's - 1 freq
geisha - 1 freq
'ghs' - 1 freq
gaws - 1 freq
€œguiss - 1 freq
geise - 1 freq
gauss - 2 freq
ges - 1 freq
€œgesgie - 1 freq
geyse - 1 freq
€œgies - 2 freq
goch - 1 freq
goosey - 6 freq
€˜gies - 3 freq
gcses - 3 freq
gigo - 1 freq
gec - 1 freq
guiss - 1 freq
gows - 2 freq
gawky - 1 freq
geex - 1 freq
gegs - 1 freq
€™gies - 1 freq
gowkie - 1 freq
€œgowk - 1 freq
geos - 2 freq
gag - 3 freq
€™goggz - 1 freq
gwcia - 1 freq
gaz - 1 freq
gaza - 1 freq
gooch - 2 freq
goksu - 1 freq
giggs - 2 freq
giggsy - 24 freq
gaÂ’s - 1 freq
gazza - 3 freq
gush - 1 freq
“geez - 1 freq
geog - 1 freq
gieÂ’s - 1 freq
gyz - 2 freq
gyoza - 3 freq
'gowk' - 1 freq
gokc - 1 freq
gyox - 1 freq
gqzuzia - 1 freq
gaisge - 1 freq
guz - 1 freq
giese - 2 freq
ghzkq - 1 freq
goz - 1 freq
gazzah - 3 freq
gosh - 2 freq
goggsy - 1 freq
ghqce - 1 freq
gegc - 1 freq
gassy - 1 freq
MetaPhone code - KX
catch - 346 freq
cosh - 12 freq
ketch - 9 freq
cash - 84 freq
couch - 67 freq
keech - 41 freq
keich - 8 freq
quaich - 25 freq
gash - 14 freq
coach - 51 freq
cushie - 15 freq
cotch - 8 freq
cushy - 3 freq
kitchie - 71 freq
kich - 1 freq
cautch - 1 freq
qu'she - 2 freq
goach - 1 freq
'cash - 1 freq
gouch - 1 freq
kesh - 1 freq
goochee' - 1 freq
keach - 2 freq
quiche - 3 freq
catia - 1 freq
cooch - 7 freq
'catch - 2 freq
kush - 1 freq
cösh - 1 freq
'gosh - 1 freq
catchy - 3 freq
kishie - 12 freq
'catchie' - 1 freq
cootch - 3 freq
kauch - 3 freq
coutch - 1 freq
goch - 1 freq
gotcha - 2 freq
catchie - 2 freq
kach - 1 freq
gwcia - 1 freq
gooch - 2 freq
gush - 1 freq
cashe - 1 freq
gosh - 2 freq
cach - 1 freq
'cach - 1 freq
GOCH
Time to execute Levenshtein function - 0.167755 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.328342 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.031590 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.040386 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000853 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.