A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to gk in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
gk (0) - 2 freq
gt (1) - 9 freq
‹k (1) - 3 freq
gki (1) - 1 freq
g' (1) - 1 freq
gkg (1) - 1 freq
gv (1) - 11 freq
dk (1) - 4 freq
ek (1) - 6 freq
gz (1) - 3 freq
ªk (1) - 3 freq
gx (1) - 4 freq
ok (1) - 116 freq
gkh (1) - 1 freq
rk (1) - 2 freq
gy (1) - 11 freq
gc (1) - 3 freq
ga (1) - 29 freq
pk (1) - 4 freq
ygk (1) - 1 freq
fk (1) - 9 freq
nk (1) - 3 freq
go (1) - 1980 freq
vk (1) - 5 freq
kk (1) - 4 freq
gk (0) - 2 freq
ygk (1) - 1 freq
gki (1) - 1 freq
gok (1) - 1 freq
lk (2) - 9 freq
wk (2) - 11 freq
gl (2) - 2 freq
sk (2) - 4 freq
gq (2) - 3 freq
qk (2) - 1 freq
uk (2) - 359 freq
gu (2) - 8 freq
tk (2) - 3 freq
gs (2) - 3 freq
hk (2) - 9 freq
mgk (2) - 1 freq
jk (2) - 8 freq
lgk (2) - 2 freq
gw (2) - 1 freq
mk (2) - 1 freq
g (2) - 278 freq
k (2) - 205 freq
ak (2) - 14 freq
guik (2) - 1 freq
geek (2) - 3 freq
SoundEx code - G000
gie - 2567 freq
gey - 1316 freq
go - 1980 freq
gae - 503 freq
gay - 69 freq
'gie - 26 freq
guy - 222 freq
gooooo - 1 freq
g - 278 freq
gu - 8 freq
gee - 178 freq
'go - 11 freq
gi'e - 2 freq
'gae - 3 freq
gq - 3 freq
ge - 13 freq
gye - 194 freq
gaw - 5 freq
ga - 29 freq
goa - 2 freq
gow - 7 freq
gce - 4 freq
geiy - 16 freq
'gee - 1 freq
goo - 5 freq
guiy - 1 freq
gwee - 1 freq
giy - 16 freq
g'wa - 17 freq
g'awa - 7 freq
gsoh - 2 freq
'gsoh' - 1 freq
gsoh' - 1 freq
gie-awa - 1 freq
gea - 1 freq
gooey - 1 freq
goe - 1 freq
'gey - 2 freq
gaah - 1 freq
gui - 1 freq
g' - 1 freq
gös - 2 freq
go' - 1 freq
'gh' - 1 freq
gaye - 2 freq
gaia - 3 freq
'go' - 2 freq
gcse - 5 freq
gy - 11 freq
geo - 19 freq
gyo - 6 freq
gaey - 1 freq
gei - 15 freq
gau' - 4 freq
gou - 1 freq
-go - 1 freq
gjo - 2 freq
ªg - 1 freq
gøs - 1 freq
gi - 27 freq
gh - 5 freq
gaea - 1 freq
gio - 1 freq
€˜g - 5 freq
€œg - 2 freq
€œgo - 6 freq
gí - 1 freq
€œgie - 8 freq
gau - 2 freq
€˜go - 2 freq
€˜gie - 3 freq
€œgey - 1 freq
€œgae - 1 freq
€”go - 1 freq
€˜-gh - 1 freq
€œguy - 1 freq
gz - 3 freq
gg - 4 freq
guh - 1 freq
gk - 2 freq
g'wa' - 1 freq
gx - 4 freq
gcsw - 1 freq
gs - 3 freq
‘gie - 1 freq
gawa - 5 freq
gca - 1 freq
gki - 1 freq
ggks - 1 freq
gkk - 1 freq
gxx - 1 freq
gya - 1 freq
gai - 1 freq
gco - 1 freq
gj - 5 freq
gsy - 1 freq
gie' - 5 freq
goooooo - 1 freq
gc - 3 freq
gyy - 1 freq
gcc - 2 freq
“gie - 2 freq
“gey - 1 freq
gxh - 1 freq
gcs - 1 freq
gcwih - 1 freq
güs - 1 freq
gkg - 1 freq
ggc - 1 freq
gyah - 1 freq
gquo - 1 freq
gcuo - 1 freq
gqkzs - 1 freq
gxz - 1 freq
gqe - 1 freq
'gey' - 1 freq
guw - 1 freq
gjxjjc - 1 freq
gkh - 1 freq
gccc - 1 freq
gqo - 1 freq
gw - 1 freq
MetaPhone code - KK
quick - 374 freq
keek - 203 freq
gowk - 47 freq
cake - 166 freq
cook - 201 freq
cocky - 12 freq
cog - 3 freq
kick - 122 freq
cock - 82 freq
kg - 4 freq
guckie - 1 freq
guig - 1 freq
gaig - 1 freq
cack - 4 freq
gawkie - 1 freq
'quick - 5 freq
kowk - 1 freq
coggie - 5 freq
gq - 3 freq
cowk - 4 freq
cuik - 4 freq
c-cou - 1 freq
queek - 22 freq
gaga - 1 freq
cauk - 5 freq
cc - 19 freq
coke - 34 freq
gag - 4 freq
quickie - 1 freq
cookie - 15 freq
quake - 7 freq
ga-ga - 1 freq
coco - 5 freq
'cook - 1 freq
quack - 7 freq
gawk - 3 freq
gok - 1 freq
'cocky' - 2 freq
keekiy - 1 freq
kock - 1 freq
cokey - 1 freq
goog - 1 freq
coca - 1 freq
keik - 13 freq
quïck - 6 freq
gaeg - 1 freq
cacao - 14 freq
gaawk - 1 freq
guik - 1 freq
cak - 1 freq
quik - 5 freq
kik - 1 freq
cocoa - 3 freq
koko - 1 freq
queeck - 1 freq
kek - 1 freq
quaak - 3 freq
caa-caa - 1 freq
keekie - 1 freq
guga - 1 freq
coq - 1 freq
cookie' - 2 freq
gog - 1 freq
kiek - 1 freq
€œquick - 3 freq
ckeck - 1 freq
cok - 1 freq
caig - 1 freq
kake - 1 freq
quyk - 1 freq
cuckoo - 9 freq
cac - 1 freq
€˜quick - 1 freq
€œcuckoo - 4 freq
€˜cock - 1 freq
€œcowk - 2 freq
gawky - 1 freq
keg - 2 freq
quäck - 1 freq
cag - 1 freq
gowkie - 1 freq
€œgowk - 1 freq
kc - 7 freq
kyg - 1 freq
gk - 2 freq
cqah - 1 freq
qwqa - 1 freq
cg - 3 freq
qku - 1 freq
ygk - 1 freq
yqc - 1 freq
kuq - 1 freq
qg - 1 freq
ygq - 1 freq
gca - 1 freq
yqg - 1 freq
gki - 1 freq
qcu - 1 freq
cqu - 1 freq
wkkc - 1 freq
gkk - 1 freq
kca - 1 freq
kic - 1 freq
gco - 1 freq
qc - 2 freq
hcg - 1 freq
kqu - 1 freq
ckwg - 1 freq
gc - 3 freq
hqec - 1 freq
qgu - 1 freq
'keek' - 3 freq
'gowk' - 1 freq
kqe - 1 freq
ggc - 1 freq
ykg - 1 freq
gquo - 1 freq
kwc - 1 freq
gcuo - 1 freq
qk - 1 freq
cqe - 1 freq
gqe - 1 freq
cookoo - 1 freq
wcq - 1 freq
ygka - 1 freq
gkh - 1 freq
ckkg - 1 freq
gqo - 1 freq
qiki - 1 freq
hcc - 1 freq
GK
Time to execute Levenshtein function - 0.190402 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.320150 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.029404 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037842 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000951 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.