A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow


Similar words to geg in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
Levenshtein - distance in brackets
geg (0) - 15 freq
gei (1) - 15 freq
geog (1) - 1 freq
gkg (1) - 1 freq
gea (1) - 1 freq
gem (1) - 23 freq
eg (1) - 23 freq
gegc (1) - 1 freq
gec (1) - 1 freq
ges (1) - 1 freq
beg (1) - 74 freq
jeg (1) - 2 freq
geb (1) - 1 freq
neg (1) - 1 freq
geo (1) - 19 freq
ger (1) - 2 freq
gleg (1) - 127 freq
geng (1) - 131 freq
seg (1) - 1 freq
get (1) - 5117 freq
peg (1) - 30 freq
veg (1) - 17 freq
gee (1) - 178 freq
weg (1) - 2 freq
gig (1) - 36 freq
Double Levenshtein - summed distance in brackets
geg (0) - 15 freq
gig (1) - 36 freq
gog (1) - 1 freq
gg (1) - 4 freq
gag (1) - 3 freq
geog (1) - 1 freq
gaeg (1) - 1 freq
gey (2) - 1312 freq
meg (2) - 133 freq
deg (2) - 2 freq
ge (2) - 13 freq
gel (2) - 13 freq
ged (2) - 18 freq
gen (2) - 11 freq
leg (2) - 199 freq
feg (2) - 3 freq
guga (2) - 1 freq
egg (2) - 95 freq
gaig (2) - 1 freq
gaga (2) - 1 freq
agog (2) - 1 freq
gei (2) - 15 freq
igg (2) - 1 freq
goog (2) - 1 freq
gigo (2) - 1 freq
SoundEx code - G200
gauze - 7 freq
goes - 319 freq
gowk - 47 freq
gaze - 43 freq
gough - 1 freq
gies - 501 freq
guess - 144 freq
goose - 28 freq
gie's - 76 freq
geese - 42 freq
gausie - 3 freq
gas - 85 freq
guys - 464 freq
gig - 36 freq
guy's - 12 freq
guckie - 1 freq
gash - 14 freq
guig - 1 freq
gees - 41 freq
'gees - 1 freq
gaig - 1 freq
'gie's - 8 freq
gesh - 1 freq
gizz - 19 freq
gaes - 173 freq
gawkie - 1 freq
'guess' - 1 freq
giez - 1 freq
guse - 1 freq
'gies - 2 freq
gous - 1 freq
guys' - 1 freq
gus - 19 freq
gauge - 4 freq
gis - 1 freq
gieq - 1 freq
gask - 2 freq
gawks - 6 freq
gowks - 11 freq
gaga - 1 freq
gags - 2 freq
gayge - 1 freq
gass - 4 freq
geis - 8 freq
guiy's - 1 freq
guise - 17 freq
goach - 1 freq
ga-ga - 1 freq
geez - 20 freq
giza - 1 freq
geeky - 1 freq
geek - 3 freq
geeks - 1 freq
geggie - 17 freq
gouch - 1 freq
geg - 15 freq
goochee' - 1 freq
gawk - 3 freq
'gig - 1 freq
gigs - 5 freq
giess - 4 freq
gok - 1 freq
gic - 2 freq
gause - 4 freq
geggy - 2 freq
goog - 1 freq
gouge - 1 freq
giz - 1 freq
goss - 1 freq
gays - 1 freq
gioco - 1 freq
gaeg - 1 freq
gaawk - 1 freq
gawsie - 4 freq
gaius - 1 freq
guik - 1 freq
'gosh - 1 freq
gogh - 1 freq
gos - 1 freq
gucci - 2 freq
ga's - 1 freq
gauzy - 1 freq
gess - 1 freq
geck - 6 freq
giy's - 1 freq
gyos - 1 freq
guga - 1 freq
gouk's - 1 freq
gog - 1 freq
gowk's - 1 freq
geisha - 1 freq
'ghs' - 1 freq
gaws - 1 freq
“guiss - 1 freq
geise - 1 freq
gauss - 2 freq
ges - 1 freq
“gesgie - 1 freq
geyse - 1 freq
“gies - 2 freq
goch - 1 freq
goosey - 6 freq
‘gies - 3 freq
gcses - 3 freq
gigo - 1 freq
gec - 1 freq
guiss - 1 freq
gows - 2 freq
gawky - 1 freq
geex - 1 freq
gegs - 1 freq
’gies - 1 freq
gowkie - 1 freq
“gowk - 1 freq
geos - 2 freq
gag - 3 freq
’goggz - 1 freq
gwcia - 1 freq
gaz - 1 freq
gaza - 1 freq
gooch - 2 freq
goksu - 1 freq
giggs - 2 freq
giggsy - 24 freq
ga’s - 1 freq
gazza - 3 freq
gush - 1 freq
“geez - 1 freq
geog - 1 freq
gie’s - 1 freq
gyz - 2 freq
gyoza - 3 freq
'gowk' - 1 freq
gokc - 1 freq
gyox - 1 freq
gqzuzia - 1 freq
gaisge - 1 freq
guz - 1 freq
giese - 2 freq
ghzkq - 1 freq
goz - 1 freq
gazzah - 3 freq
gosh - 2 freq
goggsy - 1 freq
ghqce - 1 freq
gegc - 1 freq
gassy - 1 freq
MetaPhone code - JK
jaggy - 21 freq
jouk - 62 freq
jig - 26 freq
jock - 511 freq
joke - 140 freq
jug - 26 freq
jakey - 5 freq
joco - 37 freq
'joco - 1 freq
'joco' - 1 freq
jackie - 38 freq
jack - 224 freq
joog - 19 freq
jook - 24 freq
gig - 36 freq
joug - 8 freq
jaggie - 8 freq
joggie - 2 freq
jockie - 28 freq
jeuk - 1 freq
jag - 26 freq
jockie' - 1 freq
jake - 71 freq
gieq - 1 freq
jockey - 3 freq
jog - 7 freq
joackey - 3 freq
geeky - 1 freq
geek - 3 freq
geggie - 17 freq
geg - 15 freq
jeggie - 1 freq
'gig - 1 freq
jeck - 16 freq
gic - 2 freq
jike - 5 freq
geggy - 2 freq
gioco - 1 freq
juk - 2 freq
joukie - 2 freq
jek' - 1 freq
'jackie - 1 freq
jak - 1 freq
jeeg - 1 freq
jk - 8 freq
jaikee - 1 freq
jaikey - 1 freq
jg - 3 freq
jiggy - 9 freq
jocky - 25 freq
'jack' - 1 freq
hjook - 2 freq
jeg - 2 freq
geck - 6 freq
'jock - 2 freq
'jig - 1 freq
jc - 10 freq
jaikie' - 1 freq
juke - 7 freq
gigo - 1 freq
jagg - 1 freq
gec - 1 freq
“jackie - 1 freq
hjuk - 1 freq
joakey - 1 freq
“jock - 1 freq
joeq - 1 freq
jkeoi - 1 freq
jic - 1 freq
'jag' - 1 freq
jock' - 1 freq
jac - 3 freq
jik - 1 freq
geog - 1 freq
joak - 1 freq
yjc - 1 freq
joke' - 1 freq
hjquii - 1 freq
hjc - 1 freq
jok - 1 freq
jgw - 1 freq
jhg - 1 freq
jek - 1 freq
jaigw - 1 freq
jacky - 1 freq
Manually curated - GEG
Time to execute Levenshtein function - 0.176560 milliseconds
The Levenshtein distance is the minimum number of single characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
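As an illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming calculation. It is not the code this site runs; the function name `levenshtein` is chosen only for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Return the minimum number of single-character edits turning a into b."""
    # previous[j] is the distance between the prefix of a handled so far
    # (initially empty) and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning the first i characters of a into "" takes i deletions
        for j, cb in enumerate(b, start=1):
            current.append(min(
                previous[j] + 1,               # delete ca
                current[j - 1] + 1,            # insert cb
                previous[j - 1] + (ca != cb),  # replace ca with cb (free if equal)
            ))
        previous = current
    return previous[-1]


print(levenshtein("geg", "gig"))  # 1: replace e with i
print(levenshtein("geg", "egg"))  # 2
```

Only two rows of the table are kept at a time, which is enough when just the distance is needed and not the sequence of edits.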
Time to execute Double Levenshtein function - 0.408372 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
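A rough sketch of that idea in Python follows. The vowel set (a, e, i, o, u, with y treated as a consonant) is an assumption, chosen because it reproduces scores shown in the list above such as gig (1) and gey (2); the site's actual rules may differ. The names `strip_vowels` and `double_levenshtein` are made up for the example.

```python
VOWELS = set("aeiou")  # assumption: y counts as a consonant


def levenshtein(a: str, b: str) -> int:
    """Standard edit distance, computed row by row."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + (ca != cb)))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop the assumed vowels, keeping the consonant skeleton of the word."""
    return "".join(c for c in word if c.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("geg", "gig"))  # 1: vowel change only, skeletons match
print(double_levenshtein("geg", "gey"))  # 2: the consonant change counts twice
```

A change to a vowel is counted once, while a change to a consonant shows up in both passes, which is why vowel variants such as gig and gog rank ahead of gey and meg.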
Time to execute SoundEx function - 0.027493 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
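To make the encoding concrete, here is a minimal Python sketch of the commonly described American Soundex rules: keep the first letter, map the remaining consonants to digits, collapse adjacent repeats, and pad to four characters. It is an illustration under those assumptions, not the function this site calls, and edge cases may be handled differently there.

```python
# Consonant groups and their Soundex digits; vowels, h, w and y have no digit.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word: str) -> str:
    """Return a four-character Soundex code, or "" if the word has no letters."""
    letters = [c for c in word.lower() if "a" <= c <= "z"]
    if not letters:
        return ""
    result = letters[0].upper()
    prev = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate two letters with the same code
        code = CODES.get(ch, "")
        if code and code != prev:
            result += code
        prev = code  # a vowel resets prev, so a repeated code after it is kept
        if len(result) == 4:
            break
    return (result + "000")[:4]


print(soundex("geg"))   # G200
print(soundex("gowk"))  # G200
```

Because geg and gowk reduce to the same code, they land in the same group, which is how the list headed G200 above is assembled.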
Time to execute MetaPhone function - 0.037080 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.000822 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
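The lookup can be pictured as a plain dictionary from word form to lemma. The sketch below is only an illustration: the table name `LEMMA_TABLE`, the function `lookup_lemma` and the entries themselves are hypothetical stand-ins, not the site's actual lexicon.

```python
from typing import Optional

# Hypothetical entries for illustration only; the real lexicon is built by hand.
LEMMA_TABLE = {
    "geg": "GEG",
    "gegs": "GEG",
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the curated lemma for a word, or None if the word is not covered."""
    return LEMMA_TABLE.get(word.lower())


print(lookup_lemma("geg"))     # GEG
print(lookup_lemma("unseen"))  # None
```

A single dictionary lookup involves no string comparison across the vocabulary, which fits with this method having the shortest execution time reported above.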