A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to gig in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
gig (0) - 46 freq
vig (1) - 1 freq
ging (1) - 347 freq
pig (1) - 171 freq
gid (1) - 302 freq
'gig (1) - 1 freq
gkg (1) - 1 freq
gim (1) - 1 freq
eig (1) - 1 freq
giy (1) - 16 freq
gis (1) - 1 freq
gag (1) - 4 freq
gif (1) - 207 freq
wig (1) - 34 freq
gog (1) - 1 freq
guig (1) - 1 freq
glig (1) - 1 freq
ig (1) - 16 freq
hig (1) - 4 freq
mig (1) - 2 freq
sig (1) - 3 freq
rig (1) - 45 freq
gaig (1) - 1 freq
geg (1) - 15 freq
tig (1) - 21 freq
gig (0) - 46 freq
guig (1) - 1 freq
gg (1) - 4 freq
gaig (1) - 1 freq
gigo (1) - 1 freq
geg (1) - 15 freq
gag (1) - 4 freq
gog (1) - 1 freq
guga (2) - 1 freq
igg (2) - 1 freq
giz (2) - 1 freq
dig (2) - 85 freq
gi (2) - 27 freq
agg (2) - 6 freq
gip (2) - 3 freq
git (2) - 1244 freq
gio (2) - 1 freq
jig (2) - 28 freq
goog (2) - 1 freq
gigs (2) - 9 freq
eigg (2) - 3 freq
gaeg (2) - 1 freq
geog (2) - 1 freq
egg (2) - 95 freq
gaga (2) - 1 freq
SoundEx code - G200
gauze - 7 freq
goes - 331 freq
gowk - 47 freq
gaze - 44 freq
gough - 1 freq
gies - 516 freq
guess - 148 freq
goose - 28 freq
gie's - 76 freq
geese - 42 freq
gausie - 3 freq
gas - 88 freq
guys - 472 freq
gig - 46 freq
guy's - 13 freq
guckie - 1 freq
gash - 14 freq
guig - 1 freq
gees - 41 freq
'gees - 1 freq
gaig - 1 freq
'gie's - 8 freq
gesh - 1 freq
gizz - 19 freq
gaes - 173 freq
gawkie - 1 freq
'guess' - 1 freq
giez - 1 freq
guse - 1 freq
'gies - 2 freq
gous - 1 freq
guys' - 1 freq
gus - 21 freq
gauge - 4 freq
gis - 1 freq
gieq - 1 freq
gask - 2 freq
gawks - 6 freq
gowks - 11 freq
gaga - 1 freq
gags - 2 freq
gayge - 1 freq
gass - 4 freq
geis - 8 freq
guiy's - 1 freq
guise - 17 freq
gag - 4 freq
gays - 2 freq
geeky - 2 freq
gush - 2 freq
gigs - 9 freq
giess - 5 freq
giows - 1 freq
goach - 1 freq
ga-ga - 1 freq
geez - 20 freq
giza - 1 freq
geek - 3 freq
geeks - 1 freq
geggie - 17 freq
gouch - 1 freq
geg - 15 freq
goochee' - 1 freq
gawk - 3 freq
'gig - 1 freq
gok - 1 freq
gic - 2 freq
gause - 4 freq
geggy - 2 freq
goog - 1 freq
gouge - 1 freq
giz - 1 freq
goss - 1 freq
gioco - 1 freq
gaeg - 1 freq
gaawk - 1 freq
gawsie - 4 freq
gaius - 1 freq
guik - 1 freq
'gosh - 1 freq
gogh - 1 freq
gos - 1 freq
gucci - 2 freq
ga's - 1 freq
gauzy - 1 freq
gess - 1 freq
geck - 6 freq
giy's - 1 freq
gyos - 1 freq
guga - 1 freq
gouk's - 1 freq
gog - 1 freq
gowk's - 1 freq
geisha - 1 freq
'ghs' - 1 freq
gaws - 1 freq
€œguiss - 1 freq
geise - 1 freq
gauss - 2 freq
ges - 1 freq
€œgesgie - 1 freq
geyse - 1 freq
€œgies - 2 freq
goch - 1 freq
goosey - 6 freq
€˜gies - 3 freq
gcses - 3 freq
gigo - 1 freq
gec - 1 freq
guiss - 1 freq
gows - 2 freq
gawky - 1 freq
geex - 1 freq
gegs - 1 freq
€™gies - 1 freq
gowkie - 1 freq
€œgowk - 1 freq
geos - 2 freq
€™goggz - 1 freq
gwcia - 1 freq
gaz - 1 freq
gaza - 1 freq
gooch - 2 freq
goksu - 1 freq
giggs - 2 freq
giggsy - 24 freq
gaÂ’s - 1 freq
gazza - 3 freq
“geez - 1 freq
geog - 1 freq
gieÂ’s - 1 freq
gyz - 2 freq
gyoza - 3 freq
'gowk' - 1 freq
gokc - 1 freq
gyox - 1 freq
gqzuzia - 1 freq
gaisge - 1 freq
guz - 1 freq
giese - 2 freq
ghzkq - 1 freq
goz - 1 freq
gazzah - 3 freq
gosh - 2 freq
goggsy - 1 freq
ghqce - 1 freq
gegc - 1 freq
gassy - 1 freq
MetaPhone code - JK
jaggy - 22 freq
jouk - 62 freq
jig - 28 freq
jock - 512 freq
joke - 145 freq
jug - 26 freq
jakey - 6 freq
joco - 37 freq
'joco - 1 freq
'joco' - 1 freq
jackie - 38 freq
jack - 230 freq
joog - 19 freq
jook - 24 freq
gig - 46 freq
joug - 8 freq
jaggie - 8 freq
joggie - 2 freq
jockie - 28 freq
jeuk - 1 freq
jag - 26 freq
jockie' - 1 freq
jake - 71 freq
gieq - 1 freq
jockey - 3 freq
jog - 7 freq
'jock - 3 freq
geeky - 2 freq
jeg - 3 freq
joackey - 3 freq
geek - 3 freq
geggie - 17 freq
geg - 15 freq
jeggie - 1 freq
'gig - 1 freq
jeck - 16 freq
gic - 2 freq
jike - 5 freq
geggy - 2 freq
gioco - 1 freq
juk - 2 freq
joukie - 2 freq
jek' - 1 freq
'jackie - 1 freq
jak - 1 freq
jeeg - 1 freq
jk - 8 freq
jaikee - 1 freq
jaikey - 1 freq
jg - 3 freq
jiggy - 9 freq
jocky - 25 freq
'jack' - 1 freq
hjook - 2 freq
geck - 6 freq
'jig - 1 freq
jc - 10 freq
jaikie' - 1 freq
juke - 7 freq
gigo - 1 freq
jagg - 1 freq
gec - 1 freq
€œjackie - 1 freq
hjuk - 1 freq
joakey - 1 freq
€œjock - 1 freq
joeq - 1 freq
jkeoi - 1 freq
jic - 1 freq
'jag' - 1 freq
jock' - 1 freq
jac - 3 freq
jik - 1 freq
geog - 1 freq
joak - 1 freq
yjc - 1 freq
joke' - 1 freq
hjquii - 1 freq
hjc - 1 freq
jok - 1 freq
jgw - 1 freq
jhg - 1 freq
jek - 1 freq
jaigw - 1 freq
jacky - 1 freq
GIG
Time to execute Levenshtein function - 0.181985 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.332871 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027577 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.039615 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001000 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.