A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to gis in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
gis (0) - 1 freq
git (1) - 1232 freq
bis (1) - 19 freq
kis (1) - 141 freq
zis (1) - 2 freq
jis (1) - 36 freq
lis (1) - 1 freq
gos (1) - 1 freq
ges (1) - 1 freq
'is (1) - 150 freq
gic (1) - 2 freq
mis (1) - 3 freq
yis (1) - 275 freq
gcs (1) - 1 freq
gris (1) - 1 freq
qis (1) - 1 freq
gps (1) - 4 freq
girs (1) - 4 freq
gif (1) - 207 freq
gies (1) - 501 freq
giy (1) - 16 freq
dis (1) - 1549 freq
gits (1) - 102 freq
giz (1) - 1 freq
fis (1) - 5 freq
gis (0) - 1 freq
gos (1) - 1 freq
ges (1) - 1 freq
gs (1) - 3 freq
gies (1) - 501 freq
gus (1) - 19 freq
gas (1) - 85 freq
geis (1) - 8 freq
pis (2) - 2 freq
eis (2) - 478 freq
guise (2) - 17 freq
egs (2) - 2 freq
gin (2) - 1982 freq
uis (2) - 17 freq
vis (2) - 1 freq
gigs (2) - 5 freq
gi (2) - 27 freq
nis (2) - 2 freq
his (2) - 17050 freq
ages (2) - 151 freq
gim (2) - 1 freq
wis (2) - 27947 freq
guys (2) - 464 freq
gaes (2) - 173 freq
ags (2) - 1 freq
SoundEx code - G200
gauze - 7 freq
goes - 319 freq
gowk - 47 freq
gaze - 43 freq
gough - 1 freq
gies - 501 freq
guess - 144 freq
goose - 28 freq
gie's - 76 freq
geese - 42 freq
gausie - 3 freq
gas - 85 freq
guys - 464 freq
gig - 36 freq
guy's - 12 freq
guckie - 1 freq
gash - 14 freq
guig - 1 freq
gees - 41 freq
'gees - 1 freq
gaig - 1 freq
'gie's - 8 freq
gesh - 1 freq
gizz - 19 freq
gaes - 173 freq
gawkie - 1 freq
'guess' - 1 freq
giez - 1 freq
guse - 1 freq
'gies - 2 freq
gous - 1 freq
guys' - 1 freq
gus - 19 freq
gauge - 4 freq
gis - 1 freq
gieq - 1 freq
gask - 2 freq
gawks - 6 freq
gowks - 11 freq
gaga - 1 freq
gags - 2 freq
gayge - 1 freq
gass - 4 freq
geis - 8 freq
guiy's - 1 freq
guise - 17 freq
goach - 1 freq
ga-ga - 1 freq
geez - 20 freq
giza - 1 freq
geeky - 1 freq
geek - 3 freq
geeks - 1 freq
geggie - 17 freq
gouch - 1 freq
geg - 15 freq
goochee' - 1 freq
gawk - 3 freq
'gig - 1 freq
gigs - 5 freq
giess - 4 freq
gok - 1 freq
gic - 2 freq
gause - 4 freq
geggy - 2 freq
goog - 1 freq
gouge - 1 freq
giz - 1 freq
goss - 1 freq
gays - 1 freq
gioco - 1 freq
gaeg - 1 freq
gaawk - 1 freq
gawsie - 4 freq
gaius - 1 freq
guik - 1 freq
'gosh - 1 freq
gogh - 1 freq
gos - 1 freq
gucci - 2 freq
ga's - 1 freq
gauzy - 1 freq
gess - 1 freq
geck - 6 freq
giy's - 1 freq
gyos - 1 freq
guga - 1 freq
gouk's - 1 freq
gog - 1 freq
gowk's - 1 freq
geisha - 1 freq
'ghs' - 1 freq
gaws - 1 freq
€œguiss - 1 freq
geise - 1 freq
gauss - 2 freq
ges - 1 freq
€œgesgie - 1 freq
geyse - 1 freq
€œgies - 2 freq
goch - 1 freq
goosey - 6 freq
€˜gies - 3 freq
gcses - 3 freq
gigo - 1 freq
gec - 1 freq
guiss - 1 freq
gows - 2 freq
gawky - 1 freq
geex - 1 freq
gegs - 1 freq
€™gies - 1 freq
gowkie - 1 freq
€œgowk - 1 freq
geos - 2 freq
gag - 3 freq
€™goggz - 1 freq
gwcia - 1 freq
gaz - 1 freq
gaza - 1 freq
gooch - 2 freq
goksu - 1 freq
giggs - 2 freq
giggsy - 24 freq
gaÂ’s - 1 freq
gazza - 3 freq
gush - 1 freq
“geez - 1 freq
geog - 1 freq
gieÂ’s - 1 freq
gyz - 2 freq
gyoza - 3 freq
'gowk' - 1 freq
gokc - 1 freq
gyox - 1 freq
gqzuzia - 1 freq
gaisge - 1 freq
guz - 1 freq
giese - 2 freq
ghzkq - 1 freq
goz - 1 freq
gazzah - 3 freq
gosh - 2 freq
goggsy - 1 freq
ghqce - 1 freq
gegc - 1 freq
gassy - 1 freq
MetaPhone code - JS
gies - 501 freq
jis - 36 freq
jaws - 48 freq
jeez - 10 freq
gie's - 76 freq
joys - 32 freq
geese - 42 freq
joyce - 138 freq
josie - 18 freq
'jeezo - 1 freq
juice - 68 freq
juicy - 15 freq
gees - 41 freq
'gees - 1 freq
'gie's - 8 freq
jessie - 258 freq
gizz - 19 freq
giez - 1 freq
jews - 23 freq
jees - 3 freq
jeesy - 2 freq
joss - 2 freq
'gies - 2 freq
'jees - 1 freq
gis - 1 freq
jess - 21 freq
jows - 2 freq
jazz - 11 freq
joy's - 1 freq
geis - 8 freq
jew's - 2 freq
jus - 39 freq
jaas - 15 freq
joes - 4 freq
geez - 20 freq
giza - 1 freq
jassie - 3 freq
joycie - 6 freq
giess - 4 freq
js - 5 freq
giz - 1 freq
jeezo - 15 freq
jesse - 7 freq
jeeeez - 1 freq
joe's - 4 freq
joey's - 3 freq
jeuce - 1 freq
jise - 1 freq
juse - 1 freq
gess - 1 freq
giy's - 1 freq
jaw's - 1 freq
hgis - 1 freq
jesssie' - 1 freq
geise - 1 freq
ges - 1 freq
jauss - 7 freq
geyse - 1 freq
€œgies - 2 freq
€˜jeezo - 1 freq
jesu - 2 freq
€˜gies - 3 freq
josu - 1 freq
'jeezo' - 1 freq
€œjosie - 1 freq
jeeso - 1 freq
€œjessie - 2 freq
jazza - 2 freq
jjeezo - 1 freq
€™gies - 1 freq
geos - 2 freq
jysu - 1 freq
wjz - 2 freq
jjs - 52 freq
jizz - 1 freq
jhs - 2 freq
yjsu - 1 freq
jeezoo - 1 freq
joos - 6 freq
j's - 2 freq
“geez - 1 freq
jyhz - 1 freq
gieÂ’s - 1 freq
gyz - 2 freq
jas - 1 freq
giese - 2 freq
jz - 1 freq
jyz - 1 freq
yjeizaoa - 1 freq
GIS
Time to execute Levenshtein function - 0.297325 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.488838 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028382 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.072476 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000839 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.