A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow


Similar words to guys in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
Levenshtein - distance in brackets
guys (0) - 464 freq
gays (1) - 1 freq
gums (1) - 16 freq
guys' (1) - 1 freq
buys (1) - 12 freq
guvs (1) - 1 freq
wuys (1) - 1 freq
guy (1) - 210 freq
gurs (1) - 1 freq
guy's (1) - 12 freq
guts (1) - 73 freq
guns (1) - 71 freq
guds (1) - 1 freq
duys (1) - 1 freq
suys (1) - 1 freq
gus (1) - 19 freq
gubs (1) - 1 freq
gauns (2) - 30 freq
queys (2) - 1 freq
gems (2) - 13 freq
pups (2) - 9 freq
guids (2) - 24 freq
beuys (2) - 7 freq
nuts (2) - 55 freq
guyed (2) - 1 freq
Double Levenshtein - distance in brackets
guys (0) - 464 freq
gus (1) - 19 freq
gays (1) - 1 freq
goes (2) - 319 freq
ges (2) - 1 freq
geis (2) - 8 freq
gaes (2) - 173 freq
agus (2) - 3 freq
guise (2) - 17 freq
gos (2) - 1 freq
gs (2) - 3 freq
gous (2) - 1 freq
guse (2) - 1 freq
gsy (2) - 1 freq
gis (2) - 1 freq
gies (2) - 501 freq
geos (2) - 2 freq
gyos (2) - 1 freq
gas (2) - 85 freq
buys (2) - 12 freq
guy's (2) - 12 freq
gums (2) - 16 freq
gurs (2) - 1 freq
guy (2) - 210 freq
guvs (2) - 1 freq
SoundEx code - G200
gauze - 7 freq
goes - 319 freq
gowk - 47 freq
gaze - 43 freq
gough - 1 freq
gies - 501 freq
guess - 144 freq
goose - 28 freq
gie's - 76 freq
geese - 42 freq
gausie - 3 freq
gas - 85 freq
guys - 464 freq
gig - 36 freq
guy's - 12 freq
guckie - 1 freq
gash - 14 freq
guig - 1 freq
gees - 41 freq
'gees - 1 freq
gaig - 1 freq
'gie's - 8 freq
gesh - 1 freq
gizz - 19 freq
gaes - 173 freq
gawkie - 1 freq
'guess' - 1 freq
giez - 1 freq
guse - 1 freq
'gies - 2 freq
gous - 1 freq
guys' - 1 freq
gus - 19 freq
gauge - 4 freq
gis - 1 freq
gieq - 1 freq
gask - 2 freq
gawks - 6 freq
gowks - 11 freq
gaga - 1 freq
gags - 2 freq
gayge - 1 freq
gass - 4 freq
geis - 8 freq
guiy's - 1 freq
guise - 17 freq
goach - 1 freq
ga-ga - 1 freq
geez - 20 freq
giza - 1 freq
geeky - 1 freq
geek - 3 freq
geeks - 1 freq
geggie - 17 freq
gouch - 1 freq
geg - 15 freq
goochee' - 1 freq
gawk - 3 freq
'gig - 1 freq
gigs - 5 freq
giess - 4 freq
gok - 1 freq
gic - 2 freq
gause - 4 freq
geggy - 2 freq
goog - 1 freq
gouge - 1 freq
giz - 1 freq
goss - 1 freq
gays - 1 freq
gioco - 1 freq
gaeg - 1 freq
gaawk - 1 freq
gawsie - 4 freq
gaius - 1 freq
guik - 1 freq
'gosh - 1 freq
gogh - 1 freq
gos - 1 freq
gucci - 2 freq
ga's - 1 freq
gauzy - 1 freq
gess - 1 freq
geck - 6 freq
giy's - 1 freq
gyos - 1 freq
guga - 1 freq
gouk's - 1 freq
gog - 1 freq
gowk's - 1 freq
geisha - 1 freq
'ghs' - 1 freq
gaws - 1 freq
“guiss - 1 freq
geise - 1 freq
gauss - 2 freq
ges - 1 freq
“gesgie - 1 freq
geyse - 1 freq
“gies - 2 freq
goch - 1 freq
goosey - 6 freq
‘gies - 3 freq
gcses - 3 freq
gigo - 1 freq
gec - 1 freq
guiss - 1 freq
gows - 2 freq
gawky - 1 freq
geex - 1 freq
gegs - 1 freq
’gies - 1 freq
gowkie - 1 freq
“gowk - 1 freq
geos - 2 freq
gag - 3 freq
’goggz - 1 freq
gwcia - 1 freq
gaz - 1 freq
gaza - 1 freq
gooch - 2 freq
goksu - 1 freq
giggs - 2 freq
giggsy - 24 freq
ga’s - 1 freq
gazza - 3 freq
gush - 1 freq
“geez - 1 freq
geog - 1 freq
gie’s - 1 freq
gyz - 2 freq
gyoza - 3 freq
'gowk' - 1 freq
gokc - 1 freq
gyox - 1 freq
gqzuzia - 1 freq
gaisge - 1 freq
guz - 1 freq
giese - 2 freq
ghzkq - 1 freq
goz - 1 freq
gazzah - 3 freq
gosh - 2 freq
goggsy - 1 freq
ghqce - 1 freq
gegc - 1 freq
gassy - 1 freq
MetaPhone code - KS
gauze - 7 freq
cosy - 55 freq
goes - 319 freq
case - 440 freq
gaze - 43 freq
keys - 60 freq
kis - 141 freq
cause - 1187 freq
guess - 144 freq
goose - 28 freq
kiss - 120 freq
gausie - 3 freq
'cause - 16 freq
gas - 85 freq
queues - 9 freq
coos - 71 freq
cos - 449 freq
quiz - 19 freq
guys - 464 freq
guy's - 12 freq
causey - 25 freq
'quiz - 1 freq
caas - 44 freq
cuz - 8 freq
coz - 44 freq
'cos - 7 freq
cows - 5 freq
gaes - 173 freq
cou's - 1 freq
hkes - 1 freq
caws - 36 freq
'guess' - 1 freq
caus - 8 freq
caz - 2 freq
cassie - 10 freq
guse - 1 freq
quasi - 1 freq
ca's - 4 freq
gous - 1 freq
guys' - 1 freq
queue's - 1 freq
gus - 19 freq
queys - 1 freq
quoys - 24 freq
couse - 23 freq
cus - 130 freq
caise - 6 freq
coasee - 1 freq
gce - 4 freq
gass - 4 freq
guiy's - 1 freq
k's - 1 freq
casey - 7 freq
guise - 17 freq
csi - 3 freq
cow's - 2 freq
cozy - 4 freq
cosie - 22 freq
casie - 3 freq
gsoh - 2 freq
'gsoh' - 1 freq
gsoh' - 1 freq
caase - 29 freq
cozzie - 3 freq
coo's - 16 freq
gause - 4 freq
goss - 1 freq
gays - 1 freq
kïss - 3 freq
caause - 2 freq
gawsie - 4 freq
gaius - 1 freq
caa's - 1 freq
cass - 14 freq
keess - 10 freq
kaiys - 1 freq
coose - 8 freq
gös - 2 freq
causay - 2 freq
kay's - 2 freq
gos - 1 freq
kiz - 18 freq
kess - 5 freq
cas - 4 freq
queasy - 2 freq
ga's - 1 freq
gauzy - 1 freq
ccea - 1 freq
cce - 1 freq
cues - 3 freq
cs - 5 freq
kaas - 2 freq
cais - 1 freq
caess - 3 freq
cows' - 1 freq
gøs - 1 freq
kissie - 1 freq
c's - 1 freq
‘cozie - 1 freq
gaws - 1 freq
queazy - 1 freq
causie - 8 freq
“guiss - 1 freq
gauss - 2 freq
cace - 1 freq
kace - 2 freq
kies - 1 freq
cusa - 1 freq
’casey - 1 freq
goosey - 6 freq
kays - 1 freq
kehs - 1 freq
“cos - 1 freq
cous - 1 freq
caese - 2 freq
‘case - 1 freq
kezia - 5 freq
…cos - 1 freq
‘cause - 7 freq
“cause - 2 freq
caius - 3 freq
’cause - 2 freq
kso - 2 freq
guiss - 1 freq
ks - 3 freq
gows - 2 freq
‘cos - 4 freq
‘cus - 1 freq
czy - 1 freq
gz - 3 freq
cz - 5 freq
coo’s - 3 freq
qois - 1 freq
kz - 4 freq
khza - 1 freq
casa - 1 freq
gs - 3 freq
kaz - 3 freq
gaz - 1 freq
qcy - 1 freq
ckhhyys - 1 freq
gaza - 1 freq
hx - 5 freq
cossy - 1 freq
wx - 4 freq
qwz - 1 freq
ga’s - 1 freq
gazza - 3 freq
gsy - 1 freq
cos' - 1 freq
yx - 2 freq
hgggssss - 1 freq
wks - 1 freq
wxaw - 1 freq
kci - 1 freq
wxi - 1 freq
kazoo - 1 freq
kks - 1 freq
hhqs - 1 freq
ccyh - 1 freq
qzw - 1 freq
qs - 2 freq
güs - 1 freq
csw - 2 freq
hxi - 1 freq
qaz - 1 freq
qis - 1 freq
guz - 1 freq
czw - 1 freq
goz - 1 freq
kes - 1 freq
yxa - 1 freq
kiss' - 1 freq
gazzah - 3 freq
qz - 1 freq
csa - 1 freq
hhx - 1 freq
yxx - 1 freq
gassy - 1 freq
Manually curated - GUYS
guys - 464 freq
Time to execute Levenshtein function - 0.810445 milliseconds
The Levenshtein distance is the minimum number of single-character replacements, insertions or deletions needed to transform one word into another. It's useful for detecting typos and alternative spellings.
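As a rough illustration of the definition above, here is a minimal Python sketch of the classic dynamic-programming calculation. It is not the code this site runs; the function name `levenshtein` is just a label chosen for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits that turn a into b."""
    # previous[j] = distance between the prefix of a seen so far
    # (before the current character) and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning i characters of a into "" takes i deletions
        for j, cb in enumerate(b, start=1):
            current.append(min(
                previous[j] + 1,               # delete ca
                current[j - 1] + 1,            # insert cb
                previous[j - 1] + (ca != cb),  # replace, free if equal
            ))
        previous = current
    return previous[-1]


for word in ("guys", "gays", "guy", "gauns"):
    print(word, levenshtein("guys", word))
```

The distances it prints for these four words agree with the bracketed numbers in the Levenshtein list.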
Time to execute Double Levenshtein function - 1.299151 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
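The double pass can be sketched in Python as follows. This is a hedged reconstruction, not the site's own code: the names `double_levenshtein` and `strip_vowels` are made up for the example, and treating `y` as a vowel is an assumption inferred from the listed scores (for instance gus scoring 1 against guys).

```python
VOWELS = set("aeiouy")  # assumption: y is stripped along with a, e, i, o, u


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance (row-by-row dynamic programming)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + (ca != cb)))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Keep every character that is not in VOWELS."""
    return "".join(c for c in word if c.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


for word in ("gus", "goes", "buys", "guy's"):
    print(word, double_levenshtein("guys", word))
```

Under that vowel assumption, a vowel-only change such as guys to goes costs the same as in the plain measure, while a consonant change such as guys to buys is counted twice.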
Time to execute SoundEx function - 0.097887 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
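To show how a code such as G200 comes about, here is a small Python sketch of the common American Soundex rules: keep the first letter, map the remaining consonants to digits, collapse runs of the same digit, and pad to four characters. The site may use a different Soundex variant, so treat this as an illustration under that assumption; `soundex` is simply the name used here.

```python
# Consonant groups and their Soundex digits.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word: str) -> str:
    """Four-character Soundex code; empty string if there are no ASCII letters."""
    letters = [c for c in word.lower() if "a" <= c <= "z"]
    if not letters:
        return ""
    result = letters[0].upper()
    last = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not break a run of equal codes
        code = CODES.get(ch, "")  # vowels and y have no code
        if code and code != last:
            result += code
        last = code  # a vowel resets the run, so a repeated code counts again
        if len(result) == 4:
            break
    return result.ljust(4, "0")


for word in ("guys", "goes", "gig", "gowk"):
    print(word, soundex(word))
```

All four sample words come out as G200, which is why they share a bucket in the SoundEx list.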
Time to execute MetaPhone function - 0.191309 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
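The full Metaphone rule set is too long to reproduce here, but the idea of a pronunciation-aware key can be shown with a deliberately tiny subset of Metaphone-style rules. The Python sketch below is a toy, not Metaphone itself and not the site's code; `toy_phonetic_key` and the handful of rules it applies are assumptions chosen only to show why words like guys, case and quiz can share the key KS.

```python
def toy_phonetic_key(word: str) -> str:
    """Toy key built from a few Metaphone-like rules (NOT full Metaphone)."""
    letters = [c for c in word.lower() if "a" <= c <= "z"]
    key = []
    for i, ch in enumerate(letters):
        nxt = letters[i + 1] if i + 1 < len(letters) else ""
        if i > 0 and ch == letters[i - 1] and ch != "c":
            continue  # doubled letters count once (except c)
        if ch in "aeiou":
            if i == 0:
                key.append(ch.upper())  # vowels are kept only word-initially
        elif ch in "hwy":
            continue  # treated as silent in this toy version
        elif ch == "c":
            # soft c before e, i, y sounds like s; otherwise like k
            key.append("S" if nxt in ("e", "i", "y") else "K")
        elif ch == "g":
            # soft g before e, i, y sounds like j; otherwise like k
            key.append("J" if nxt in ("e", "i", "y") else "K")
        elif ch in "kq":
            key.append("K")
        elif ch in "sz":
            key.append("S")
        elif ch == "x":
            key.append("KS")
        else:
            key.append(ch.upper())
    return "".join(key)


for word in ("guys", "case", "quiz", "cosy", "gies"):
    print(word, toy_phonetic_key(word))
```

Unlike Soundex, the key does not preserve the first letter, so words beginning with c, k, q and hard g can land together, while a soft g as in gies gets a different key.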
Time to execute Manually curated function - 0.000885 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.
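A lookup of this kind can be sketched in Python as a plain dictionary from word form to lemma. The table contents, the names `LEXICON`, `lemma_for` and `variants_of`, and the data layout are all hypothetical; only the guys to GUYS pairing is taken from the results shown on this page.

```python
from typing import List, Optional

# Hypothetical hand-built lexicon: word form -> lemma.
# Only this one pairing appears on the page; a real table would be far larger.
LEXICON = {
    "guys": "GUYS",
}


def lemma_for(word: str) -> Optional[str]:
    """Return the curated lemma, or None when the word is not covered."""
    return LEXICON.get(word.lower())


def variants_of(lemma: str) -> List[str]:
    """All word forms that the lexicon links to the given lemma."""
    return sorted(w for w, l in LEXICON.items() if l == lemma)


print(lemma_for("guys"))    # covered word
print(lemma_for("ablow"))   # not in this toy table, so None
print(variants_of("GUYS"))
```

Because it is a single dictionary lookup with no string comparison, this approach is much cheaper than the distance-based methods, at the cost of only covering words someone has entered by hand.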