A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to gean in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
gean (0) - 10 freq
gead (1) - 2 freq
guan (1) - 7 freq
gea (1) - 1 freq
goan (1) - 64 freq
gaean (1) - 2 freq
wean (1) - 415 freq
gear (1) - 228 freq
bean (1) - 41 freq
gyan (1) - 104 freq
gern (1) - 1 freq
sean (1) - 16 freq
geian (1) - 1 freq
dean (1) - 18 freq
getan (1) - 1 freq
mean (1) - 939 freq
gan (1) - 768 freq
gen (1) - 11 freq
geyan (1) - 17 freq
giean (1) - 4 freq
jean (1) - 167 freq
glean (1) - 4 freq
gran (1) - 232 freq
gaan (1) - 244 freq
ean (1) - 46 freq
gean (0) - 10 freq
gen (1) - 11 freq
gan (1) - 768 freq
geian (1) - 1 freq
geyan (1) - 17 freq
gaan (1) - 244 freq
gein (1) - 37 freq
geen (1) - 77 freq
gyan (1) - 104 freq
giean (1) - 4 freq
guan (1) - 7 freq
gaean (1) - 2 freq
goan (1) - 64 freq
goen (2) - 1 freq
guun (2) - 1 freq
geyin (2) - 1 freq
gaen (2) - 190 freq
goane (2) - 1 freq
ageen (2) - 111 freq
goon (2) - 33 freq
gyeen (2) - 1 freq
aegean (2) - 4 freq
gane (2) - 267 freq
gene (2) - 2 freq
agein (2) - 13 freq
SoundEx code - G500
gaun - 1849 freq
gonnae - 588 freq
giein - 436 freq
gone - 282 freq
gin - 1982 freq
gan - 768 freq
gane - 267 freq
gain - 53 freq
gaen - 190 freq
goon - 33 freq
gien - 1014 freq
gaan - 244 freq
gun - 80 freq
game - 642 freq
gamie - 12 freq
gauin - 46 freq
gemme - 102 freq
gawin - 25 freq
gein - 37 freq
goan - 64 freq
'gin - 37 freq
gaein - 147 freq
guano - 1 freq
'giein - 3 freq
gonae - 38 freq
geein - 42 freq
gawn - 34 freq
goun - 23 freq
gaunie - 6 freq
goin - 248 freq
gean - 10 freq
'gawn - 2 freq
gem - 23 freq
gonna - 129 freq
'gonnae - 9 freq
gie'in - 2 freq
gi'en - 7 freq
gowan - 16 freq
gemm - 53 freq
gzemm - 1 freq
'gaun - 14 freq
gyan - 104 freq
gaunae - 39 freq
'gaunae - 1 freq
goonie - 14 freq
gym - 40 freq
gum - 18 freq
genie - 7 freq
gammy - 8 freq
geen - 77 freq
gaean - 2 freq
gaain - 2 freq
gammie - 4 freq
gounie - 7 freq
goin' - 5 freq
guinea - 6 freq
giein' - 4 freq
gaun' - 2 freq
goween - 1 freq
goen - 1 freq
gyaan - 83 freq
gunn - 2 freq
gonny - 17 freq
'gome - 1 freq
gan' - 8 freq
gannaway - 1 freq
ga'en - 1 freq
geem - 5 freq
gene - 2 freq
ga'an - 6 freq
g'in - 6 freq
g'on - 10 freq
gen - 11 freq
ginnae - 1 freq
gaunnae - 135 freq
gimme - 4 freq
gaunna - 3 freq
gemma - 18 freq
gam - 2 freq
guan - 7 freq
ga''en - 1 freq
gn - 8 freq
gi'n - 3 freq
gaime - 1 freq
gin' - 1 freq
geyan - 17 freq
gaun'ae - 28 freq
gehenna - 1 freq
ghana - 1 freq
gyaain - 85 freq
'gn' - 1 freq
gie'im - 1 freq
giean - 4 freq
gie'n - 2 freq
gunnie - 16 freq
gaem - 1 freq
gonnae' - 1 freq
gown - 2 freq
gummy - 6 freq
giem - 1 freq
gjaain - 2 freq
gm - 10 freq
ganie - 6 freq
gonnie - 1 freq
geian - 1 freq
geyin - 1 freq
go-an - 1 freq
gauny - 1 freq
goom - 1 freq
gome - 1 freq
genoa - 1 freq
guiami - 1 freq
g'oan - 1 freq
gwyne - 1 freq
gie-in - 2 freq
'gemma - 1 freq
'gane - 1 freq
'gone - 1 freq
€˜gien - 2 freq
€˜gin - 4 freq
€œgien - 1 freq
go'n - 8 freq
goooo'n - 1 freq
gaunny - 7 freq
gnaw - 6 freq
gyaun - 10 freq
€¦gin - 1 freq
gune - 1 freq
€œgin - 25 freq
gme - 45 freq
g-g-gaun - 1 freq
g-g-g-gaun - 1 freq
gamma - 9 freq
€˜gonnae - 2 freq
gey-an - 1 freq
gonaae - 1 freq
gon - 6 freq
€œgan - 1 freq
gawain - 1 freq
€˜gaun - 2 freq
goane - 1 freq
'gonnae' - 1 freq
€œgoin - 1 freq
gnu - 1 freq
gony - 2 freq
€¦gon - 1 freq
goony - 1 freq
gein' - 1 freq
gim - 1 freq
guun - 1 freq
gyeen - 1 freq
€˜gauuuun - 1 freq
goanna - 1 freq
gae'an - 1 freq
gonnay - 1 freq
gaÂ’an - 1 freq
gaen' - 1 freq
gena - 1 freq
“gon - 1 freq
ggzkun - 1 freq
geeeeeeoannnnn - 1 freq
ganna - 8 freq
gscm - 1 freq
goona - 1 freq
gnh - 1 freq
goinÂ’ - 2 freq
gino - 1 freq
genny - 1 freq
gÂ’on - 2 freq
gaunÂ’ae - 3 freq
gwennie - 1 freq
gkam - 1 freq
gemÂ’ - 1 freq
game” - 1 freq
“gone - 1 freq
gaim - 1 freq
gauna - 1 freq
gien” - 2 freq
gjaan - 1 freq
gahn - 1 freq
gioni - 1 freq
gehmn - 1 freq
ggeanh - 1 freq
game' - 1 freq
MetaPhone code - JN
giein - 436 freq
gin - 1982 freq
gien - 1014 freq
jyne - 115 freq
jeannie - 91 freq
'jonah' - 1 freq
john - 802 freq
gein - 37 freq
join - 123 freq
'gin - 37 freq
june - 101 freq
jean - 167 freq
jan - 25 freq
'giein - 3 freq
jeanie - 34 freq
johnnie - 28 freq
geein - 42 freq
jona - 1 freq
juin - 25 freq
gean - 10 freq
jine - 37 freq
gie'in - 2 freq
gi'en - 7 freq
'jean - 4 freq
johnny - 111 freq
'johnny - 1 freq
jenny - 175 freq
joana - 1 freq
jane - 50 freq
jannie - 10 freq
jonah - 30 freq
jenna - 5 freq
genie - 7 freq
joan - 30 freq
hygiene - 8 freq
geen - 77 freq
'john - 6 freq
'jenny - 1 freq
jon - 4 freq
giein' - 4 freq
jeenie - 1 freq
jeyn - 1 freq
jn - 11 freq
jinny - 28 freq
gene - 2 freq
gen - 11 freq
ginnae - 1 freq
jain - 1 freq
jun - 7 freq
jennie - 7 freq
jannai - 3 freq
gi'n - 3 freq
joan' - 1 freq
gin' - 1 freq
juno - 2 freq
joanna - 7 freq
jön - 1 freq
giean - 4 freq
gie'n - 2 freq
jayne - 2 freq
'jeannie - 2 freq
jaanie - 1 freq
johnie - 9 freq
geian - 1 freq
jonny - 5 freq
janny - 22 freq
genoa - 1 freq
johnne - 1 freq
johne - 1 freq
gie-in - 2 freq
jen - 6 freq
joannie - 1 freq
€˜gien - 2 freq
€˜gin - 4 freq
joanie - 1 freq
€œgien - 1 freq
€¦gin - 1 freq
€œgin - 25 freq
€˜john - 2 freq
jonnie - 1 freq
jen- - 1 freq
gey-an - 1 freq
joni - 3 freq
joahny - 13 freq
joahnny - 3 freq
€˜jenny - 2 freq
jaune' - 1 freq
€œjonnie - 1 freq
gein' - 1 freq
€œjohnny - 1 freq
€œjohn - 2 freq
jeanne - 1 freq
gena - 1 freq
jenni - 1 freq
janey - 10 freq
geeeeeeoannnnn - 1 freq
jnw - 1 freq
gino - 1 freq
genny - 1 freq
janie - 1 freq
johni - 1 freq
jeane - 1 freq
gien” - 2 freq
jynuuh - 1 freq
gioni - 1 freq
joanne - 1 freq
jin - 1 freq
jaan - 1 freq
johnneh - 6 freq
GEAN
Time to execute Levenshtein function - 0.173280 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.334084 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027627 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.036582 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000817 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.