A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie


Similar words to gems in Corpus

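Results are listed by method in this order: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated. Numbers in brackets are distances from the search word.
Levenshtein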
gems (0) - 14 freq
gemsy (1) - 1 freq
tems (1) - 1 freq
geis (1) - 8 freq
gers (1) - 8 freq
geds (1) - 1 freq
gels (1) - 1 freq
dems (1) - 6 freq
germs (1) - 4 freq
gums (1) - 16 freq
gegs (1) - 1 freq
gees (1) - 41 freq
nems (1) - 20 freq
gess (1) - 1 freq
gem (1) - 24 freq
gyms (1) - 12 freq
geems (1) - 3 freq
gams (1) - 1 freq
gemm (1) - 53 freq
gebs (1) - 2 freq
ges (1) - 1 freq
geos (1) - 2 freq
gets (1) - 432 freq
gemms (1) - 14 freq
ems (1) - 4 freq
Double Levenshtein
gems (0) - 14 freq
gums (1) - 16 freq
gemsy (1) - 1 freq
gyms (1) - 12 freq
gams (1) - 1 freq
geems (1) - 3 freq
gets (2) - 432 freq
gebs (2) - 2 freq
gemms (2) - 14 freq
ges (2) - 1 freq
hems (2) - 8 freq
gmse (2) - 1 freq
gooms (2) - 4 freq
goams (2) - 2 freq
gumsy (2) - 5 freq
gemm (2) - 53 freq
games (2) - 248 freq
ems (2) - 4 freq
geos (2) - 2 freq
gem (2) - 24 freq
gels (2) - 1 freq
gers (2) - 8 freq
geis (2) - 8 freq
tems (2) - 1 freq
dems (2) - 6 freq
SoundEx code - G520
gang - 1111 freq
going - 248 freq
gangs - 150 freq
ging - 347 freq
gamie's - 2 freq
gyang - 177 freq
gums - 16 freq
gemme's - 5 freq
gemmes - 74 freq
guns - 73 freq
gans - 51 freq
goams - 2 freq
gansey - 17 freq
gings - 75 freq
genyus - 1 freq
genius - 31 freq
games - 248 freq
'gang - 8 freq
gunk - 9 freq
gemm's - 2 freq
gemm-as - 1 freq
gemms - 14 freq
gowns - 2 freq
gangwey - 6 freq
gowans - 22 freq
gyaang - 1 freq
genes - 11 freq
guinness - 16 freq
gangie - 3 freq
gumsy - 5 freq
gyangs - 8 freq
game's - 8 freq
gunns - 2 freq
gink - 1 freq
genie's - 2 freq
'ging - 4 freq
gams - 1 freq
ganesh - 6 freq
gummies - 2 freq
geang - 3 freq
geeang - 1 freq
guineas - 4 freq
gems - 14 freq
gooms - 4 freq
gun's - 1 freq
guns' - 1 freq
goons - 9 freq
gomach - 2 freq
gonnag - 1 freq
gauns - 30 freq
geems - 3 freq
games' - 2 freq
'games - 2 freq
gainsay - 2 freq
gyms - 12 freq
gnough - 1 freq
gamies - 11 freq
'gamies' - 1 freq
ganues - 1 freq
genus - 2 freq
gayness - 1 freq
gansh - 12 freq
gangway - 1 freq
'gyang - 1 freq
gunge - 3 freq
gonks - 1 freq
geng - 131 freq
gengs - 15 freq
'geng - 1 freq
'guinness - 1 freq
geung - 5 freq
geungs - 1 freq
gunnie's - 8 freq
goonies - 3 freq
gyings - 3 freq
gying - 38 freq
gjing - 2 freq
gonzo - 1 freq
ganzie - 14 freq
gaunch - 2 freq
gansie - 3 freq
gouns - 12 freq
gyaains - 1 freq
gaeng - 10 freq
ginge - 3 freq
gien's - 1 freq
geenyoch - 1 freq
geeng - 3 freq
genius's' - 1 freq
gyung - 1 freq
gong - 1 freq
gains - 8 freq
gaens - 1 freq
gemma's - 2 freq
‘geng - 1 freq
gingie - 1 freq
gunes - 2 freq
gmse - 1 freq
gemmies - 1 freq
geeing - 1 freq
gaeins - 1 freq
goins - 2 freq
‘gang - 1 freq
gimmick - 2 freq
guangwu - 6 freq
“ging - 5 freq
geing - 1 freq
‘guinness - 1 freq
“geng - 1 freq
gaung - 5 freq
gmx - 1 freq
guans - 1 freq
gonkie - 1 freq
gmoko - 1 freq
ginos - 1 freq
gunk” - 1 freq
‘gonc’ - 1 freq
gnsg - 1 freq
gmac - 8 freq
guiness - 2 freq
gwennej - 1 freq
ghini's - 1 freq
geans - 2 freq
gmc - 1 freq
guangxi - 1 freq
ggimq - 1 freq
geens - 1 freq
gange - 3 freq
goineasy - 1 freq
gomez - 1 freq
gnc - 1 freq
gemsy - 1 freq
'going - 1 freq
MetaPhone code - JMS
gemme's - 5 freq
gemmes - 74 freq
james - 283 freq
jamie's - 10 freq
'james - 2 freq
jaimes - 4 freq
jamesy - 25 freq
gemm's - 2 freq
gemm-as - 1 freq
gemms - 14 freq
jimmy's - 8 freq
jeemo's - 2 freq
jamesie - 2 freq
gems - 14 freq
jim's - 7 freq
james' - 2 freq
geems - 3 freq
jeem's - 1 freq
gyms - 12 freq
jeames - 32 freq
jammies - 14 freq
jims' - 1 freq
'jamie's - 1 freq
jeems - 91 freq
jeemie's - 1 freq
jeemsie - 22 freq
jams - 3 freq
jambs - 1 freq
jimsie - 2 freq
gemma's - 2 freq
jamsie - 3 freq
jims - 2 freq
jumbos - 3 freq
gemmies - 1 freq
jimmies - 7 freq
“jeemsie - 1 freq
jimmie's - 1 freq
jambos - 3 freq
“james - 1 freq
jmz - 1 freq
jimmys - 1 freq
gemsy - 1 freq
Manually curated - GEMS
Time to execute Levenshtein function - 0.204137 milliseconds
The Levenshtein distance is the minimum number of single-character replacements, insertions or deletions needed to transform one word into another. It's useful for detecting typos and alternative spellings.
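As an illustration of the definition above, here is a minimal Python sketch of the usual dynamic-programming approach. The function name `levenshtein` is chosen here for illustration; this is not the code that runs on this site.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance counting replacements, insertions and deletions."""
    # previous[j] = distance between the processed prefix of a and b[:j]
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning a[:i] into "" takes i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb, or keep a match
            ))
        previous = current
    return previous[-1]


for other in ("gems", "gums", "germs", "games"):
    print(other, levenshtein("gems", other))
```

Words such as gums and germs come out at distance 1 from gems, which agrees with the Levenshtein column in the results.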
Time to execute Double Levenshtein function - 0.386081 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
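The idea can be sketched in Python as follows. The names `strip_vowels` and `double_levenshtein` are hypothetical, and treating y as a vowel is an assumption inferred from the listed scores (for example gemsy and gyms both score 1); the site's actual vowel set is not stated.

```python
VOWELS = set("aeiouy")  # assumption: y is treated as a vowel


def levenshtein(a: str, b: str) -> int:
    """Plain edit distance (replace, insert, delete)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop every vowel, leaving the consonant skeleton."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Full-word distance plus the distance between consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


for other in ("gums", "gets", "games", "gmse"):
    print(other, double_levenshtein("gems", other))
```

A vowel change such as gems to gums costs 1 because the consonant skeletons are identical, while a consonant change such as gems to gets is counted in both passes and costs 2.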
Time to execute SoundEx function - 0.029113 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
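For readers who want to see how such a code is built, below is a minimal Python sketch of the standard American Soundex rules: keep the first letter, map the remaining consonants to digits, collapse adjacent repeats, and pad to four characters. Whether the site's SoundEx function follows exactly these rules is an assumption, though the sketch does reproduce the G520 code shown in the results.

```python
# Digit groups used by standard American Soundex.
SOUNDEX_CODES = {}
for digit, letters in {"1": "bfpv", "2": "cgjkqsxz", "3": "dt",
                       "4": "l", "5": "mn", "6": "r"}.items():
    for letter in letters:
        SOUNDEX_CODES[letter] = digit


def soundex(word: str) -> str:
    """Return the four-character Soundex code, or "" if there are no letters."""
    # Ignore apostrophes, hyphens and other non-ASCII-letter characters.
    letters = [ch for ch in word.lower() if ch.isascii() and ch.isalpha()]
    if not letters:
        return ""
    code = letters[0].upper()
    previous = SOUNDEX_CODES.get(letters[0], "")
    for ch in letters[1:]:
        digit = SOUNDEX_CODES.get(ch, "")
        if digit and digit != previous:
            code += digit
        if ch not in "hw":  # h and w do not separate repeated codes
            previous = digit
        if len(code) == 4:
            break
    return code.ljust(4, "0")


for word in ("gems", "gang", "guinness", "gimmick"):
    print(word, soundex(word))
```

Because only the first letter and up to three consonant digits survive, quite different words such as gang, guinness and gimmick share a code with gems, which is why the SoundEx list is so long.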
Time to execute MetaPhone function - 0.037997 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.000929 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
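A lookup of this kind can be sketched as a plain dictionary in Python. The `LEXICON` entries and the `curated_lemma` name below are hypothetical placeholders for illustration; they are not taken from the real lexicon.

```python
from typing import Optional

# Hypothetical entries for illustration only: surface form -> lemma.
LEXICON = {
    "gems": "GEMS",
    "gemz": "GEMS",  # made-up example of a typo mapped to the same lemma
}


def curated_lemma(word: str) -> Optional[str]:
    """Return the curated lemma, or None when the word is not covered."""
    return LEXICON.get(word.lower())


print(curated_lemma("gems"))
print(curated_lemma("sonsie"))  # not in this toy table, so None
```

A single table lookup does no per-word computation, which fits with this method having the shortest execution time listed above.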