A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to guns in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
guns (0) - 71 freq
guds (1) - 1 freq
gans (1) - 51 freq
duns (1) - 2 freq
gruns (1) - 7 freq
gubs (1) - 1 freq
nuns (1) - 8 freq
guts (1) - 73 freq
gun's (1) - 1 freq
gune (1) - 1 freq
suns (1) - 15 freq
wuns (1) - 3 freq
gunes (1) - 2 freq
guns' (1) - 1 freq
gunn (1) - 2 freq
gauns (1) - 30 freq
gunns (1) - 2 freq
gunk (1) - 9 freq
guans (1) - 1 freq
yuns (1) - 1 freq
gurs (1) - 1 freq
gurns (1) - 2 freq
gouns (1) - 12 freq
cuns (1) - 1 freq
funs (1) - 7 freq
guns (0) - 71 freq
gauns (1) - 30 freq
gouns (1) - 12 freq
gunes (1) - 2 freq
guans (1) - 1 freq
gans (1) - 51 freq
runs (2) - 72 freq
gun (2) - 80 freq
puns (2) - 4 freq
guvs (2) - 1 freq
buns (2) - 22 freq
gums (2) - 16 freq
guys (2) - 464 freq
goins (2) - 2 freq
genus (2) - 2 freq
gaens (2) - 1 freq
ginos (2) - 1 freq
geens (2) - 1 freq
goons (2) - 9 freq
genes (2) - 10 freq
funs (2) - 7 freq
gains (2) - 8 freq
geans (2) - 2 freq
gus (2) - 19 freq
huns (2) - 10 freq
SoundEx code - G520
gang - 1098 freq
going - 234 freq
gangs - 147 freq
ging - 343 freq
gamie's - 2 freq
gyang - 177 freq
gums - 16 freq
gemme's - 5 freq
gemmes - 74 freq
guns - 71 freq
gans - 51 freq
goams - 2 freq
gansey - 17 freq
gings - 75 freq
genyus - 1 freq
genius - 31 freq
games - 247 freq
'gang - 8 freq
gunk - 9 freq
gemm's - 2 freq
gemm-as - 1 freq
gemms - 14 freq
gowns - 2 freq
gangwey - 6 freq
gowans - 22 freq
gyaang - 1 freq
genes - 10 freq
guinness - 9 freq
gangie - 3 freq
gumsy - 5 freq
gyangs - 8 freq
game's - 8 freq
gunns - 2 freq
gink - 1 freq
genie's - 1 freq
'ging - 4 freq
gams - 1 freq
ganesh - 6 freq
gummies - 2 freq
geang - 3 freq
geeang - 1 freq
guineas - 4 freq
gooms - 4 freq
gun's - 1 freq
guns' - 1 freq
goons - 9 freq
gomach - 2 freq
gonnag - 1 freq
gems - 13 freq
gauns - 30 freq
geems - 3 freq
games' - 2 freq
'games - 2 freq
gainsay - 2 freq
gyms - 12 freq
gnough - 1 freq
gamies - 11 freq
'gamies' - 1 freq
ganues - 1 freq
genus - 2 freq
gayness - 1 freq
gansh - 12 freq
gangway - 1 freq
'gyang - 1 freq
gunge - 3 freq
gonks - 1 freq
geng - 131 freq
gengs - 15 freq
'geng - 1 freq
'guinness - 1 freq
geung - 5 freq
geungs - 1 freq
gunnie's - 8 freq
goonies - 3 freq
gyings - 3 freq
gying - 38 freq
gjing - 2 freq
gonzo - 1 freq
ganzie - 14 freq
gaunch - 2 freq
gansie - 3 freq
gouns - 12 freq
gyaains - 1 freq
gaeng - 10 freq
ginge - 3 freq
gien's - 1 freq
geenyoch - 1 freq
geeng - 3 freq
genius's' - 1 freq
gyung - 1 freq
gong - 1 freq
gains - 8 freq
gaens - 1 freq
gemma's - 2 freq
€˜geng - 1 freq
gingie - 1 freq
gunes - 2 freq
gmse - 1 freq
gemmies - 1 freq
geeing - 1 freq
gaeins - 1 freq
goins - 2 freq
€˜gang - 1 freq
gimmick - 2 freq
guangwu - 6 freq
€œging - 5 freq
geing - 1 freq
€˜guinness - 1 freq
€œgeng - 1 freq
gaung - 5 freq
gmx - 1 freq
guans - 1 freq
gonkie - 1 freq
gmoko - 1 freq
ginos - 1 freq
gunk” - 1 freq
‘gonc’ - 1 freq
gnsg - 1 freq
gmac - 8 freq
guiness - 2 freq
gwennej - 1 freq
ghini's - 1 freq
geans - 2 freq
gmc - 1 freq
guangxi - 1 freq
ggimq - 1 freq
geens - 1 freq
gange - 3 freq
goineasy - 1 freq
gomez - 1 freq
gnc - 1 freq
gemsy - 1 freq
'going - 1 freq
MetaPhone code - KNS
kens - 531 freq
coins - 50 freq
queen's - 52 freq
queens - 37 freq
guns - 71 freq
quine's - 18 freq
quines - 155 freq
gans - 51 freq
gansey - 17 freq
quines' - 2 freq
queyn's - 2 freq
kenzie - 49 freq
'kenzie - 2 freq
cans - 45 freq
gowns - 2 freq
guinness - 9 freq
conn's - 4 freq
kynes - 11 freq
ken's - 6 freq
gunns - 2 freq
caw-ins - 1 freq
cons - 4 freq
cones - 2 freq
kins - 15 freq
connie's - 2 freq
quinie's - 12 freq
queans - 5 freq
guineas - 4 freq
queanie's - 1 freq
kines - 21 freq
quinies - 7 freq
gun's - 1 freq
guns' - 1 freq
goons - 9 freq
gauns - 30 freq
can's - 1 freq
gainsay - 2 freq
ganues - 1 freq
gayness - 1 freq
canes - 2 freq
cannes - 1 freq
'guinness - 1 freq
cannas - 2 freq
gunnie's - 8 freq
goonies - 3 freq
gonzo - 1 freq
ganzie - 14 freq
kenny's - 1 freq
cuns - 1 freq
gansie - 3 freq
gouns - 12 freq
quince - 2 freq
kyns - 2 freq
kynness - 1 freq
cain's - 1 freq
gains - 8 freq
con's - 3 freq
gaens - 1 freq
kaens - 2 freq
queenis - 1 freq
conns - 1 freq
gunes - 2 freq
canns - 1 freq
coneys - 1 freq
kains - 1 freq
gaeins - 1 freq
canus - 1 freq
goins - 2 freq
keenness - 1 freq
quynes - 1 freq
€˜guinness - 1 freq
quineÂ’s - 1 freq
guans - 1 freq
quins - 1 freq
consi - 8 freq
cains - 1 freq
guiness - 2 freq
kanes - 2 freq
queenies - 1 freq
kenÂ’s - 1 freq
goineasy - 1 freq
GUNS
Time to execute Levenshtein function - 0.179213 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.352889 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.029156 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037325 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000889 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.