A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

Similar words to guns in Corpus

Methods: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated
Levenshtein distance
guns (0) - 73 freq
gune (1) - 1 freq
gus (1) - 21 freq
guds (1) - 1 freq
gauns (1) - 30 freq
gunn (1) - 2 freq
suns (1) - 15 freq
guns' (1) - 1 freq
gunes (1) - 2 freq
yuns (1) - 1 freq
puns (1) - 4 freq
gurns (1) - 2 freq
gun (1) - 81 freq
gubs (1) - 1 freq
guans (1) - 1 freq
nuns (1) - 8 freq
gunns (1) - 2 freq
huns (1) - 10 freq
gun's (1) - 1 freq
guys (1) - 472 freq
guvs (1) - 1 freq
gurs (1) - 1 freq
runs (1) - 75 freq
wuns (1) - 3 freq
gans (1) - 51 freq
Double Levenshtein distance
guns (0) - 73 freq
guans (1) - 1 freq
gans (1) - 51 freq
gunes (1) - 2 freq
gouns (1) - 12 freq
gauns (1) - 30 freq
cuns (2) - 1 freq
duns (2) - 2 freq
ginos (2) - 1 freq
gums (2) - 16 freq
gunn (2) - 2 freq
guts (2) - 73 freq
funs (2) - 7 freq
geans (2) - 2 freq
gruns (2) - 7 freq
gaens (2) - 1 freq
goons (2) - 9 freq
gains (2) - 8 freq
gune (2) - 1 freq
genes (2) - 11 freq
geens (2) - 1 freq
genus (2) - 2 freq
goins (2) - 2 freq
gunk (2) - 9 freq
buns (2) - 22 freq
SoundEx code - G520
gang - 1111 freq
going - 248 freq
gangs - 150 freq
ging - 347 freq
gamie's - 2 freq
gyang - 177 freq
gums - 16 freq
gemme's - 5 freq
gemmes - 74 freq
guns - 73 freq
gans - 51 freq
goams - 2 freq
gansey - 17 freq
gings - 75 freq
genyus - 1 freq
genius - 31 freq
games - 248 freq
'gang - 8 freq
gunk - 9 freq
gemm's - 2 freq
gemm-as - 1 freq
gemms - 14 freq
gowns - 2 freq
gangwey - 6 freq
gowans - 22 freq
gyaang - 1 freq
genes - 11 freq
guinness - 16 freq
gangie - 3 freq
gumsy - 5 freq
gyangs - 8 freq
game's - 8 freq
gunns - 2 freq
gink - 1 freq
genie's - 2 freq
'ging - 4 freq
gams - 1 freq
ganesh - 6 freq
gummies - 2 freq
geang - 3 freq
geeang - 1 freq
guineas - 4 freq
gems - 14 freq
gooms - 4 freq
gun's - 1 freq
guns' - 1 freq
goons - 9 freq
gomach - 2 freq
gonnag - 1 freq
gauns - 30 freq
geems - 3 freq
games' - 2 freq
'games - 2 freq
gainsay - 2 freq
gyms - 12 freq
gnough - 1 freq
gamies - 11 freq
'gamies' - 1 freq
ganues - 1 freq
genus - 2 freq
gayness - 1 freq
gansh - 12 freq
gangway - 1 freq
'gyang - 1 freq
gunge - 3 freq
gonks - 1 freq
geng - 131 freq
gengs - 15 freq
'geng - 1 freq
'guinness - 1 freq
geung - 5 freq
geungs - 1 freq
gunnie's - 8 freq
goonies - 3 freq
gyings - 3 freq
gying - 38 freq
gjing - 2 freq
gonzo - 1 freq
ganzie - 14 freq
gaunch - 2 freq
gansie - 3 freq
gouns - 12 freq
gyaains - 1 freq
gaeng - 10 freq
ginge - 3 freq
gien's - 1 freq
geenyoch - 1 freq
geeng - 3 freq
genius's' - 1 freq
gyung - 1 freq
gong - 1 freq
gains - 8 freq
gaens - 1 freq
gemma's - 2 freq
‘geng - 1 freq
gingie - 1 freq
gunes - 2 freq
gmse - 1 freq
gemmies - 1 freq
geeing - 1 freq
gaeins - 1 freq
goins - 2 freq
‘gang - 1 freq
gimmick - 2 freq
guangwu - 6 freq
“ging - 5 freq
geing - 1 freq
‘guinness - 1 freq
“geng - 1 freq
gaung - 5 freq
gmx - 1 freq
guans - 1 freq
gonkie - 1 freq
gmoko - 1 freq
ginos - 1 freq
gunk” - 1 freq
‘gonc’ - 1 freq
gnsg - 1 freq
gmac - 8 freq
guiness - 2 freq
gwennej - 1 freq
ghini's - 1 freq
geans - 2 freq
gmc - 1 freq
guangxi - 1 freq
ggimq - 1 freq
geens - 1 freq
gange - 3 freq
goineasy - 1 freq
gomez - 1 freq
gnc - 1 freq
gemsy - 1 freq
'going - 1 freq
MetaPhone code - KNS
kens - 532 freq
coins - 52 freq
queen's - 55 freq
queens - 40 freq
guns - 73 freq
quine's - 18 freq
quines - 156 freq
gans - 51 freq
gansey - 17 freq
quines' - 2 freq
queyn's - 2 freq
kenzie - 49 freq
'kenzie - 2 freq
cans - 55 freq
gowns - 2 freq
guinness - 16 freq
conn's - 4 freq
kynes - 11 freq
ken's - 6 freq
gunns - 2 freq
caw-ins - 1 freq
cons - 4 freq
cones - 3 freq
kins - 15 freq
connie's - 2 freq
quinie's - 12 freq
queans - 5 freq
guineas - 4 freq
queanie's - 1 freq
kines - 21 freq
quinies - 7 freq
gun's - 1 freq
guns' - 1 freq
goons - 9 freq
gauns - 30 freq
can's - 1 freq
gainsay - 2 freq
ganues - 1 freq
gayness - 1 freq
canes - 2 freq
cannes - 1 freq
'guinness - 1 freq
cannas - 2 freq
gunnie's - 8 freq
goonies - 3 freq
gonzo - 1 freq
ganzie - 14 freq
kenny's - 1 freq
cuns - 1 freq
gansie - 3 freq
gouns - 12 freq
quince - 2 freq
kyns - 2 freq
kynness - 1 freq
cain's - 1 freq
gains - 8 freq
con's - 3 freq
gaens - 1 freq
kaens - 2 freq
queenis - 1 freq
conns - 1 freq
gunes - 2 freq
canns - 1 freq
coneys - 1 freq
kains - 1 freq
gaeins - 1 freq
canus - 1 freq
goins - 2 freq
keenness - 1 freq
quynes - 1 freq
‘guinness - 1 freq
quine’s - 1 freq
guans - 1 freq
quins - 1 freq
consi - 8 freq
cains - 1 freq
guiness - 2 freq
kanes - 2 freq
queenies - 1 freq
ken’s - 1 freq
goineasy - 1 freq
Manually curated - GUNS
Time to execute Levenshtein function - 0.300524 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It is useful for detecting typos and alternative spellings.
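As a rough illustration of the distance described above, here is a minimal Python sketch of the standard dynamic-programming calculation. It is not the code this page runs; the function name `levenshtein` is simply a label chosen for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Return the minimum number of single-character edits turning a into b."""
    # previous[j] is the distance between the characters of a seen so far
    # (before the current one) and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning i characters of a into "" takes i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb, or keep it
            ))
        previous = current
    return previous[-1]


print(levenshtein("guns", "guns"))   # 0
print(levenshtein("guns", "gauns"))  # 1 (one insertion)
print(levenshtein("guns", "guys"))   # 1 (one replacement)
```

Only two rows of the table are kept at a time, which is enough because each cell depends only on the row above and the cell to its left.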
Time to execute Double Levenshtein function - 0.527135 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
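The idea can be sketched in Python as follows. This is an assumption-laden reading of the description, not the site's own code: the names `strip_vowels` and `double_levenshtein` are invented for the example, and treating only a, e, i, o, u as vowels (not y) is a guess. With those assumptions the sketch reproduces the scores listed above for guns, such as gans (1) and gunn (2).

```python
VOWELS = set("aeiou")  # assumption: y is not counted as a vowel


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance, computed row by row."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop vowels so that only the consonant skeleton remains."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Full-word distance plus the distance between consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("guns", "gans"))  # 1: a vowel change costs once
print(double_levenshtein("guns", "gunn"))  # 2: a consonant change costs twice
```

A vowel substitution leaves the consonant skeleton unchanged, so it is counted once, while a consonant substitution shows up in both passes and is counted twice.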
Time to execute SoundEx function - 0.063721 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
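To show how words as different as guns, gang and guinness end up under the same code, here is a minimal Python sketch of the classic American Soundex rules. It is an illustration only: the function name `soundex` and the choice to ignore non-letter characters such as apostrophes are assumptions of the example, and the implementation behind this page may differ in detail.

```python
# Consonant groups and their Soundex digits; vowels, h, w and y have no digit.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word: str) -> str:
    """Return the four-character Soundex code, or "" if there are no letters."""
    # Assumption: anything that is not a-z (apostrophes, quotes) is ignored.
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    code = letters[0].upper()
    last = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate two equal codes
        digit = CODES.get(ch, "")
        if digit and digit != last:
            code += digit
            if len(code) == 4:
                break
        last = digit  # a vowel resets last, so a repeated code counts again
    return code.ljust(4, "0")  # pad short codes with zeros


for w in ("guns", "gang", "guinness", "'gang"):
    print(w, soundex(w))  # each prints G520
```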
Time to execute MetaPhone function - 0.079859 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001091 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
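A lookup of this kind can be sketched in Python as a plain dictionary. Everything in the example is hypothetical: the name `lemma_for`, the table `LEXICON` and its entries are placeholders made up for illustration, not data from the corpus lexicon. Only the guns to GUNS pairing mirrors the result shown on this page.

```python
from typing import Optional

# Hypothetical hand-made lexicon: surface form -> lemma.
# These entries are illustrative placeholders, not the real table.
LEXICON = {
    "guns": "GUNS",
    "gunns": "GUNS",   # assumed spelling variant, for illustration only
    "guns'": "GUNS",   # assumed possessive form, for illustration only
}


def lemma_for(word: str) -> Optional[str]:
    """Return the curated lemma, or None when the word is not covered."""
    return LEXICON.get(word.lower())


print(lemma_for("guns"))    # GUNS
print(lemma_for("sonsie"))  # None, because this sketch has no entry for it
```

A dictionary lookup does no computation on the word itself, which is consistent with this method having the shortest execution time reported above.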