A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to going in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
going (0) - 234 freq
goins (1) - 2 freq
gjing (1) - 2 freq
doing (1) - 82 freq
goin (1) - 248 freq
geing (1) - 1 freq
coing (1) - 1 freq
'going (1) - 1 freq
gying (1) - 38 freq
gong (1) - 1 freq
goin' (1) - 5 freq
ging (1) - 343 freq
mong (2) - 1 freq
do-ing (2) - 1 freq
toting (2) - 1 freq
geng (2) - 131 freq
tong (2) - 6 freq
'ging (2) - 4 freq
swing (2) - 68 freq
boxing (2) - 4 freq
gaint (2) - 1 freq
'hing (2) - 8 freq
zoink (2) - 16 freq
gonnag (2) - 1 freq
ong (2) - 2 freq
going (0) - 234 freq
geing (1) - 1 freq
gong (1) - 1 freq
gying (1) - 38 freq
ging (1) - 343 freq
geang (2) - 3 freq
gaeng (2) - 10 freq
ageing (2) - 1 freq
gyang (2) - 177 freq
ginge (2) - 3 freq
gaung (2) - 5 freq
geeng (2) - 3 freq
geng (2) - 131 freq
gang (2) - 1098 freq
geung (2) - 5 freq
goin (2) - 248 freq
doing (2) - 82 freq
gjing (2) - 2 freq
goins (2) - 2 freq
coing (2) - 1 freq
'going (2) - 1 freq
geeing (2) - 1 freq
gyung (2) - 1 freq
goin' (2) - 5 freq
icing (3) - 8 freq
SoundEx code - G520
gang - 1098 freq
going - 234 freq
gangs - 147 freq
ging - 343 freq
gamie's - 2 freq
gyang - 177 freq
gums - 16 freq
gemme's - 5 freq
gemmes - 74 freq
guns - 71 freq
gans - 51 freq
goams - 2 freq
gansey - 17 freq
gings - 75 freq
genyus - 1 freq
genius - 31 freq
games - 247 freq
'gang - 8 freq
gunk - 9 freq
gemm's - 2 freq
gemm-as - 1 freq
gemms - 14 freq
gowns - 2 freq
gangwey - 6 freq
gowans - 22 freq
gyaang - 1 freq
genes - 10 freq
guinness - 9 freq
gangie - 3 freq
gumsy - 5 freq
gyangs - 8 freq
game's - 8 freq
gunns - 2 freq
gink - 1 freq
genie's - 1 freq
'ging - 4 freq
gams - 1 freq
ganesh - 6 freq
gummies - 2 freq
geang - 3 freq
geeang - 1 freq
guineas - 4 freq
gooms - 4 freq
gun's - 1 freq
guns' - 1 freq
goons - 9 freq
gomach - 2 freq
gonnag - 1 freq
gems - 13 freq
gauns - 30 freq
geems - 3 freq
games' - 2 freq
'games - 2 freq
gainsay - 2 freq
gyms - 12 freq
gnough - 1 freq
gamies - 11 freq
'gamies' - 1 freq
ganues - 1 freq
genus - 2 freq
gayness - 1 freq
gansh - 12 freq
gangway - 1 freq
'gyang - 1 freq
gunge - 3 freq
gonks - 1 freq
geng - 131 freq
gengs - 15 freq
'geng - 1 freq
'guinness - 1 freq
geung - 5 freq
geungs - 1 freq
gunnie's - 8 freq
goonies - 3 freq
gyings - 3 freq
gying - 38 freq
gjing - 2 freq
gonzo - 1 freq
ganzie - 14 freq
gaunch - 2 freq
gansie - 3 freq
gouns - 12 freq
gyaains - 1 freq
gaeng - 10 freq
ginge - 3 freq
gien's - 1 freq
geenyoch - 1 freq
geeng - 3 freq
genius's' - 1 freq
gyung - 1 freq
gong - 1 freq
gains - 8 freq
gaens - 1 freq
gemma's - 2 freq
€˜geng - 1 freq
gingie - 1 freq
gunes - 2 freq
gmse - 1 freq
gemmies - 1 freq
geeing - 1 freq
gaeins - 1 freq
goins - 2 freq
€˜gang - 1 freq
gimmick - 2 freq
guangwu - 6 freq
€œging - 5 freq
geing - 1 freq
€˜guinness - 1 freq
€œgeng - 1 freq
gaung - 5 freq
gmx - 1 freq
guans - 1 freq
gonkie - 1 freq
gmoko - 1 freq
ginos - 1 freq
gunk” - 1 freq
‘gonc’ - 1 freq
gnsg - 1 freq
gmac - 8 freq
guiness - 2 freq
gwennej - 1 freq
ghini's - 1 freq
geans - 2 freq
gmc - 1 freq
guangxi - 1 freq
ggimq - 1 freq
geens - 1 freq
gange - 3 freq
goineasy - 1 freq
gomez - 1 freq
gnc - 1 freq
gemsy - 1 freq
'going - 1 freq
MetaPhone code - KNK
gang - 1098 freq
going - 234 freq
cong' - 1 freq
king - 824 freq
keing - 140 freq
'gang - 8 freq
gunk - 9 freq
kink - 8 freq
caaing - 1 freq
keeng - 77 freq
'king - 1 freq
gonnag - 1 freq
käng - 2 freq
kïng - 26 freq
coing - 1 freq
konk - 1 freq
'kinky - 1 freq
keeng' - 1 freq
kanga - 36 freq
conk - 2 freq
quango - 1 freq
gaeng - 10 freq
'keing - 1 freq
kinnik - 1 freq
conneck - 3 freq
kong - 7 freq
kyng - 3 freq
gong - 1 freq
kinky - 1 freq
congo - 1 freq
conc - 1 freq
€˜king - 5 freq
kiang - 1 freq
kang - 1 freq
€˜gang - 1 freq
gaung - 5 freq
kuniwyg - 1 freq
gonkie - 1 freq
gunk” - 1 freq
‘gonc’ - 1 freq
cnag - 1 freq
qnq - 1 freq
cnoc - 1 freq
quinikie - 1 freq
'going - 1 freq
GOING
gae - 501 freq
go - 1915 freq
gone - 282 freq
gaed - 1526 freq
göd - 254 freq
going - 234 freq
gang - 1098 freq
goin - 248 freq
gaun - 1849 freq
went - 1912 freq
gan - 768 freq
wint - 628 freq
gonna - 129 freq
gonnae - 588 freq
Time to execute Levenshtein function - 0.248092 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.479021 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027408 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.075639 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001270 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.