A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to gust in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
gust (0) - 18 freq
gut (1) - 27 freq
bust (1) - 10 freq
gast (1) - 5 freq
guest (1) - 42 freq
just (1) - 1618 freq
gist (1) - 12 freq
gusd (1) - 1 freq
agust (1) - 1 freq
gurt (1) - 1 freq
gush (1) - 2 freq
dust (1) - 89 freq
must (1) - 687 freq
rust (1) - 13 freq
gusto (1) - 6 freq
lust (1) - 20 freq
gus (1) - 21 freq
gusts (1) - 5 freq
guse (1) - 1 freq
gest (1) - 1 freq
gunkt (2) - 2 freq
pusst (2) - 1 freq
pusht (2) - 11 freq
cuist (2) - 14 freq
geyst (2) - 1 freq
gust (0) - 18 freq
agust (1) - 1 freq
gusto (1) - 6 freq
gist (1) - 12 freq
gest (1) - 1 freq
guest (1) - 42 freq
gast (1) - 5 freq
goest (2) - 1 freq
geyst (2) - 1 freq
just (2) - 1618 freq
goast (2) - 1 freq
august (2) - 66 freq
gustie (2) - 3 freq
guisit (2) - 1 freq
aagust (2) - 1 freq
giest (2) - 1 freq
gaist (2) - 2 freq
geist (2) - 6 freq
gut (2) - 27 freq
guse (2) - 1 freq
dust (2) - 89 freq
gush (2) - 2 freq
gurt (2) - 1 freq
gusd (2) - 1 freq
must (2) - 687 freq
SoundEx code - G230
ghaist - 60 freq
guessed - 31 freq
gast - 5 freq
gust - 18 freq
gazed - 15 freq
gowstie - 7 freq
gawkit - 9 freq
ghost - 73 freq
gawkt - 1 freq
guest - 42 freq
ghostey - 1 freq
gazette - 3 freq
gowkit - 6 freq
giekit - 1 freq
gist - 12 freq
goustie - 4 freq
'goustie - 1 freq
guisit - 1 freq
gassed - 3 freq
goast - 1 freq
ghostie - 1 freq
gicd - 1 freq
gagged - 4 freq
ghousty - 1 freq
gowsty - 5 freq
gawked - 6 freq
gouched - 2 freq
guesst - 5 freq
geckit - 3 freq
ghost' - 1 freq
giekt - 1 freq
gacd - 1 freq
gusto - 6 freq
gushet - 2 freq
gowked - 1 freq
gasket - 2 freq
gaecd - 1 freq
ghawst - 2 freq
gissed - 1 freq
gaskit - 1 freq
geist - 6 freq
gaist - 2 freq
giest - 1 freq
gussied - 1 freq
gustie - 3 freq
geck't - 1 freq
gayest - 1 freq
gushed - 1 freq
gusset - 1 freq
gest - 1 freq
geyst - 1 freq
gaggit - 1 freq
gecked - 3 freq
ghaistie - 1 freq
gusd - 1 freq
€˜ghost - 1 freq
goest - 1 freq
gigot - 1 freq
€œgawkit - 1 freq
gight - 3 freq
gzyesd - 1 freq
ghosty - 1 freq
gzseyxxta - 1 freq
gessed - 1 freq
gyct - 1 freq
geeked - 1 freq
MetaPhone code - KST
kissed - 66 freq
kist - 117 freq
cast - 227 freq
'kist - 1 freq
kistie - 14 freq
coast - 119 freq
caused - 78 freq
guessed - 31 freq
kisst - 6 freq
gast - 5 freq
cost - 114 freq
gust - 18 freq
cuist - 14 freq
gazed - 15 freq
gowstie - 7 freq
guest - 42 freq
gazette - 3 freq
kest - 18 freq
cassidy - 4 freq
keest - 3 freq
cassette - 3 freq
costa - 7 freq
quest - 18 freq
goustie - 4 freq
'goustie - 1 freq
caist - 4 freq
guisit - 1 freq
causit - 3 freq
gceid - 1 freq
gassed - 3 freq
gcid - 4 freq
goast - 1 freq
casset - 2 freq
gowsty - 5 freq
'c'est - 1 freq
caste - 2 freq
cassat - 1 freq
'cost - 1 freq
keistie - 1 freq
caased - 6 freq
guesst - 5 freq
csd - 11 freq
quayside - 3 freq
kïsst - 2 freq
kïst - 1 freq
cosset - 1 freq
gusto - 6 freq
cöst - 3 freq
c'est - 1 freq
kist' - 5 freq
coist - 4 freq
cüst - 2 freq
gaist - 2 freq
kist'' - 1 freq
kost - 1 freq
costo - 1 freq
coost - 1 freq
cosst - 1 freq
gussied - 1 freq
gustie - 3 freq
coasta - 1 freq
€˜kist - 1 freq
€œkist - 2 freq
gusset - 1 freq
cousteau - 1 freq
€˜kssst - 1 freq
gusd - 1 freq
goest - 1 freq
coste - 1 freq
kis't - 1 freq
quizzed - 1 freq
costie - 1 freq
qzd - 1 freq
qzitw - 1 freq
'cast' - 1 freq
qzt - 1 freq
gzet - 1 freq
kstew - 1 freq
yxd - 1 freq
GUST
Time to execute Levenshtein function - 0.687015 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 1.198525 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.099707 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.108387 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001214 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.