A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to gases in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
gases (0) - 6 freq
games (1) - 248 freq
gasps (1) - 7 freq
gcses (1) - 3 freq
gass (1) - 4 freq
gates (1) - 104 freq
gazes (1) - 2 freq
gaes (1) - 173 freq
gasts (1) - 1 freq
gapes (1) - 1 freq
bases (1) - 6 freq
gales (1) - 12 freq
eases (1) - 1 freq
ases (1) - 1 freq
cases (1) - 56 freq
vases (1) - 3 freq
gabe (2) - 1 freq
daies (2) - 2 freq
wanes (2) - 2 freq
gaurs (2) - 2 freq
wises (2) - 1 freq
hapes (2) - 1 freq
ses (2) - 6 freq
anes (2) - 222 freq
fuses (2) - 1 freq
gases (0) - 6 freq
gass (1) - 4 freq
goss (2) - 1 freq
cases (2) - 56 freq
ases (2) - 1 freq
guises (2) - 2 freq
gess (2) - 1 freq
gauss (2) - 2 freq
gassy (2) - 1 freq
gaseous (2) - 7 freq
eases (2) - 1 freq
vases (2) - 3 freq
games (2) - 248 freq
gasps (2) - 7 freq
gcses (2) - 3 freq
gales (2) - 12 freq
gazes (2) - 2 freq
gasts (2) - 1 freq
bases (2) - 6 freq
gapes (2) - 1 freq
gaes (2) - 173 freq
gates (2) - 104 freq
ganues (3) - 1 freq
gamies (3) - 11 freq
giess (3) - 5 freq
SoundEx code - G220
gossock - 1 freq
guckie's - 1 freq
gizzes - 2 freq
gauges - 1 freq
gozos - 1 freq
guises - 2 freq
gushes - 5 freq
gazes - 2 freq
geggies - 1 freq
goges - 5 freq
guces - 1 freq
geggy's - 1 freq
geegaws - 8 freq
guesses - 6 freq
gases - 6 freq
goggie's - 1 freq
gaseous - 7 freq
gee-gaws - 1 freq
geishas - 1 freq
gouges - 1 freq
€˜guesses - 1 freq
goose's - 1 freq
giggsys - 1 freq
gechoecko - 1 freq
gawjuss - 3 freq
gsagyzxgz - 1 freq
gagxexzgea - 1 freq
gigas - 1 freq
MetaPhone code - KSS
kisses - 29 freq
causeys - 4 freq
Ă©cossais - 1 freq
cases - 56 freq
cassie's - 2 freq
causies - 6 freq
caises - 2 freq
casees - 3 freq
causes - 25 freq
gozos - 1 freq
guises - 2 freq
gazes - 2 freq
cozzies - 1 freq
guces - 1 freq
ca'ses - 1 freq
keysies - 1 freq
casses - 1 freq
guesses - 6 freq
gases - 6 freq
cassies - 6 freq
caases - 3 freq
caesses - 2 freq
kissies - 1 freq
causays - 1 freq
gaseous - 7 freq
kaces - 1 freq
€˜guesses - 1 freq
goose's - 1 freq
cses - 2 freq
Écossaise - 1 freq
quizzes - 4 freq
cosies - 1 freq
czazo - 1 freq
csez - 1 freq
qzs - 1 freq
hqwzs - 1 freq
wxis - 1 freq
kzhz - 1 freq
GASES
Time to execute Levenshtein function - 0.258424 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.357797 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.029598 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.041796 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001086 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.