A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to glossary in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
glossary (0) - 15 freq
glossay (1) - 1 freq
glossary' (1) - 1 freq
grossars (2) - 1 freq
glossy (2) - 14 freq
rosary (3) - 6 freq
glosst (3) - 1 freq
glassa (3) - 1 freq
clossan (3) - 2 freq
clossach (3) - 1 freq
glossin (3) - 4 freq
glosses (3) - 2 freq
glassy (3) - 2 freq
gloss (3) - 7 freq
gossipy (3) - 1 freq
clossly (3) - 2 freq
glossed (3) - 2 freq
glessfy (3) - 1 freq
elosser (3) - 1 freq
closser (3) - 13 freq
flossy (3) - 1 freq
glessy (3) - 1 freq
glesgay (3) - 1 freq
glistery (3) - 1 freq
grossly (3) - 1 freq
glossary (0) - 15 freq
glossary' (2) - 1 freq
glossay (2) - 1 freq
glossy (3) - 14 freq
closser (4) - 13 freq
glessfy (4) - 1 freq
elosser (4) - 1 freq
glessy (4) - 1 freq
glossaries (4) - 4 freq
glassily (4) - 1 freq
gloss (4) - 7 freq
glistery (4) - 1 freq
glossed (4) - 2 freq
glassy (4) - 2 freq
glosst (4) - 1 freq
grossars (4) - 1 freq
glossin (4) - 4 freq
glassa (4) - 1 freq
glosses (4) - 2 freq
glessfi (5) - 1 freq
glister (5) - 8 freq
glass (5) - 80 freq
glessie (5) - 3 freq
glaissy (5) - 1 freq
glasses (5) - 24 freq
SoundEx code - G426
glegger - 2 freq
gallacher - 14 freq
gless-grien - 1 freq
glaciers - 2 freq
glossary - 15 freq
glaigerin - 1 freq
glossaries - 4 freq
gilchrist - 3 freq
glacier - 2 freq
gallagher - 4 freq
glossary' - 1 freq
gillgreeny - 1 freq
glasgowworlds - 2 freq
gilesgraeme - 5 freq
gaelicroadsign - 1 freq
ggljrvkv - 1 freq
glazier - 1 freq
MetaPhone code - KLSR
closer - 138 freq
cloaser - 5 freq
closure - 3 freq
closser - 13 freq
clausura - 1 freq
glossary - 15 freq
glacier - 2 freq
glossary' - 1 freq
calzer - 1 freq
glazier - 1 freq
GLOSSARY
Time to execute Levenshtein function - 0.192185 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.365171 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028027 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037944 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000930 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.