A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to gleg in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
gleg (0) - 127 freq
geg (1) - 15 freq
cleg (1) - 12 freq
glog (1) - 3 freq
gaeg (1) - 1 freq
glug (1) - 6 freq
greg (1) - 5 freq
pleg (1) - 2 freq
gle (1) - 4 freq
glen (1) - 162 freq
gled (1) - 317 freq
gleb (1) - 1 freq
glig (1) - 1 freq
glee (1) - 30 freq
leg (1) - 199 freq
'gleg (1) - 1 freq
fleg (1) - 84 freq
glegs (1) - 1 freq
glegg (1) - 1 freq
iles (2) - 3 freq
ile' (2) - 1 freq
gyed (2) - 6 freq
lle (2) - 1 freq
gg (2) - 4 freq
lzg (2) - 1 freq
gleg (0) - 127 freq
glig (1) - 1 freq
glog (1) - 3 freq
glug (1) - 6 freq
'gleg (2) - 1 freq
leg (2) - 199 freq
glee (2) - 30 freq
fleg (2) - 84 freq
glegs (2) - 1 freq
gaelg (2) - 1 freq
glegg (2) - 1 freq
gulag (2) - 2 freq
gloag (2) - 8 freq
gleb (2) - 1 freq
geg (2) - 15 freq
cleg (2) - 12 freq
greg (2) - 5 freq
gaeg (2) - 1 freq
gle (2) - 4 freq
pleg (2) - 2 freq
gled (2) - 317 freq
glen (2) - 162 freq
laeg (3) - 1 freq
glegly (3) - 18 freq
glit (3) - 1 freq
SoundEx code - G420
gulls - 20 freq
glisk - 70 freq
gless - 392 freq
glesca - 168 freq
gleg - 127 freq
glesgie - 6 freq
'glesca - 1 freq
gylick - 1 freq
gaelic - 638 freq
glesea - 2 freq
glass - 80 freq
gallus - 97 freq
glazie - 5 freq
gills - 8 freq
glog - 3 freq
geylies - 3 freq
glesga - 90 freq
gloss - 7 freq
glows - 6 freq
glasgow - 146 freq
glessie - 3 freq
glaizie - 5 freq
goulash - 2 freq
gulliegaw - 1 freq
ghouls - 5 freq
gallows - 17 freq
goals - 51 freq
glaesga - 1 freq
ghoulies - 2 freq
gallowa's - 1 freq
glaik - 3 freq
gullocks - 1 freq
gales - 12 freq
gallic - 4 freq
giles - 9 freq
ghillies - 1 freq
gleek - 5 freq
glessy - 1 freq
glossy - 12 freq
galaxy - 20 freq
glisks - 10 freq
glassy - 2 freq
glaisscs - 1 freq
glaiss - 62 freq
glaisgae - 6 freq
gill's - 2 freq
glug - 6 freq
gilly-go - 1 freq
glesgae - 17 freq
gleskie - 3 freq
goalie's - 2 freq
gollach - 4 freq
golach - 3 freq
'gleg - 1 freq
gull's - 1 freq
gloag's - 1 freq
gulag - 2 freq
gels - 1 freq
glescae - 7 freq
giless - 4 freq
glesgay - 1 freq
geyleis - 1 freq
gillies - 7 freq
glack - 1 freq
gillis - 1 freq
gellick - 47 freq
gellicks - 5 freq
gauls - 2 freq
glasgw - 1 freq
galleys - 3 freq
glasgae - 17 freq
gaalic - 1 freq
gaeilge - 2 freq
gaels - 17 freq
gallik - 3 freq
glas- - 1 freq
glaze - 3 freq
gleck - 1 freq
ghoulish - 2 freq
gulsh - 2 freq
gallowsha - 2 freq
glasow - 1 freq
gaelige - 2 freq
gullies - 3 freq
golac - 1 freq
gloze - 1 freq
gaelic's - 1 freq
geology - 5 freq
'glasgow' - 1 freq
gowls - 2 freq
guiless - 1 freq
galas - 1 freq
glase - 1 freq
gulsa - 1 freq
glackie - 2 freq
goloch - 2 freq
'glackie' - 2 freq
ghlas - 1 freq
gallous - 1 freq
gloag - 8 freq
'gloag - 1 freq
gale's - 1 freq
gleesh - 1 freq
glesgy - 7 freq
'glesgy - 3 freq
glig - 1 freq
gliss - 1 freq
gals - 2 freq
glaissy - 1 freq
€”gaelic - 1 freq
€œgaelic - 2 freq
geluk - 1 freq
€˜glesca - 2 freq
glaess - 1 freq
glashie - 1 freq
glasgou - 1 freq
gallas - 1 freq
glassa - 1 freq
€œglasgow - 2 freq
glasgaa - 1 freq
glegs - 1 freq
gleg-ee - 1 freq
glais - 1 freq
gaelg - 1 freq
gah-lik - 1 freq
gay-lik - 1 freq
glaisÂ’s - 1 freq
glaschu - 2 freq
golaÂ’s - 1 freq
golloch - 1 freq
glesgo - 1 freq
glassgow - 1 freq
gollywogs - 1 freq
gellick's - 10 freq
glassÂ’ - 2 freq
gilhaus - 1 freq
gillhaus - 1 freq
gihlhaus - 1 freq
glasgow' - 1 freq
glegg - 1 freq
glasgie - 1 freq
guillys - 3 freq
glossay - 1 freq
galloways - 1 freq
gulloch - 4 freq
gullock - 3 freq
gaelsÂ’ - 1 freq
MetaPhone code - KLK
cleg - 12 freq
clock - 194 freq
cleek - 31 freq
claggy - 12 freq
gleg - 127 freq
gaelic - 638 freq
collogue - 62 freq
cloak - 30 freq
cleck - 10 freq
glog - 3 freq
click - 129 freq
claik - 24 freq
cleuk - 3 freq
clack - 9 freq
cloack - 20 freq
claggie - 4 freq
gulliegaw - 1 freq
clak - 2 freq
cleik - 9 freq
cluke - 1 freq
colleague - 15 freq
glaik - 3 freq
clag - 4 freq
gallic - 4 freq
colic - 4 freq
gleek - 5 freq
claick - 1 freq
glug - 6 freq
'gleg - 1 freq
gulag - 2 freq
colleck - 11 freq
coulg - 1 freq
glack - 1 freq
colleg - 2 freq
clicky - 41 freq
cloke - 3 freq
cluck - 2 freq
clique - 2 freq
gaalic - 1 freq
gallik - 3 freq
gleck - 1 freq
claggey - 1 freq
clog - 2 freq
golac - 1 freq
claag - 1 freq
glackie - 2 freq
'glackie' - 2 freq
gloag - 8 freq
'gloag - 1 freq
calico - 1 freq
glig - 1 freq
clug - 1 freq
€”gaelic - 1 freq
€œgaelic - 2 freq
clek - 1 freq
clok - 1 freq
€™clock - 12 freq
cliquey - 1 freq
clook - 3 freq
clegg - 2 freq
€™cloak - 3 freq
€™cloack - 5 freq
clooky - 1 freq
'click' - 1 freq
gleg-ee - 1 freq
klegg - 1 freq
gaelg - 1 freq
gah-lik - 1 freq
gay-lik - 1 freq
clc - 1 freq
qlc - 1 freq
kolkw - 1 freq
cliq - 1 freq
glegg - 1 freq
clokkie - 1 freq
yqlhc - 1 freq
gullock - 3 freq
GLEG
Time to execute Levenshtein function - 0.173745 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.316902 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.029811 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.036139 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000826 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.