A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to gap in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
gap (0) - 49 freq
yap (1) - 17 freq
gan (1) - 768 freq
sap (1) - 10 freq
gag (1) - 4 freq
kap (1) - 1 freq
gae (1) - 503 freq
gad (1) - 2 freq
pap (1) - 21 freq
gup (1) - 2 freq
gnap (1) - 1 freq
zap (1) - 1 freq
gal (1) - 4 freq
gcp (1) - 1 freq
lap (1) - 71 freq
gar (1) - 162 freq
ap (1) - 94 freq
bap (1) - 16 freq
gaps (1) - 11 freq
gp (1) - 9 freq
cap (1) - 47 freq
hap (1) - 55 freq
gav (1) - 2 freq
gat (1) - 399 freq
dap (1) - 1 freq
gap (0) - 49 freq
gaap (1) - 1 freq
gaup (1) - 4 freq
gaip (1) - 1 freq
gp (1) - 9 freq
gup (1) - 2 freq
gyp (1) - 3 freq
gip (1) - 3 freq
gape (1) - 4 freq
ga (2) - 29 freq
gdp (2) - 3 freq
gaw (2) - 5 freq
gam (2) - 2 freq
map (2) - 91 freq
aap (2) - 26 freq
gasp (2) - 39 freq
jap (2) - 2 freq
gau (2) - 2 freq
gai (2) - 1 freq
gipe (2) - 1 freq
gype (2) - 47 freq
geyp (2) - 1 freq
iagp (2) - 1 freq
agp (2) - 1 freq
egip (2) - 1 freq
SoundEx code - G100
gave - 241 freq
gab - 32 freq
goave - 6 freq
gf - 5 freq
gove - 14 freq
gap - 49 freq
guff - 57 freq
gob - 15 freq
give - 166 freq
goaf - 1 freq
gawp - 9 freq
'give - 3 freq
'govey - 1 freq
gype - 47 freq
gif - 207 freq
gowf - 26 freq
gaff - 34 freq
gub - 30 freq
gaup - 4 freq
gowff - 3 freq
gie-up - 1 freq
gub' - 2 freq
gyp - 3 freq
gabbie - 1 freq
gpo - 4 freq
gip - 3 freq
gav - 2 freq
gaive - 1 freq
gibby - 6 freq
geyp - 1 freq
gibbie - 4 freq
goab - 1 freq
gowp - 65 freq
gbh - 3 freq
gp - 9 freq
gov - 39 freq
geev - 1 freq
geff - 1 freq
'gif - 3 freq
gaffe - 1 freq
govie - 3 freq
'goavie - 1 freq
giuff - 1 freq
giveaway - 3 freq
giff - 1 freq
guffy - 2 freq
gva - 1 freq
geb - 1 freq
gaip - 1 freq
ghff - 1 freq
gyf - 1 freq
geve - 1 freq
guffaw - 1 freq
gup - 2 freq
gaap - 1 freq
gape - 4 freq
gibb - 12 freq
go-by - 2 freq
€˜gave - 1 freq
gaif - 1 freq
gaub - 1 freq
gabe - 1 freq
€˜give - 1 freq
guffie - 3 freq
gipe - 1 freq
giep - 1 freq
gv - 11 freq
gfb - 1 freq
gsv - 2 freq
geoff - 2 freq
gooby - 2 freq
gsf - 1 freq
gb - 2 freq
gcf - 1 freq
gbbo - 1 freq
ghb - 1 freq
gvu - 1 freq
gcp - 1 freq
gabby - 1 freq
MetaPhone code - KP
cup - 316 freq
keep - 1547 freq
cowp - 64 freq
kep - 136 freq
kip - 38 freq
gap - 49 freq
cap - 47 freq
cope - 36 freq
gawp - 9 freq
coup - 25 freq
copy - 100 freq
'keep - 23 freq
gaup - 4 freq
cuppa - 21 freq
cuppie - 27 freq
co-op - 15 freq
caip - 3 freq
quip - 2 freq
cop - 4 freq
keip - 4 freq
coapy - 2 freq
gpo - 4 freq
coap - 2 freq
cape - 14 freq
capo - 2 freq
gowp - 65 freq
caup - 11 freq
'cowp - 3 freq
gp - 9 freq
keepy - 1 freq
kap - 1 freq
'kappa' - 1 freq
kowp - 1 freq
kop - 2 freq
kepe - 4 freq
copie - 17 freq
copp - 1 freq
cappie - 3 freq
coapie - 4 freq
kep' - 2 freq
kaip - 2 freq
caap - 6 freq
gaip - 1 freq
gup - 2 freq
€™kiep - 1 freq
gaap - 1 freq
kypie - 1 freq
gape - 4 freq
coopie - 2 freq
€œkeep - 8 freq
coop - 1 freq
keppy - 1 freq
€˜keep - 2 freq
kappa - 1 freq
€˜copy - 1 freq
cuppy - 3 freq
€™keep - 1 freq
co-opy - 2 freq
co-oopie - 1 freq
cappy - 3 freq
kaypee - 2 freq
kp - 6 freq
qp - 7 freq
qpo - 1 freq
ykypy - 1 freq
hqpw - 1 freq
ckp - 1 freq
cp - 4 freq
kapo - 1 freq
GAP
Time to execute Levenshtein function - 0.174983 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.334373 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027745 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037285 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000858 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.