A Corpus of 21st Century Scots Texts

Levenshtein Distance

Similar words to gaupin in Corpus

Levenshtein (distance in parentheses)
gaupin (0) - 8 freq
gampin (1) - 1 freq
laupin (1) - 2 freq
gaspin (1) - 13 freq
gaapin (1) - 1 freq
gawpin (1) - 33 freq
jaupin (1) - 1 freq
gapin (1) - 10 freq
gauin (1) - 46 freq
goupin (1) - 1 freq
gappin (1) - 2 freq
lampin (2) - 6 freq
cauin (2) - 1 freq
cauvin (2) - 1 freq
causin (2) - 14 freq
taupit (2) - 1 freq
gaudie (2) - 1 freq
bampin (2) - 1 freq
glaepin (2) - 1 freq
gacin (2) - 1 freq
jaupit (2) - 1 freq
laepin (2) - 1 freq
gilpin (2) - 1 freq
raspin (2) - 5 freq
gaunie (2) - 6 freq
Double Levenshtein (distance in parentheses)
gaupin (0) - 8 freq
gapin (1) - 10 freq
gaapin (1) - 1 freq
goupin (1) - 1 freq
gaspin (2) - 13 freq
gopin (2) - 1 freq
gampin (2) - 1 freq
gupan (2) - 1 freq
gappin (2) - 2 freq
gypin (2) - 3 freq
laupin (2) - 2 freq
gawpin (2) - 33 freq
gauin (2) - 46 freq
jaupin (2) - 1 freq
gazin (3) - 18 freq
groupin (3) - 1 freq
gauped (3) - 2 freq
graipin (3) - 3 freq
gropin (3) - 2 freq
yappin (3) - 8 freq
gain (3) - 53 freq
gaun (3) - 1849 freq
gaarin (3) - 1 freq
lapin (3) - 1 freq
raepin (3) - 3 freq
SoundEx code - G150
gowpen - 4 freq
govin - 9 freq
goavyin - 1 freq
givin - 8 freq
gowpin - 30 freq
gapin - 10 freq
given - 68 freq
guffin - 1 freq
gabbin - 19 freq
gavin - 42 freq
gaupin - 8 freq
gawpin - 33 freq
gypin - 3 freq
gibbon - 10 freq
gappen - 2 freq
gappin - 2 freq
gabbana - 1 freq
'gaban' - 1 freq
govan - 10 freq
geffin - 1 freq
gif'n - 5 freq
giban - 1 freq
gvaain - 1 freq
gaffin - 5 freq
gappan - 1 freq
goavin - 1 freq
gubbin - 3 freq
gupan - 1 freq
giovanni - 3 freq
givan - 4 freq
gowfin - 2 freq
gvaan - 1 freq
ghobhainn - 1 freq
gobban - 1 freq
gevin - 1 freq
gopin - 1 freq
gaapin - 1 freq
‘giovanni - 2 freq
guffan - 1 freq
govn - 2 freq
giovino - 1 freq
'given' - 1 freq
'goupin' - 1 freq
goupin - 1 freq
gbn - 1 freq
gfm - 1 freq
gcbinnie - 2 freq
MetaPhone code - KPN
gowpen - 4 freq
keepin - 213 freq
coupon - 52 freq
cuppin - 6 freq
gowpin - 30 freq
gapin - 10 freq
capone - 1 freq
gaupin - 8 freq
coupin - 7 freq
gawpin - 33 freq
cowpin - 26 freq
keppin - 10 freq
copin - 4 freq
keipin - 2 freq
keepin' - 8 freq
keepen - 1 freq
coupen - 1 freq
gappen - 2 freq
gappin - 2 freq
kippin - 1 freq
keepan - 12 freq
copan - 1 freq
coupan - 1 freq
gappan - 1 freq
gupan - 1 freq
gopin - 1 freq
gaapin - 1 freq
cappin - 1 freq
coopin - 1 freq
capon - 1 freq
kpn - 1 freq
'goupin' - 1 freq
goupin - 1 freq
Manually curated - GAUPIN
Time to execute Levenshtein function - 0.181456 milliseconds
The Levenshtein distance is the minimum number of single-character replacements, insertions or deletions needed to transform one word into another. It is useful for detecting typos and alternative spellings.
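As an illustration only, the distance can be computed with the standard dynamic-programming method. This is a minimal Python sketch, not the code behind this page; the function name `levenshtein` is chosen here for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits turning a into b."""
    # Keep the shorter word as the inner loop to use less memory.
    if len(a) < len(b):
        a, b = b, a
    # previous[j] = distance between the processed prefix of a and b[:j]
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (or keep it)
            ))
        previous = current
    return previous[-1]


print(levenshtein("gaupin", "gawpin"))  # 1
print(levenshtein("gaupin", "lampin"))  # 2
```

Both printed values agree with the scores shown for gawpin and lampin in the Levenshtein list.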
Time to execute Double Levenshtein function - 0.343144 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
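A rough Python sketch of that idea follows. The page does not show its own implementation, so the names `double_levenshtein` and `strip_vowels` are hypothetical. The vowel set a, e, i, o, u, y is also an assumption, picked because it reproduces the scores listed for words such as gypin (2) and yappin (3).

```python
def levenshtein(a: str, b: str) -> int:
    """Plain edit distance (insert, delete, replace), row by row."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + (ca != cb)))
        previous = current
    return previous[-1]


VOWELS = set("aeiouy")  # assumption: y counts as a vowel


def strip_vowels(word: str) -> str:
    """Keep only the characters that are not in the assumed vowel set."""
    return "".join(c for c in word if c.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on their consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("gaupin", "gapin"))   # 1: only a vowel differs
print(double_levenshtein("gaupin", "gaspin"))  # 2: the extra consonant counts twice
```

A change in a vowel costs 1, while a change in a consonant shows up in both passes and costs 2, which is why gapin stays at 1 but gaspin moves from 1 to 2.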
Time to execute SoundEx function - 0.027373 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
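For readers who want to see how a code such as G150 arises, here is a minimal Python sketch of the classic American Soundex rules. It is an illustration, not the function this page calls, and the name `soundex` is chosen here for the example.

```python
# Consonant groups that share a digit in classic Soundex.
CODES = {}
for digit, letters in enumerate(["bfpv", "cgjkqsxz", "dt", "l", "mn", "r"], start=1):
    for letter in letters:
        CODES[letter] = str(digit)


def soundex(word: str) -> str:
    """Return the four-character Soundex code (first letter plus three digits)."""
    letters = [c for c in word.lower() if c.isascii() and c.isalpha()]
    if not letters:
        return ""
    result = letters[0].upper()
    last = CODES.get(letters[0], "")  # the first letter's code also blocks repeats
    for c in letters[1:]:
        if c in "hw":
            continue  # h and w do not separate letters with the same code
        code = CODES.get(c, "")  # vowels and y have no code
        if code and code != last:
            result += code
            if len(result) == 4:
                break
        last = code  # a vowel resets this, so a repeated code counts again
    return result.ljust(4, "0")  # pad short codes with zeros


print(soundex("gaupin"))  # G150
print(soundex("gowpen"))  # G150
```

The vowels are dropped, p maps to 1 and n to 5, and the code is padded to four characters, so gaupin and gowpen land in the same bucket.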
Time to execute MetaPhone function - 0.041138 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, and so does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.000809 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
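In code, such a lexicon can be as simple as a dictionary from spelling to lemma. The sketch below is hypothetical: only the gaupin to GAUPIN pair comes from this page, the second entry is invented for illustration, and the names `LEXICON` and `lookup_lemma` are assumptions.

```python
from typing import Optional

# Hand-made table linking spellings to lemmas.
LEXICON = {
    "gaupin": "GAUPIN",  # pair shown on this page
    "gawpin": "GAUPIN",  # hypothetical entry, for illustration only
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the curated lemma, or None when the word is not covered."""
    return LEXICON.get(word.lower())


print(lookup_lemma("gaupin"))  # GAUPIN
print(lookup_lemma("ablow"))   # None, not in this toy table
```

A plain dictionary lookup does no computation over the word itself, which fits the very short execution time reported for this method.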