A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to govan in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
govan (0) - 10 freq
rovan (1) - 1 freq
gowan (1) - 16 freq
go-an (1) - 1 freq
lovan (1) - 3 freq
movan (1) - 10 freq
givan (1) - 4 freq
govn (1) - 2 freq
govin (1) - 9 freq
goan (1) - 64 freq
goin (2) - 248 freq
gan (2) - 768 freq
govern (2) - 3 freq
shovan (2) - 1 freq
guan (2) - 7 freq
glean (2) - 4 freq
boxan (2) - 1 freq
wogan (2) - 1 freq
gon (2) - 6 freq
logan (2) - 12 freq
goad (2) - 105 freq
van (2) - 159 freq
govan's (2) - 1 freq
gove (2) - 14 freq
gown (2) - 2 freq
govan (0) - 10 freq
givan (1) - 4 freq
govn (1) - 2 freq
govin (1) - 9 freq
givin (2) - 8 freq
gevin (2) - 1 freq
given (2) - 68 freq
goavin (2) - 1 freq
gavin (2) - 42 freq
gvaan (2) - 1 freq
movan (2) - 10 freq
lovan (2) - 3 freq
go-an (2) - 1 freq
rovan (2) - 1 freq
gowan (2) - 16 freq
goan (2) - 64 freq
gavsan (3) - 1 freq
graan (3) - 14 freq
gaean (3) - 2 freq
gov (3) - 39 freq
tovin (3) - 8 freq
wavan (3) - 9 freq
go'n (3) - 8 freq
gyaan (3) - 83 freq
cavan (3) - 1 freq
SoundEx code - G150
gowpen - 4 freq
govin - 9 freq
goavyin - 1 freq
givin - 8 freq
gowpin - 30 freq
gapin - 10 freq
given - 68 freq
guffin - 1 freq
gabbin - 19 freq
gavin - 42 freq
gaupin - 8 freq
gawpin - 33 freq
gypin - 3 freq
gibbon - 10 freq
gappen - 2 freq
gappin - 2 freq
gabbana - 1 freq
'gaban' - 1 freq
govan - 10 freq
geffin - 1 freq
gif'n - 5 freq
giban - 1 freq
gvaain - 1 freq
gaffin - 5 freq
gappan - 1 freq
goavin - 1 freq
gubbin - 3 freq
gupan - 1 freq
giovanni - 3 freq
givan - 4 freq
gowfin - 2 freq
gvaan - 1 freq
ghobhainn - 1 freq
gobban - 1 freq
gevin - 1 freq
gopin - 1 freq
gaapin - 1 freq
€˜giovanni - 2 freq
guffan - 1 freq
govn - 2 freq
giovino - 1 freq
'given' - 1 freq
'goupin' - 1 freq
goupin - 1 freq
gbn - 1 freq
gfm - 1 freq
gcbinnie - 2 freq
MetaPhone code - KFN
govin - 9 freq
cavin - 1 freq
caffeine - 11 freq
guffin - 1 freq
coffin - 65 freq
coafin - 1 freq
gavin - 42 freq
coaffin - 5 freq
coughin - 14 freq
coven - 4 freq
cauvin - 1 freq
coffen - 2 freq
kevin - 45 freq
govan - 10 freq
quaffin - 2 freq
gvaain - 1 freq
gaffin - 5 freq
goavin - 1 freq
cofhn - 3 freq
coffeen - 1 freq
coaffin- - 1 freq
gowfin - 2 freq
gvaan - 1 freq
gaughan - 8 freq
cavan - 1 freq
guffan - 1 freq
caven - 1 freq
govn - 2 freq
kevin' - 1 freq
GOVAN
Time to execute Levenshtein function - 0.221792 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.387066 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027101 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.036522 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000833 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.