A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to gvaan in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
gvaan (0) - 1 freq
gyaan (1) - 83 freq
gjaan (1) - 1 freq
gvaain (1) - 1 freq
gaan (1) - 244 freq
graan (1) - 14 freq
vaam (2) - 3 freq
avaa (2) - 30 freq
grann (2) - 1 freq
gazan (2) - 2 freq
aan (2) - 30 freq
ga'an (2) - 6 freq
staan (2) - 13 freq
gain (2) - 53 freq
swaan (2) - 3 freq
shaan (2) - 26 freq
gaal (2) - 1 freq
grain (2) - 78 freq
graand (2) - 4 freq
graat (2) - 5 freq
vaar (2) - 1 freq
praan (2) - 4 freq
gawn (2) - 34 freq
jaan (2) - 1 freq
gowan (2) - 16 freq
gvaan (0) - 1 freq
gvaain (1) - 1 freq
givan (2) - 4 freq
graan (2) - 14 freq
govan (2) - 10 freq
gyaan (2) - 83 freq
gaan (2) - 244 freq
gjaan (2) - 1 freq
vain (3) - 30 freq
gran (3) - 232 freq
gyaain (3) - 85 freq
gaun (3) - 1877 freq
geyan (3) - 17 freq
van (3) - 161 freq
gaen (3) - 191 freq
gyan (3) - 104 freq
glean (3) - 5 freq
graen (3) - 1 freq
go-an (3) - 1 freq
grayan (3) - 1 freq
giean (3) - 4 freq
guan (3) - 11 freq
groan (3) - 19 freq
gavsan (3) - 1 freq
gva (3) - 1 freq
SoundEx code - G150
gowpen - 5 freq
govin - 9 freq
goavyin - 1 freq
givin - 8 freq
gowpin - 30 freq
gapin - 10 freq
given - 69 freq
guffin - 1 freq
gabbin - 19 freq
gavin - 42 freq
gaupin - 8 freq
gawpin - 34 freq
gypin - 3 freq
gibbon - 10 freq
gappen - 2 freq
gappin - 2 freq
gabbana - 1 freq
'gaban' - 1 freq
govan - 10 freq
geffin - 1 freq
gif'n - 5 freq
giban - 1 freq
gvaain - 1 freq
gaffin - 5 freq
gappan - 1 freq
goavin - 1 freq
gubbin - 3 freq
gupan - 1 freq
giovanni - 3 freq
givan - 4 freq
gowfin - 2 freq
gvaan - 1 freq
ghobhainn - 1 freq
gobban - 1 freq
gevin - 1 freq
gopin - 1 freq
gaapin - 1 freq
€˜giovanni - 2 freq
guffan - 1 freq
govn - 2 freq
giovino - 1 freq
'given' - 1 freq
'goupin' - 1 freq
goupin - 1 freq
gbn - 1 freq
gfm - 1 freq
gcbinnie - 2 freq
MetaPhone code - KFN
govin - 9 freq
cavin - 1 freq
caffeine - 12 freq
guffin - 1 freq
coffin - 67 freq
coafin - 1 freq
gavin - 42 freq
coaffin - 5 freq
coughin - 14 freq
coven - 4 freq
cauvin - 1 freq
coffen - 2 freq
kevin - 46 freq
govan - 10 freq
quaffin - 2 freq
gvaain - 1 freq
gaffin - 5 freq
goavin - 1 freq
cofhn - 3 freq
coffeen - 1 freq
coaffin- - 1 freq
gowfin - 2 freq
gvaan - 1 freq
gaughan - 8 freq
cavan - 1 freq
guffan - 1 freq
caven - 1 freq
govn - 2 freq
kevin' - 1 freq
GVAAN
Time to execute Levenshtein function - 0.208745 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.360324 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028837 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.039472 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000892 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.