A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to given in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
given (0) - 69 freq
givan (1) - 4 freq
give (1) - 166 freq
wiven (1) - 1 freq
givin (1) - 8 freq
gi'en (1) - 7 freq
aiven (1) - 66 freq
niven (1) - 3 freq
siven (1) - 2 freq
gien (1) - 1024 freq
riven (1) - 15 freq
gives (1) - 20 freq
gipe (2) - 1 freq
gide (2) - 1 freq
gyeen (2) - 1 freq
liver (2) - 34 freq
ripen (2) - 8 freq
goves (2) - 2 freq
gove (2) - 14 freq
govern (2) - 3 freq
gi'ed (2) - 2 freq
giving (2) - 26 freq
coven (2) - 4 freq
kiver (2) - 5 freq
'give (2) - 3 freq
given (0) - 69 freq
givin (1) - 8 freq
givan (1) - 4 freq
govin (2) - 9 freq
govan (2) - 10 freq
govn (2) - 2 freq
gevin (2) - 1 freq
gives (2) - 20 freq
gavin (2) - 42 freq
riven (2) - 15 freq
wiven (2) - 1 freq
give (2) - 166 freq
aiven (2) - 66 freq
gi'en (2) - 7 freq
gien (2) - 1024 freq
niven (2) - 3 freq
siven (2) - 2 freq
girn (3) - 60 freq
divin (3) - 13 freq
giean (3) - 4 freq
mivin (3) - 1 freq
hivin (3) - 27 freq
ga'en (3) - 1 freq
goavin (3) - 1 freq
goen (3) - 1 freq
SoundEx code - G150
gowpen - 5 freq
govin - 9 freq
goavyin - 1 freq
givin - 8 freq
gowpin - 30 freq
gapin - 10 freq
given - 69 freq
guffin - 1 freq
gabbin - 19 freq
gavin - 42 freq
gaupin - 8 freq
gawpin - 34 freq
gypin - 3 freq
gibbon - 10 freq
gappen - 2 freq
gappin - 2 freq
gabbana - 1 freq
'gaban' - 1 freq
govan - 10 freq
geffin - 1 freq
gif'n - 5 freq
giban - 1 freq
gvaain - 1 freq
gaffin - 5 freq
gappan - 1 freq
goavin - 1 freq
gubbin - 3 freq
gupan - 1 freq
giovanni - 3 freq
givan - 4 freq
gowfin - 2 freq
gvaan - 1 freq
ghobhainn - 1 freq
gobban - 1 freq
gevin - 1 freq
gopin - 1 freq
gaapin - 1 freq
€˜giovanni - 2 freq
guffan - 1 freq
govn - 2 freq
giovino - 1 freq
'given' - 1 freq
'goupin' - 1 freq
goupin - 1 freq
gbn - 1 freq
gfm - 1 freq
gcbinnie - 2 freq
MetaPhone code - JFN
givin - 8 freq
given - 69 freq
jeavin - 1 freq
geffin - 1 freq
gif'n - 5 freq
jovian - 3 freq
giovanni - 3 freq
givan - 4 freq
gevin - 1 freq
javan - 4 freq
€˜giovanni - 2 freq
giovino - 1 freq
'given' - 1 freq
GIVEN
gie - 2567 freq
give - 166 freq
gies - 516 freq
gives - 20 freq
gave - 241 freq
gied - 1359 freq
gien - 1024 freq
given - 69 freq
geez - 20 freq
giein - 437 freq
Time to execute Levenshtein function - 0.327429 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.541921 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.029922 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.060161 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000970 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.