A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

Similar words to giant in Corpus

Methods, in the order listed below: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated

Levenshtein (distance in parentheses)
giant (0) - 84 freq
giants (1) - 19 freq
grant (1) - 101 freq
gant (1) - 15 freq
girnt (1) - 4 freq
giaur (2) - 1 freq
graft (2) - 35 freq
scant (2) - 34 freq
galt (2) - 6 freq
tirnt (2) - 21 freq
grann (2) - 1 freq
girns (2) - 19 freq
gind (2) - 1 freq
meant (2) - 454 freq
kant (2) - 4 freq
disnt (2) - 1 freq
ian (2) - 202 freq
gangt (2) - 1 freq
guan (2) - 7 freq
kirnt (2) - 2 freq
miynt (2) - 1 freq
doant (2) - 2 freq
gink (2) - 1 freq
fant (2) - 2 freq
titnt (2) - 1 freq

Double Levenshtein (distance in parentheses)
giant (0) - 84 freq
gant (1) - 15 freq
gaint (2) - 1 freq
gaunt (2) - 8 freq
giants (2) - 19 freq
ginty (2) - 4 freq
gent (2) - 10 freq
grant (2) - 101 freq
girnt (2) - 4 freq
mient (3) - 3 freq
granut (3) - 1 freq
gainit (3) - 2 freq
gien (3) - 1014 freq
signt (3) - 7 freq
gaet (3) - 22 freq
gift (3) - 116 freq
gaan (3) - 244 freq
pant (3) - 2 freq
giean (3) - 4 freq
tant (3) - 1 freq
gibt (3) - 3 freq
ging (3) - 343 freq
gart (3) - 158 freq
pint (3) - 186 freq
laant (3) - 1 freq
SoundEx code - G530
giant - 84 freq
gandy - 1 freq
gant - 15 freq
gnawed - 5 freq
gent - 10 freq
gained - 15 freq
gannet - 10 freq
gnawit - 1 freq
gantae - 6 freq
gaantae - 2 freq
gna'd - 1 freq
gaunt - 8 freq
gontae - 7 freq
goantae - 1 freq
gunned - 1 freq
gentie - 18 freq
gandhi - 1 freq
g-and-t - 1 freq
gundy - 4 freq
gauntae - 1 freq
gamut - 1 freq
ginty - 4 freq
gaaned - 1 freq
gendy - 1 freq
ghandi - 1 freq
giein't - 1 freq
gomed - 18 freq
gainit - 2 freq
goamit - 1 freq
gind - 1 freq
gointy - 2 freq
gonty - 1 freq
gaen-oot - 1 freq
“gimmet - 1 freq
gaint - 1 freq
gamed - 1 freq
gond - 1 freq
gmde - 1 freq
gnd - 1 freq
MetaPhone code - JNT
joined - 47 freq
jined - 38 freq
jyned - 72 freq
giant - 84 freq
joint - 50 freq
jiande - 1 freq
jeyned - 2 freq
janet - 75 freq
jynt - 45 freq
jant - 2 freq
gent - 10 freq
jundie - 3 freq
jint - 6 freq
'jennet - 4 freq
jennet - 94 freq
jaunty - 2 freq
jynit - 3 freq
jeyn't - 3 freq
gentie - 18 freq
jaunt - 15 freq
ginty - 4 freq
gendy - 1 freq
junta - 1 freq
giein't - 1 freq
junt - 1 freq
gind - 1 freq
jinty - 6 freq
johnnydee - 55 freq
johnnyd - 1 freq
jjnt - 1 freq
janet' - 2 freq
janet’ - 1 freq
Manually curated lemma - GIANT
Time to execute Levenshtein function - 0.185825 milliseconds
The Levenshtein distance is the number of single-character replacements, insertions or deletions needed to transform one word into another. It is useful for detecting typos and alternative spellings.
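
The definition above can be illustrated with a minimal Python sketch. This is not the site's own implementation: the function names `levenshtein` and `nearest` and the five-word sample vocabulary are assumptions chosen for illustration, with the sample words taken from the listing above.

```python
def levenshtein(a: str, b: str) -> int:
    """Count the single-character insertions, deletions and
    substitutions needed to turn a into b (classic dynamic programme)."""
    previous = list(range(len(b) + 1))  # distances from "" to each prefix of b
    for i, ca in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to ""
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # substitute ca with cb (or keep it)
            ))
        previous = current
    return previous[-1]


def nearest(word, vocabulary, limit=5):
    """Rank a vocabulary by distance to word (hypothetical helper)."""
    scored = sorted((levenshtein(word, w), w) for w in vocabulary)
    return [(w, d) for d, w in scored[:limit]]


# Hypothetical sample vocabulary; the words appear in the listing above.
sample = ["giants", "grant", "gant", "scant", "meant"]
print(nearest("giant", sample))
# [('gant', 1), ('giants', 1), ('grant', 1), ('meant', 2), ('scant', 2)]
```

The distances agree with the values in parentheses in the Levenshtein listing. Ties are broken alphabetically here, which is a choice of this sketch and need not match the ordering used on the page.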
Time to execute Double Levenshtein function - 0.330919 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
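
A rough Python sketch of that idea follows. The names `strip_vowels` and `double_levenshtein` are hypothetical, and the vowel set is an assumption: the page does not say which letters are removed, and `y` is included here only because that is consistent with the score shown for "ginty" in the listing.

```python
def levenshtein(a: str, b: str) -> int:
    """Plain edit distance: insertions, deletions, substitutions."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            current.append(min(
                previous[j] + 1,               # delete ca
                current[j - 1] + 1,            # insert cb
                previous[j - 1] + (ca != cb),  # substitute or keep
            ))
        previous = current
    return previous[-1]


# Assumption: the letters treated as vowels are not stated in the source.
VOWELS = set("aeiouy")


def strip_vowels(word: str) -> str:
    """Keep only the characters that are not in VOWELS."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("giant", "gant"))    # 1: skeletons "gnt" and "gnt" match
print(double_levenshtein("giant", "giants"))  # 2: the extra "s" is counted twice
```

A vowel-only difference such as "gant" is counted once, while a consonant difference such as "giants" is counted in both passes, which is where the double weight on consonants comes from.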
Time to execute SoundEx function - 0.027284 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
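
As a hedged illustration, the sketch below implements the commonly described American Soundex rules: keep the first letter, map the remaining consonants to digits, collapse adjacent equal codes, and pad to four characters. Whether the site uses exactly these rules is an assumption. The function name `soundex` and the choice to skip non-letters are choices of this sketch.

```python
# Digit assigned to each consonant group; vowels, y, h and w have no digit.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word: str) -> str:
    """Return a four-character Soundex code, or "" if word has no letters."""
    # Assumption: anything outside a-z (hyphens, apostrophes, quotes) is skipped.
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    result = letters[0].upper()
    last = CODES.get(letters[0], "")  # the first letter's code can absorb a repeat
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate two equal codes
        code = CODES.get(ch, "")
        if code and code != last:
            result += code
            if len(result) == 4:
                break
        last = code  # a vowel resets last, so a repeated code counts again
    return result.ljust(4, "0")


print(soundex("giant"))    # G530
print(soundex("gnawed"))   # G530
print(soundex("g-and-t"))  # G530
```

All three words share the code G530 shown in the listing, which is why spellings as different as "gnawed" and "g-and-t" appear alongside "giant".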
Time to execute MetaPhone function - 0.039529 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.000784 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
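
A lookup of this kind can be sketched as a plain dictionary. The table contents below are hypothetical: only the pairing of "giant" with GIANT appears on this page, and the other entries, along with the names `LEXICON` and `lemma`, are invented for illustration.

```python
from typing import Optional

# Hypothetical hand-built lexicon: surface form -> lemma.
# Only "giant" -> "GIANT" is taken from the page; the rest is illustrative.
LEXICON = {
    "giant": "GIANT",
    "giants": "GIANT",
    "gaint": "GIANT",  # illustrative typo entry
}


def lemma(word: str) -> Optional[str]:
    """Return the curated lemma, or None when the word is not covered."""
    return LEXICON.get(word.lower())


print(lemma("giant"))  # GIANT
print(lemma("ablow"))  # None, because this sketch has no entry for it
```

Returning `None` for uncovered words mirrors the caveat that not all words are in the table. A single dictionary lookup is also consistent with this method having the shortest execution time reported above.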