A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to ghandi in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
ghandi (0) - 1 freq
handin (2) - 15 freq
shansi (2) - 1 freq
handis (2) - 1 freq
ghana (2) - 1 freq
sandi (2) - 5 freq
shanzi (2) - 4 freq
glands (2) - 2 freq
grandie (2) - 1 freq
hands (2) - 175 freq
hand (2) - 319 freq
handy (2) - 55 freq
gandhi (2) - 1 freq
grande (2) - 1 freq
gandy (2) - 1 freq
granda (2) - 274 freq
handw (2) - 1 freq
shand (2) - 11 freq
handit (2) - 27 freq
grands (2) - 1 freq
mandi (2) - 1 freq
hanoi (2) - 1 freq
handie (2) - 3 freq
gyands (2) - 1 freq
shandy (2) - 8 freq
ghandi (0) - 1 freq
grand (3) - 353 freq
handy (3) - 55 freq
grande (3) - 1 freq
gandy (3) - 1 freq
granda (3) - 274 freq
hand (3) - 319 freq
shand (3) - 11 freq
grandie (3) - 1 freq
shandy (3) - 8 freq
hindi (3) - 2 freq
ghana (3) - 1 freq
handie (3) - 3 freq
eoghand (3) - 1 freq
hunde (4) - 1 freq
ghud (4) - 4 freq
haand (4) - 104 freq
gdnd (4) - 1 freq
graund (4) - 24 freq
hund (4) - 1 freq
'haund (4) - 3 freq
hindu (4) - 8 freq
gundy (4) - 4 freq
rhind (4) - 1 freq
ahind (4) - 11 freq
SoundEx code - G530
giant - 84 freq
gandy - 1 freq
gant - 15 freq
gnawed - 5 freq
gent - 10 freq
gained - 15 freq
gannet - 10 freq
gnawit - 1 freq
gantae - 6 freq
gaantae - 2 freq
gna'd - 1 freq
gaunt - 8 freq
gontae - 7 freq
goantae - 1 freq
gunned - 1 freq
gentie - 18 freq
gandhi - 1 freq
g-and-t - 1 freq
gundy - 4 freq
gauntae - 1 freq
gamut - 1 freq
ginty - 4 freq
gaaned - 1 freq
gendy - 1 freq
ghandi - 1 freq
giein't - 1 freq
gomed - 18 freq
gainit - 2 freq
goamit - 1 freq
gind - 1 freq
gointy - 2 freq
gonty - 1 freq
gaen-oot - 1 freq
€œgimmet - 1 freq
gaint - 1 freq
gamed - 1 freq
gond - 1 freq
gmde - 1 freq
gnd - 1 freq
MetaPhone code - FNT
fund - 563 freq
find - 773 freq
found - 220 freq
fond - 100 freq
fond-ae-ae - 1 freq
faint - 35 freq
fanned - 9 freq
foond - 140 freq
fiend - 4 freq
finnd - 125 freq
foondy - 1 freq
fawned - 1 freq
fent - 11 freq
vent - 6 freq
fend - 28 freq
fand - 176 freq
phont - 3 freq
vauntie - 25 freq
fient - 6 freq
phoned - 73 freq
'-fand - 1 freq
fined - 12 freq
fint - 2 freq
veined - 2 freq
finite - 3 freq
vainity - 2 freq
font - 7 freq
funnoot - 3 freq
faant - 1 freq
fyn't - 1 freq
finoot - 1 freq
funoot - 1 freq
foont - 2 freq
vanitie - 2 freq
vanity - 10 freq
fount - 2 freq
funnd - 36 freq
fuund - 2 freq
fin'd - 1 freq
finito - 1 freq
fun'd - 1 freq
phone't - 1 freq
vynd - 4 freq
vand - 2 freq
fondue - 1 freq
founnit - 3 freq
vaned - 1 freq
Øyvind - 19 freq
®Øyvind - 1 freq
feenty - 1 freq
fiind - 2 freq
ghandi - 1 freq
vaunty - 2 freq
funned - 2 freq
fond-o-o - 1 freq
'find' - 1 freq
fant - 2 freq
'fent' - 1 freq
feint - 5 freq
'feigned - 1 freq
founit - 2 freq
faent - 2 freq
founde - 1 freq
€œfind - 1 freq
€˜foond - 1 freq
vaunt - 1 freq
fundie - 7 freq
funday - 1 freq
fundy - 1 freq
fnd - 1 freq
fuind - 1 freq
GHANDI
Time to execute Levenshtein function - 0.220225 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.395965 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.031360 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.041208 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000877 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.