Corpus of 21st Century Scots Texts - Levenshtein

A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to ghandi in Corpus

Levenshtein	Double Levenshtein	SoundEx	MetaPhone	Manually curated
ghandi (0) - 1 freq handin (2) - 15 freq shansi (2) - 1 freq handis (2) - 1 freq ghana (2) - 1 freq sandi (2) - 5 freq shanzi (2) - 4 freq glands (2) - 2 freq grandie (2) - 1 freq hands (2) - 175 freq hand (2) - 319 freq handy (2) - 55 freq gandhi (2) - 1 freq grande (2) - 1 freq gandy (2) - 1 freq granda (2) - 274 freq handw (2) - 1 freq shand (2) - 11 freq handit (2) - 27 freq grands (2) - 1 freq mandi (2) - 1 freq hanoi (2) - 1 freq handie (2) - 3 freq gyands (2) - 1 freq shandy (2) - 8 freq	ghandi (0) - 1 freq grand (3) - 353 freq handy (3) - 55 freq grande (3) - 1 freq gandy (3) - 1 freq granda (3) - 274 freq hand (3) - 319 freq shand (3) - 11 freq grandie (3) - 1 freq shandy (3) - 8 freq hindi (3) - 2 freq ghana (3) - 1 freq handie (3) - 3 freq eoghand (3) - 1 freq hunde (4) - 1 freq ghud (4) - 4 freq haand (4) - 104 freq gdnd (4) - 1 freq graund (4) - 24 freq hund (4) - 1 freq 'haund (4) - 3 freq hindu (4) - 8 freq gundy (4) - 4 freq rhind (4) - 1 freq ahind (4) - 11 freq	SoundEx code - G530 giant - 84 freq gandy - 1 freq gant - 15 freq gnawed - 5 freq gent - 10 freq gained - 15 freq gannet - 10 freq gnawit - 1 freq gantae - 6 freq gaantae - 2 freq gna'd - 1 freq gaunt - 8 freq gontae - 7 freq goantae - 1 freq gunned - 1 freq gentie - 18 freq gandhi - 1 freq g-and-t - 1 freq gundy - 4 freq gauntae - 1 freq gamut - 1 freq ginty - 4 freq gaaned - 1 freq gendy - 1 freq ghandi - 1 freq giein't - 1 freq gomed - 18 freq gainit - 2 freq goamit - 1 freq gind - 1 freq gointy - 2 freq gonty - 1 freq gaen-oot - 1 freq ��gimmet - 1 freq gaint - 1 freq gamed - 1 freq gond - 1 freq gmde - 1 freq gnd - 1 freq	MetaPhone code - FNT fund - 563 freq find - 773 freq found - 220 freq fond - 100 freq fond-ae-ae - 1 freq faint - 35 freq fanned - 9 freq foond - 140 freq fiend - 4 freq finnd - 125 freq foondy - 1 freq fawned - 1 freq fent - 11 freq vent - 6 freq fend - 28 freq fand - 176 freq phont - 3 freq vauntie - 25 freq fient - 6 freq phoned - 73 freq '-fand - 1 freq fined - 12 freq fint - 2 freq veined - 2 freq finite - 3 freq vainity - 2 freq font - 7 freq funnoot - 3 freq faant - 1 freq fyn't - 1 freq finoot - 1 freq funoot - 1 freq foont - 2 freq vanitie - 2 freq vanity - 10 freq fount - 2 freq funnd - 36 freq fuund - 2 freq fin'd - 1 freq finito - 1 freq fun'd - 1 freq phone't - 1 freq vynd - 4 freq vand - 2 freq fondue - 1 freq founnit - 3 freq vaned - 1 freq Øyvind - 19 freq ®Øyvind - 1 freq feenty - 1 freq fiind - 2 freq ghandi - 1 freq vaunty - 2 freq funned - 2 freq fond-o-o - 1 freq 'find' - 1 freq fant - 2 freq 'fent' - 1 freq feint - 5 freq 'feigned - 1 freq founit - 2 freq faent - 2 freq founde - 1 freq ��find - 1 freq ��foond - 1 freq vaunt - 1 freq fundie - 7 freq funday - 1 freq fundy - 1 freq fnd - 1 freq fuind - 1 freq	GHANDI
Time to execute Levenshtein function - 0.220225 milliseconds The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings	Time to execute Double Levenshtein function - 0.395965 milliseconds In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.	Time to execute SoundEx function - 0.031360 milliseconds Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.	Time to execute MetaPhone function - 0.041208 milliseconds Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.	Time to execute Manually curated function - 0.000877 milliseconds Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.

Web Analytics