A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to vath in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
vath (0) - 1 freq
hath (1) - 3 freq
path (1) - 178 freq
math (1) - 6 freq
vat (1) - 12 freq
cath (1) - 4 freq
dath (1) - 1 freq
oath (1) - 13 freq
bath (1) - 98 freq
vats (1) - 2 freq
kath (1) - 3 freq
walth (2) - 35 freq
'ach (2) - 27 freq
rah (2) - 5 freq
rate (2) - 100 freq
bathy (2) - 2 freq
boath (2) - 2 freq
aat- (2) - 1 freq
late (2) - 496 freq
watl (2) - 1 freq
vaar (2) - 1 freq
vaa (2) - 1 freq
heth (2) - 25 freq
wat (2) - 52 freq
laith (2) - 25 freq
vath (0) - 1 freq
oath (2) - 13 freq
vats (2) - 2 freq
vethe (2) - 1 freq
dath (2) - 1 freq
kath (2) - 3 freq
bath (2) - 98 freq
cath (2) - 4 freq
hath (2) - 3 freq
math (2) - 6 freq
vat (2) - 12 freq
path (2) - 178 freq
bathe (3) - 7 freq
vuty (3) - 1 freq
neth (3) - 5 freq
lateh (3) - 1 freq
tuath (3) - 1 freq
vjh (3) - 1 freq
vtv (3) - 1 freq
earth (3) - 253 freq
vuh (3) - 1 freq
vote (3) - 253 freq
woth (3) - 1 freq
doth (3) - 9 freq
eeth (3) - 1 freq
SoundEx code - V300
vote - 253 freq
void - 17 freq
vat - 12 freq
vood - 2 freq
vet - 37 freq
voot - 1 freq
vidi - 1 freq
vitae - 3 freq
vow'd - 1 freq
vot - 1 freq
voodoo - 2 freq
video - 105 freq
vatta - 1 freq
voddie - 8 freq
viewed - 9 freq
veet - 6 freq
voddy - 2 freq
vid - 4 freq
vit - 3 freq
vott - 2 freq
vowed - 5 freq
vodd - 2 freq
voyd - 2 freq
voyied - 1 freq
vod - 1 freq
voo'd - 1 freq
ve'd - 1 freq
vaudie - 1 freq
vita - 1 freq
voat - 1 freq
vóat - 1 freq
€œvote - 1 freq
vuty - 1 freq
vwdh - 1 freq
vada - 1 freq
vidyo - 1 freq
vbt - 1 freq
voodo - 1 freq
vath - 1 freq
vwdy - 1 freq
vethe - 1 freq
vpt - 1 freq
vido - 1 freq
vd - 2 freq
vbwtwe - 1 freq
MetaPhone code - F0
faith - 162 freq
fouthie - 16 freq
faith' - 1 freq
fouth - 45 freq
fowth - 5 freq
faithey - 1 freq
'faith - 1 freq
feth - 45 freq
footh - 4 freq
fouthy - 2 freq
€¦faith - 2 freq
fyth - 1 freq
€˜fowth - 1 freq
fethy - 2 freq
€œfeth - 2 freq
foothie - 3 freq
vath - 1 freq
vethe - 1 freq
fith - 1 freq
fth - 2 freq
ghaoth - 1 freq
VATH
Time to execute Levenshtein function - 0.186753 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.366559 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.032846 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037540 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000911 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.