A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to venues in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
venues (0) - 19 freq
avenues (1) - 5 freq
venue (1) - 16 freq
venus (1) - 32 freq
velues (1) - 6 freq
veneer (2) - 1 freq
lenses (2) - 10 freq
revenues (2) - 4 freq
henges (2) - 1 freq
vents (2) - 1 freq
velue (2) - 17 freq
genres (2) - 13 freq
vexes (2) - 3 freq
veneers (2) - 1 freq
venge (2) - 1 freq
values (2) - 14 freq
fences (2) - 22 freq
menses (2) - 1 freq
vennels (2) - 12 freq
'venus (2) - 1 freq
senses (2) - 56 freq
pences (2) - 3 freq
menes (2) - 6 freq
tenures (2) - 1 freq
veluet (2) - 1 freq
venues (0) - 19 freq
avenues (1) - 5 freq
venus (1) - 32 freq
vines (2) - 6 freq
vanes (2) - 2 freq
velues (2) - 6 freq
venue (2) - 16 freq
wenes (3) - 1 freq
denies (3) - 12 freq
verus (3) - 1 freq
enes (3) - 2 freq
eenies (3) - 15 freq
venusia (3) - 1 freq
avenue (3) - 19 freq
evens (3) - 1 freq
vans (3) - 22 freq
veins (3) - 44 freq
ovens (3) - 4 freq
menus (3) - 3 freq
genes (3) - 11 freq
ganues (3) - 1 freq
genus (3) - 2 freq
vaines (3) - 1 freq
venge (3) - 1 freq
values (3) - 14 freq
SoundEx code - V520
vans - 22 freq
venus - 32 freq
vainish - 9 freq
veins - 44 freq
van's - 1 freq
vines - 6 freq
venice - 11 freq
vamoose - 1 freq
venge - 1 freq
'venus - 1 freq
venues - 19 freq
vanessa - 1 freq
vonnie's - 1 freq
viennese - 2 freq
venusia - 1 freq
vinci - 1 freq
vanes - 2 freq
vanish - 6 freq
vaines - 1 freq
vinnyÂ’s - 1 freq
viewing - 6 freq
vonoq - 1 freq
vying - 1 freq
vvmj - 1 freq
vance - 6 freq
vmas - 1 freq
MetaPhone code - FNS
fence - 154 freq
fancy - 281 freq
vans - 22 freq
phone's - 9 freq
funcy - 90 freq
venus - 32 freq
veins - 44 freq
'fancy - 4 freq
fans - 141 freq
phones - 50 freq
van's - 1 freq
fins - 37 freq
founess - 1 freq
vines - 6 freq
'fines - 1 freq
venice - 11 freq
fainness - 3 freq
funs - 7 freq
fines - 7 freq
fownes - 2 freq
fin's - 1 freq
faansee - 1 freq
fansee - 1 freq
funsee - 1 freq
fauns - 1 freq
'venus - 1 freq
venues - 19 freq
fannies - 12 freq
vanessa - 1 freq
fince - 5 freq
fan's - 3 freq
'funcy - 1 freq
vonnie's - 1 freq
funess - 1 freq
yvonne's - 13 freq
fauncy - 3 freq
fancie - 17 freq
viennese - 2 freq
finesse - 2 freq
foons - 7 freq
fanny's - 1 freq
'fancy' - 1 freq
vinci - 1 freq
fens - 3 freq
vanes - 2 freq
fines' - 1 freq
fionza - 1 freq
fancy' - 1 freq
fons - 1 freq
founs - 9 freq
faan's - 1 freq
funns - 1 freq
vaines - 1 freq
funcie - 1 freq
fonns - 1 freq
feinis - 2 freq
€˜fancy - 2 freq
fansy - 1 freq
feyness - 1 freq
fiona's - 1 freq
finns - 1 freq
€œfancy - 1 freq
€™fancy - 1 freq
faun's - 3 freq
finÂ’s - 1 freq
vinnyÂ’s - 1 freq
funzo - 1 freq
fannys - 1 freq
ghini's - 1 freq
fanniiiieees - 1 freq
phoneÂ’s - 1 freq
vance - 6 freq
fones - 1 freq
VENUES
Time to execute Levenshtein function - 0.693152 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.907707 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.087030 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.108894 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000878 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.