A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to vegan in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
vegan (0) - 12 freq
vegan' (1) - 1 freq
regan (1) - 1 freq
began (1) - 298 freq
vegas (1) - 1 freq
megan (1) - 10 freq
vega (1) - 1 freq
egan (1) - 3 freq
vegans (1) - 1 freq
sega (2) - 1 freq
segas (2) - 1 freq
deuan (2) - 16 freq
lepan (2) - 1 freq
nbgan (2) - 1 freq
lean (2) - 48 freq
sean (2) - 16 freq
yeman (2) - 1 freq
sagan (2) - 1 freq
leman (2) - 3 freq
logan (2) - 12 freq
gan (2) - 768 freq
seean (2) - 14 freq
egon (2) - 1 freq
begane (2) - 1 freq
ean (2) - 46 freq
vegan (0) - 12 freq
vegans (2) - 1 freq
vigin (2) - 1 freq
vaegin (2) - 1 freq
vega (2) - 1 freq
egan (2) - 3 freq
megan (2) - 10 freq
vegan' (2) - 1 freq
vegas (2) - 1 freq
regan (2) - 1 freq
began (2) - 298 freq
gean (3) - 10 freq
pagan (3) - 17 freq
evan (3) - 12 freq
veg (3) - 17 freq
ongan (3) - 2 freq
geyan (3) - 17 freq
hogan (3) - 1 freq
veggi (3) - 1 freq
ingan (3) - 11 freq
regain (3) - 10 freq
keegan (3) - 2 freq
vergin (3) - 2 freq
wogan (3) - 1 freq
ragan (3) - 1 freq
SoundEx code - V250
vexin - 5 freq
veesion - 25 freq
vision - 61 freq
voyagin - 5 freq
vacyoom - 1 freq
vaccum - 1 freq
voicin - 1 freq
vegan - 12 freq
'veesion' - 1 freq
vïsion - 2 freq
vizzyin - 1 freq
vikeen - 2 freq
vaccine - 6 freq
vaegin - 1 freq
vaigin - 4 freq
vissiein - 3 freq
veisioun - 2 freq
vacuum - 5 freq
vigin - 1 freq
vågen - 1 freq
vísion - 1 freq
veision - 2 freq
vexxin - 1 freq
vixen - 1 freq
voican - 1 freq
vaccuum - 1 freq
vogm - 1 freq
vconi - 1 freq
vwkm - 1 freq
vegan' - 1 freq
vaccine' - 1 freq
vcm - 1 freq
vijm - 1 freq
'vision' - 1 freq
vvqmy - 1 freq
vosene - 1 freq
vsn - 1 freq
vicschoen - 1 freq
MetaPhone code - FKN
'fuckin - 29 freq
fuckin - 1032 freq
fykin - 2 freq
f'kn - 2 freq
f'ckin - 1 freq
fuck'n - 1 freq
fook'n - 1 freq
fknnn - 1 freq
vegan - 12 freq
fakin - 3 freq
fowkin - 1 freq
fukin - 1 freq
vikeen - 2 freq
feckin - 51 freq
fuckan - 7 freq
faggan - 1 freq
€œfuckin - 1 freq
fag-en - 1 freq
€˜fuckin - 16 freq
voican - 1 freq
fackin - 1 freq
faackin - 2 freq
€™fuckin - 1 freq
feckinÂ’ - 1 freq
foggin - 1 freq
fuckn - 1 freq
fookin - 2 freq
vconi - 1 freq
fecken - 3 freq
vegan' - 1 freq
phkan - 1 freq
fikeane - 1 freq
fekkin - 1 freq
fuckinÂ’ - 2 freq
VEGAN
Time to execute Levenshtein function - 0.191922 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.344955 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.030917 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.045842 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000894 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.