A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to vtz in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
vtz (0) - 1 freq
rtz (1) - 2 freq
vz (1) - 4 freq
vtv (1) - 1 freq
vpz (1) - 1 freq
itz (1) - 2 freq
gtz (1) - 1 freq
viz (1) - 6 freq
vcz (1) - 1 freq
otz (1) - 1 freq
ptz (1) - 1 freq
vtl (1) - 1 freq
vjq (2) - 1 freq
vsn (2) - 1 freq
liz (2) - 71 freq
jtc (2) - 1 freq
vot (2) - 1 freq
vj (2) - 8 freq
itn (2) - 2 freq
pzz (2) - 1 freq
oto (2) - 1 freq
lcz (2) - 1 freq
gtc (2) - 1 freq
tzx (2) - 1 freq
vaa (2) - 1 freq
vtz (0) - 1 freq
viz (2) - 6 freq
vcz (2) - 1 freq
vtl (2) - 1 freq
gtz (2) - 1 freq
otz (2) - 1 freq
ptz (2) - 1 freq
rtz (2) - 2 freq
itz (2) - 2 freq
vz (2) - 4 freq
vtv (2) - 1 freq
vpz (2) - 1 freq
ootz (3) - 1 freq
vtuq (3) - 1 freq
vote (3) - 253 freq
vets (3) - 9 freq
taz (3) - 21 freq
citz (3) - 2 freq
vita (3) - 1 freq
ktze (3) - 1 freq
vit (3) - 3 freq
vats (3) - 2 freq
ytzo (3) - 1 freq
vefz (3) - 1 freq
vet (3) - 37 freq
SoundEx code - V320
vdus - 1 freq
vodka - 24 freq
vats - 2 freq
vets - 9 freq
votes - 69 freq
veet's - 1 freq
videos - 19 freq
viddies - 1 freq
voits - 2 freq
vettese - 1 freq
voytek - 1 freq
vet's - 1 freq
vidjo - 3 freq
vteso - 1 freq
video's - 2 freq
vtz - 1 freq
vids - 2 freq
vdqzy - 1 freq
vtuq - 1 freq
vdcy - 1 freq
MetaPhone code - FTS
fit's - 210 freq
vdus - 1 freq
fits - 130 freq
fauts - 27 freq
photies - 76 freq
fatties - 1 freq
fates - 4 freq
foties - 6 freq
'photos - 1 freq
photos - 40 freq
'fit's - 16 freq
fat's - 6 freq
fades - 23 freq
fatty's - 2 freq
fitt's - 14 freq
feeds - 23 freq
fuit's - 1 freq
vats - 2 freq
photes - 1 freq
ffitteeeeessshhh - 1 freq
fota's - 3 freq
fuds - 8 freq
fads - 2 freq
photo's - 4 freq
fiddies - 1 freq
fatsu - 1 freq
fuits - 2 freq
vets - 9 freq
fate's - 1 freq
photaes - 9 freq
votes - 69 freq
veet's - 1 freq
feuds - 3 freq
fetes - 1 freq
fïts - 1 freq
videos - 19 freq
faats - 7 freq
viddies - 1 freq
foods - 6 freq
ghds - 1 freq
feets - 1 freq
foaties - 1 freq
feeties - 5 freq
fats - 2 freq
feats - 3 freq
photos' - 1 freq
foetus - 1 freq
voits - 2 freq
ghettoes - 1 freq
fite's - 1 freq
fuids - 2 freq
photoies - 1 freq
photas - 2 freq
'photies' - 1 freq
vettese - 1 freq
€˜fits - 1 freq
€œfits - 1 freq
vet's - 1 freq
fitÂ’s - 27 freq
vteso - 1 freq
video's - 2 freq
vtz - 1 freq
fuitÂ’s - 1 freq
fittÂ’s - 2 freq
fotees - 3 freq
vids - 2 freq
photis - 1 freq
fotos - 1 freq
fotaes - 1 freq
vdcy - 1 freq
VTZ
Time to execute Levenshtein function - 0.495690 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.786555 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.033270 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.097942 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000918 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.