A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to vats in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
vats (0) - 2 freq
'ats (1) - 10 freq
lats (1) - 26 freq
mats (1) - 8 freq
nats (1) - 2 freq
cats (1) - 124 freq
vas (1) - 1 freq
vat (1) - 12 freq
hats (1) - 46 freq
oats (1) - 37 freq
bats (1) - 34 freq
ats (1) - 126 freq
pats (1) - 15 freq
tats (1) - 1 freq
aats (1) - 10 freq
vets (1) - 9 freq
vais (1) - 2 freq
eats (1) - 30 freq
sats (1) - 1 freq
fats (1) - 2 freq
vans (1) - 22 freq
dats (1) - 21 freq
vath (1) - 1 freq
rats (1) - 29 freq
gats (1) - 1 freq
vats (0) - 2 freq
vets (1) - 9 freq
fats (2) - 2 freq
sats (2) - 1 freq
dats (2) - 21 freq
vais (2) - 2 freq
eats (2) - 30 freq
vath (2) - 1 freq
voits (2) - 2 freq
votes (2) - 69 freq
gats (2) - 1 freq
rats (2) - 29 freq
aats (2) - 10 freq
vans (2) - 22 freq
lats (2) - 26 freq
vas (2) - 1 freq
nats (2) - 2 freq
'ats (2) - 10 freq
mats (2) - 8 freq
tats (2) - 1 freq
vat (2) - 12 freq
cats (2) - 124 freq
pats (2) - 15 freq
bats (2) - 34 freq
ats (2) - 126 freq
SoundEx code - V320
vdus - 1 freq
vodka - 24 freq
vats - 2 freq
vets - 9 freq
votes - 69 freq
veet's - 1 freq
videos - 19 freq
viddies - 1 freq
voits - 2 freq
vettese - 1 freq
voytek - 1 freq
vet's - 1 freq
vidjo - 3 freq
vteso - 1 freq
video's - 2 freq
vtz - 1 freq
vids - 2 freq
vdqzy - 1 freq
vtuq - 1 freq
vdcy - 1 freq
MetaPhone code - FTS
fit's - 210 freq
vdus - 1 freq
fits - 130 freq
fauts - 27 freq
photies - 76 freq
fatties - 1 freq
fates - 4 freq
foties - 6 freq
'photos - 1 freq
photos - 40 freq
'fit's - 16 freq
fat's - 6 freq
fades - 23 freq
fatty's - 2 freq
fitt's - 14 freq
feeds - 23 freq
fuit's - 1 freq
vats - 2 freq
photes - 1 freq
ffitteeeeessshhh - 1 freq
fota's - 3 freq
fuds - 8 freq
fads - 2 freq
photo's - 4 freq
fiddies - 1 freq
fatsu - 1 freq
fuits - 2 freq
vets - 9 freq
fate's - 1 freq
photaes - 9 freq
votes - 69 freq
veet's - 1 freq
feuds - 3 freq
fetes - 1 freq
fïts - 1 freq
videos - 19 freq
faats - 7 freq
viddies - 1 freq
foods - 6 freq
ghds - 1 freq
feets - 1 freq
foaties - 1 freq
feeties - 5 freq
fats - 2 freq
feats - 3 freq
photos' - 1 freq
foetus - 1 freq
voits - 2 freq
ghettoes - 1 freq
fite's - 1 freq
fuids - 2 freq
photoies - 1 freq
photas - 2 freq
'photies' - 1 freq
vettese - 1 freq
€˜fits - 1 freq
€œfits - 1 freq
vet's - 1 freq
fitÂ’s - 27 freq
vteso - 1 freq
video's - 2 freq
vtz - 1 freq
fuitÂ’s - 1 freq
fittÂ’s - 2 freq
fotees - 3 freq
vids - 2 freq
photis - 1 freq
fotos - 1 freq
fotaes - 1 freq
vdcy - 1 freq
VATS
Time to execute Levenshtein function - 0.273939 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.544768 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.067762 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.077970 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001004 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.