A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to videos in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
videos (0) - 19 freq
video (1) - 105 freq
video's (1) - 2 freq
idees (2) - 3 freq
vipers (2) - 5 freq
rides (2) - 16 freq
wide-os (2) - 1 freq
hides (2) - 14 freq
side's (2) - 1 freq
tide's (2) - 4 freq
hideous (2) - 7 freq
vids (2) - 2 freq
dides (2) - 1 freq
'ides (2) - 1 freq
tides (2) - 22 freq
bides (2) - 228 freq
vivers (2) - 1 freq
ideas (2) - 145 freq
vices (2) - 12 freq
vines (2) - 6 freq
vidjo (2) - 3 freq
riders (2) - 8 freq
views (2) - 58 freq
vidyo (2) - 1 freq
vido (2) - 1 freq
videos (0) - 19 freq
video (2) - 105 freq
video's (2) - 2 freq
vids (2) - 2 freq
vidjo (3) - 3 freq
vices (3) - 12 freq
views (3) - 58 freq
vines (3) - 6 freq
vibes (3) - 6 freq
vdus (3) - 1 freq
sides (3) - 155 freq
bides (3) - 228 freq
vido (3) - 1 freq
vidyo (3) - 1 freq
ideas (3) - 145 freq
idees (3) - 3 freq
evades (3) - 2 freq
tides (3) - 22 freq
hideous (3) - 7 freq
dides (3) - 1 freq
rides (3) - 16 freq
'ides (3) - 1 freq
hides (3) - 14 freq
voles (4) - 2 freq
godes (4) - 2 freq
SoundEx code - V320
vdus - 1 freq
vodka - 24 freq
vats - 2 freq
vets - 9 freq
votes - 69 freq
veet's - 1 freq
videos - 19 freq
viddies - 1 freq
voits - 2 freq
vettese - 1 freq
voytek - 1 freq
vet's - 1 freq
vidjo - 3 freq
vteso - 1 freq
video's - 2 freq
vtz - 1 freq
vids - 2 freq
vdqzy - 1 freq
vtuq - 1 freq
vdcy - 1 freq
MetaPhone code - FTS
fit's - 209 freq
vdus - 1 freq
fits - 128 freq
fauts - 27 freq
photies - 75 freq
fatties - 1 freq
fates - 4 freq
foties - 6 freq
'photos - 1 freq
photos - 40 freq
'fit's - 16 freq
fat's - 6 freq
fades - 21 freq
fatty's - 2 freq
fitt's - 14 freq
feeds - 20 freq
fuit's - 1 freq
vats - 2 freq
photes - 1 freq
ffitteeeeessshhh - 1 freq
fota's - 3 freq
fuds - 8 freq
fads - 2 freq
photo's - 4 freq
fatsu - 1 freq
fuits - 2 freq
vets - 9 freq
fate's - 1 freq
photaes - 9 freq
votes - 69 freq
veet's - 1 freq
feuds - 3 freq
fetes - 1 freq
fïts - 1 freq
videos - 19 freq
faats - 7 freq
viddies - 1 freq
foods - 6 freq
ghds - 1 freq
feets - 1 freq
foaties - 1 freq
feeties - 5 freq
fats - 2 freq
feats - 3 freq
photos' - 1 freq
foetus - 1 freq
voits - 2 freq
ghettoes - 1 freq
fite's - 1 freq
fuids - 2 freq
photoies - 1 freq
photas - 2 freq
'photies' - 1 freq
vettese - 1 freq
€˜fits - 1 freq
€œfits - 1 freq
vet's - 1 freq
fitÂ’s - 27 freq
vteso - 1 freq
video's - 2 freq
vtz - 1 freq
fuitÂ’s - 1 freq
fittÂ’s - 2 freq
fotees - 3 freq
vids - 2 freq
photis - 1 freq
fotos - 1 freq
fotaes - 1 freq
vdcy - 1 freq
VIDEOS
Time to execute Levenshtein function - 0.172359 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.337105 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027368 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037042 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000843 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.