A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to violinist in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
violinist (0) - 1 freq
violent (3) - 18 freq
violin (3) - 3 freq
virologist (3) - 1 freq
vainist (3) - 1 freq
virginis (3) - 1 freq
virginity (4) - 1 freq
honist (4) - 1 freq
millins (4) - 1 freq
fillins (4) - 2 freq
siblins (4) - 10 freq
violently (4) - 16 freq
volition (4) - 2 freq
kitlins (4) - 6 freq
tholins (4) - 1 freq
visitit (4) - 2 freq
violence (4) - 37 freq
geologist (4) - 1 freq
airliest (4) - 6 freq
violet's (4) - 19 freq
violet (4) - 316 freq
villains (4) - 3 freq
vainish (4) - 9 freq
jillings (4) - 10 freq
volieys (4) - 1 freq
violinist (0) - 1 freq
vainist (4) - 1 freq
violent (4) - 18 freq
violin (5) - 3 freq
virologist (5) - 1 freq
colonists (6) - 1 freq
solist (6) - 1 freq
holiness (6) - 1 freq
botanist (6) - 1 freq
rivlins (6) - 2 freq
feminist (6) - 2 freq
violets' (6) - 1 freq
eilins (6) - 4 freq
silliest (6) - 1 freq
polist (6) - 4 freq
ologist (6) - 1 freq
vainisht (6) - 5 freq
airlines (6) - 1 freq
violence- (6) - 1 freq
colonise (6) - 2 freq
feinist (6) - 8 freq
ailins (6) - 1 freq
yirlins (6) - 1 freq
vaelensi (6) - 2 freq
elitist (6) - 4 freq
SoundEx code - V452
violence - 37 freq
violence- - 1 freq
volumes - 16 freq
villainous - 1 freq
violinist - 1 freq
valence's - 1 freq
veelence - 1 freq
vaelensi - 2 freq
valuing - 3 freq
valencia - 8 freq
valencian - 6 freq
valencia's - 2 freq
valencians - 1 freq
veillanous - 1 freq
valència - 1 freq
valencià - 1 freq
vollums - 1 freq
vowel-length - 1 freq
villains - 3 freq
valmcdermid - 11 freq
valensercla - 1 freq
vlanceb - 1 freq
MetaPhone code - FLNST
violinist - 1 freq
flounced - 1 freq
VIOLINIST
Time to execute Levenshtein function - 0.703045 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 1.490309 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.100847 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.183532 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000957 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.