A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to validate in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
validate (0) - 4 freq
validates (1) - 1 freq
validat (1) - 1 freq
validatit (2) - 1 freq
salivate (2) - 1 freq
validity (2) - 6 freq
validatin (2) - 1 freq
haliday (3) - 1 freq
palate (3) - 4 freq
delicate (3) - 21 freq
validation (3) - 2 freq
halidays (3) - 1 freq
latinate (3) - 21 freq
laminate (3) - 2 freq
caliphate (3) - 1 freq
valid (3) - 26 freq
valiant (3) - 5 freq
valyae (3) - 1 freq
animate (3) - 1 freq
walligate (3) - 1 freq
evaluate (3) - 1 freq
climate (3) - 34 freq
vacate (3) - 1 freq
mandate (3) - 7 freq
invalidated (3) - 1 freq
validate (0) - 4 freq
validat (1) - 1 freq
validity (2) - 6 freq
validates (2) - 1 freq
validatin (3) - 1 freq
validatit (3) - 1 freq
valiant (4) - 5 freq
evaluate (4) - 1 freq
valid (4) - 26 freq
validation (4) - 2 freq
salivate (4) - 1 freq
haldane (5) - 2 freq
invalidated (5) - 1 freq
mandate (5) - 7 freq
invalidity (5) - 2 freq
ivaldie (5) - 2 freq
valet (5) - 1 freq
vald (5) - 1 freq
glidit (5) - 1 freq
laidit (5) - 9 freq
vacate (5) - 1 freq
waldit (5) - 1 freq
haliday (5) - 1 freq
palate (5) - 4 freq
delicate (5) - 21 freq
SoundEx code - V433
vaultit - 2 freq
validation - 2 freq
violated - 2 freq
validate - 4 freq
validity - 6 freq
vaulted - 1 freq
validatin - 1 freq
validatit - 1 freq
validat - 1 freq
validates - 1 freq
MetaPhone code - FLTT
folded - 14 freq
flittit - 44 freq
foldit - 12 freq
vaultit - 2 freq
flitted - 30 freq
fauldit - 16 freq
fluded - 1 freq
floated - 14 freq
filleted - 1 freq
'flittit - 1 freq
fluided - 3 freq
fleetit - 2 freq
violated - 2 freq
validate - 4 freq
flooded - 8 freq
fouldet - 2 freq
foldet - 4 freq
floodit - 2 freq
floatit - 7 freq
faaldit - 4 freq
validity - 6 freq
vaulted - 1 freq
faulded - 4 freq
flaitit - 3 freq
flat-oot - 1 freq
fluted - 1 freq
falded - 1 freq
fowlded - 3 freq
flotit - 1 freq
fauldid - 1 freq
fleeted - 3 freq
flyted - 2 freq
fluidit - 3 freq
flaited - 1 freq
fleudid - 1 freq
validat - 1 freq
fludit - 1 freq
fleudit - 2 freq
VALIDATE
Time to execute Levenshtein function - 0.297172 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.626470 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.031022 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.071778 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000950 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.