A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to avacado in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
avacado (0) - 1 freq
avocado (1) - 2 freq
avocados (2) - 1 freq
vacand (3) - 1 freq
arcade (3) - 8 freq
vacate (3) - 1 freq
cacao (3) - 14 freq
avatar (3) - 2 freq
macao (3) - 1 freq
avatars (3) - 3 freq
vacant (3) - 8 freq
facade (3) - 4 freq
acad (3) - 2 freq
bravado (3) - 9 freq
avaa (3) - 30 freq
vaccum (4) - 1 freq
'vapid (4) - 1 freq
granada (4) - 3 freq
hamada (4) - 1 freq
valpo (4) - 1 freq
advocate (4) - 7 freq
vido (4) - 1 freq
vack (4) - 1 freq
quando (4) - 1 freq
avoids (4) - 3 freq
avacado (0) - 1 freq
avocado (1) - 2 freq
avocados (3) - 1 freq
acad (4) - 2 freq
vyced (4) - 3 freq
facade (4) - 4 freq
arcade (4) - 8 freq
vacand (4) - 1 freq
vacate (4) - 1 freq
anced (5) - 1 freq
accede (5) - 1 freq
vacyoom (5) - 1 freq
peacod (5) - 1 freq
faced (5) - 68 freq
vada (5) - 1 freq
acid (5) - 6 freq
cad (5) - 40 freq
yafcad (5) - 1 freq
evaded (5) - 1 freq
arcadia (5) - 7 freq
vacuum (5) - 5 freq
scad (5) - 6 freq
vicar (5) - 1 freq
avowed (5) - 1 freq
overdo (5) - 1 freq
SoundEx code - A123
apostrophes - 17 freq
apostrophe - 13 freq
affectionately - 5 freq
affection - 22 freq
affecting - 8 freq
awbesit - 2 freq
affectin - 1 freq
affected - 10 freq
affixed - 1 freq
aff-cut - 1 freq
abstraction - 3 freq
afaistane - 5 freq
affset - 2 freq
abstainit - 3 freq
affect - 10 freq
abstrack - 4 freq
affects - 9 freq
abjoot - 1 freq
abstain - 6 freq
affest - 1 freq
affections - 2 freq
apostles - 6 freq
apostrophe' - 2 freq
abasht - 1 freq
affstage - 1 freq
abused - 6 freq
affectionate - 3 freq
affside - 4 freq
affectation - 4 freq
aff-stage - 3 freq
aff-step - 1 freq
abstract - 12 freq
apostophe - 1 freq
abstractions - 1 freq
abstains - 2 freq
aufsteigt - 1 freq
apostolic - 1 freq
affectioun - 1 freq
affectit - 11 freq
avysit - 1 freq
avocado - 2 freq
apostolis - 1 freq
abuised - 1 freq
avast - 3 freq
apposeet - 1 freq
affshoots - 1 freq
aff-piste - 1 freq
affshuit - 1 freq
appeased - 1 freq
afektan - 1 freq
awfastraight - 1 freq
avacado - 1 freq
abcedminded - 1 freq
afcct - 3 freq
afecwuat - 1 freq
avochat - 1 freq
awbesits - 1 freq
afoust - 1 freq
avocados - 1 freq
aavzidhr - 1 freq
abstained - 1 freq
afctranent - 6 freq
abstainin - 1 freq
afcdunbar - 1 freq
abstaining - 1 freq
appggtr - 1 freq
abbeycottage - 1 freq
MetaPhone code - AFKT
aff-cut - 1 freq
affect - 10 freq
avocado - 2 freq
avacado - 1 freq
AVACADO
Time to execute Levenshtein function - 0.223074 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.436042 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.039238 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.050864 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001037 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.