A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint


Similar words to authenticity in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
Levenshtein (distance in brackets)
authenticity (0) - 6 freq
aathenticity (1) - 2 freq
athenticitee (3) - 1 freq
authentic (3) - 13 freq
ethnicity (4) - 3 freq
athentisitee (4) - 1 freq
'authentic (4) - 1 freq
'ethnicity (4) - 1 freq
owthenticitie (4) - 2 freq
authority (5) - 55 freq
auchentocher (5) - 1 freq
domesticity (5) - 1 freq
thenicht (5) - 4 freq
ethneecity (5) - 1 freq
athentik (5) - 4 freq
entity (6) - 17 freq
acteevity (6) - 7 freq
mendacity (6) - 1 freq
austerity (6) - 17 freq
publicity (6) - 9 freq
apathetic (6) - 1 freq
attentive (6) - 1 freq
atheist (6) - 1 freq
utility (6) - 7 freq
athenry (6) - 2 freq
Double Levenshtein (distance in brackets)
authenticity (0) - 6 freq
aathenticity (1) - 2 freq
athenticitee (3) - 1 freq
authentic (4) - 13 freq
athentisitee (5) - 1 freq
owthenticitie (5) - 2 freq
ethnicity (5) - 3 freq
ethneecity (6) - 1 freq
'authentic (6) - 1 freq
'ethnicity (6) - 1 freq
thenicht (7) - 4 freq
athentik (7) - 4 freq
ethnicitie (7) - 1 freq
twentiet (8) - 3 freq
thenight (8) - 5 freq
thenkit (8) - 2 freq
atheistic (8) - 1 freq
inaathentic (8) - 1 freq
then-bit (8) - 1 freq
athletic (8) - 7 freq
athletics (8) - 1 freq
atlantic (8) - 20 freq
thertiet (8) - 1 freq
auchentocher (8) - 1 freq
tentit (8) - 8 freq
SoundEx code - A353
attendit - 28 freq
attention - 179 freq
admit - 94 freq
attentions - 3 freq
a-dundert - 1 freq
attendance - 13 freq
attended - 16 freq
atween-times - 1 freq
admitted - 16 freq
admittit - 10 freq
automatic - 11 freq
automaton - 1 freq
attendin - 11 freq
attend - 36 freq
attimt - 1 freq
attintien - 6 freq
attained - 1 freq
admittedly - 6 freq
automated - 5 freq
admits - 2 freq
'admit - 1 freq
automatically - 47 freq
attintion - 3 freq
attintions - 3 freq
admittin - 6 freq
'authentic - 1 freq
authentic - 13 freq
attendant - 10 freq
attentioun - 1 freq
admït - 1 freq
automation - 1 freq
aduneit - 1 freq
aatomaetion - 1 freq
aathenticity - 2 freq
a-dunder - 1 freq
attends - 1 freq
attendence - 1 freq
authenticity - 6 freq
attendants - 8 freq
attendant's - 1 freq
attendan - 3 freq
attentik - 1 freq
attuned - 2 freq
adenoidal - 1 freq
admittance - 1 freq
automatik - 1 freq
autentyfe - 1 freq
admeittedlie - 1 freq
attention-tax - 1 freq
attendees - 1 freq
attent - 1 freq
addendums - 1 freq
attentiveness - 1 freq
attentive - 1 freq
attendances - 5 freq
aidentitee - 1 freq
athentik - 4 freq
athentisitee - 1 freq
athenticitee - 1 freq
admitting - 1 freq
attention-seekin - 1 freq
automatit - 1 freq
attending - 4 freq
MetaPhone code - A0NTST
aathenticity - 2 freq
authenticity - 6 freq
athentisitee - 1 freq
athenticitee - 1 freq
Manually curated - AUTHENTICITY
Time to execute Levenshtein function - 0.213800 milliseconds
The Levenshtein distance is the minimum number of single-character replacements, insertions or deletions needed to transform one word into another. It's useful for detecting typos and alternative spellings.
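The definition above can be sketched in a few lines. This is a minimal illustration in Python using the standard dynamic-programming approach, not the code this site runs; the function name `levenshtein` is chosen here for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits that turn a into b."""
    # previous[j] is the distance between the characters of a processed
    # so far (before the current one) and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning i characters into nothing takes i deletions
        for j, cb in enumerate(b, start=1):
            current.append(min(
                previous[j] + 1,               # delete ca
                current[j - 1] + 1,            # insert cb
                previous[j - 1] + (ca != cb),  # replace ca with cb, or keep it
            ))
        previous = current
    return previous[-1]


if __name__ == "__main__":
    for word in ("aathenticity", "authentic", "ethnicity"):
        print(word, levenshtein("authenticity", word))
```

For the three words in the usage loop, the sketch gives 1, 3 and 4, the same distances as in the Levenshtein listing.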
Time to execute Double Levenshtein function - 0.383353 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the words as written and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
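A rough sketch of the double pass, in Python. The site's own code is not shown, so two things here are assumptions: that "vowels" means a, e, i, o, u and y, and that everything else (including w and apostrophes) is kept. Counting y as a vowel is the choice that reproduces the distances in the Double Levenshtein listing. The names `strip_vowels` and `double_levenshtein` are made up for this example.

```python
def levenshtein(a: str, b: str) -> int:
    """Plain edit distance (replace, insert, delete), row by row."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + (ca != cb)))
        previous = current
    return previous[-1]


# Assumption: y is treated as a vowel; the site does not say which set it uses.
VOWELS = set("aeiouy")


def strip_vowels(word: str) -> str:
    """Drop vowels and keep every other character in order."""
    return "".join(c for c in word if c.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on their consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


if __name__ == "__main__":
    for word in ("authentic", "ethnicity", "'authentic"):
        print(word, double_levenshtein("authenticity", word))
```

Under these assumptions "authentic" scores 3 + 1 = 4 and "ethnicity" scores 4 + 1 = 5, in line with the listing. Spellings that differ only in their vowels, such as "athenticitee", keep their plain Levenshtein score because the consonant skeletons are identical.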
Time to execute SoundEx function - 0.027616 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
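To show how a code such as A353 comes about, here is a small Python sketch of the classic Soundex rules: keep the first letter, map later consonants to digits, collapse adjacent repeats, pad to four characters. It is an illustration only. Skipping anything outside a-z (apostrophes, hyphens, accented letters) is an assumption made here, and the site's function may handle such characters differently.

```python
# Digit groups of the classic Soundex scheme.
GROUPS = {"1": "bfpv", "2": "cgjkqsxz", "3": "dt", "4": "l", "5": "mn", "6": "r"}
CODES = {ch: digit for digit, letters in GROUPS.items() for ch in letters}


def soundex(word: str) -> str:
    """Four-character Soundex code, e.g. 'authenticity' -> 'A353'."""
    # Assumption: characters outside a-z are ignored entirely.
    letters = [c for c in word.lower() if "a" <= c <= "z"]
    if not letters:
        return ""
    result = letters[0].upper()
    last = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters with the same code
        code = CODES.get(ch, "")  # vowels and y have no code
        if code and code != last:
            result += code
            if len(result) == 4:
                break
        last = code  # a vowel resets 'last', so a repeated code counts again
    return result.ljust(4, "0")


if __name__ == "__main__":
    for word in ("authenticity", "attention", "admit", "automatic"):
        print(word, soundex(word))
```

All four words in the usage loop come out as A353, which is why such unrelated words share the SoundEx list: only the first letter and the next three coded consonants matter.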
Time to execute MetaPhone function - 0.036672 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.000809 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas, including obvious typos and spelling variations. Not all words are covered.
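In code, a lookup of this kind amounts to a dictionary from spellings to lemmas. The Python sketch below is purely illustrative: the table name `LEMMAS`, the helper `lemma_for` and every entry except "authenticity" are hypothetical, and the real lexicon's contents and format are not shown on this page.

```python
from typing import Optional

# Hypothetical excerpt of a hand-built lexicon: spelling -> lemma.
# Only the "authenticity" entry reflects a result shown on this page.
LEMMAS = {
    "authenticity": "AUTHENTICITY",
    "aathenticity": "AUTHENTICITY",  # hypothetical entry
    "athenticitee": "AUTHENTICITY",  # hypothetical entry
}


def lemma_for(word: str) -> Optional[str]:
    """Return the curated lemma, or None when the word is not covered."""
    return LEMMAS.get(word.lower())


if __name__ == "__main__":
    print(lemma_for("authenticity"))
    print(lemma_for("ahint"))  # not in this excerpt, so None
```

A plain table lookup does no distance calculation at all, which fits with it having the shortest execution time of the five methods.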