A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to authors in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
authors (0) - 16 freq
author (1) - 37 freq
'authors (1) - 2 freq
authort (1) - 2 freq
author's (1) - 6 freq
actors (2) - 31 freq
authourt (2) - 2 freq
thors (2) - 1 freq
authorise (2) - 1 freq
athort (2) - 122 freq
'author's (2) - 1 freq
fautors (2) - 1 freq
anchors (2) - 1 freq
aithers (2) - 7 freq
aathor's (2) - 1 freq
uthers (2) - 1 freq
owthors (2) - 15 freq
tutors (2) - 5 freq
muthers (2) - 1 freq
arthurs (2) - 1 freq
autos (2) - 1 freq
athout (3) - 27 freq
thord (3) - 3 freq
atoms (3) - 8 freq
aither (3) - 140 freq
authors (0) - 16 freq
thors (2) - 1 freq
uthers (2) - 1 freq
aithers (2) - 7 freq
authorise (2) - 1 freq
'authors (2) - 2 freq
author's (2) - 6 freq
author (2) - 37 freq
authort (2) - 2 freq
arthurs (3) - 1 freq
eithers (3) - 1 freq
utheris (3) - 1 freq
thirs (3) - 41 freq
ithirs (3) - 10 freq
muthers (3) - 1 freq
thars (3) - 2 freq
others (3) - 52 freq
aathor's (3) - 1 freq
thurs (3) - 16 freq
athort (3) - 122 freq
authourt (3) - 2 freq
tutors (3) - 5 freq
thers (3) - 1 freq
ithers (3) - 574 freq
owthors (3) - 15 freq
SoundEx code - A362
address - 141 freq
addressed - 38 freq
addressin - 19 freq
attracted - 5 freq
addresses - 16 freq
attraction - 17 freq
attractive - 11 freq
authors - 16 freq
aithers - 7 freq
aathor's - 1 freq
adders - 1 freq
atrocities - 3 freq
authorised - 4 freq
attrection - 1 freq
'address - 1 freq
attrack - 4 freq
attractit - 6 freq
attracks - 1 freq
address'll - 1 freq
author's - 6 freq
attrac - 1 freq
addresst - 4 freq
attractin - 5 freq
adressin - 1 freq
attractions - 7 freq
addresg - 1 freq
attrakkit - 2 freq
atrack - 1 freq
attractiveness - 2 freq
adores - 2 freq
attracts - 2 freq
attract - 6 freq
addressin' - 2 freq
attractin' - 1 freq
attractive' - 1 freq
addressees - 2 freq
addressan - 1 freq
adressed - 2 freq
attracktin - 1 freq
addressee - 11 freq
a-dressin - 2 freq
addreesed - 1 freq
attrackit - 1 freq
aid-wirkers - 1 freq
autoreise - 1 freq
attrak - 1 freq
addressing - 2 freq
atrocious - 3 freq
authorise - 1 freq
addressit - 4 freq
aathorised - 1 freq
attersome - 1 freq
attercap - 1 freq
authorship - 4 freq
€˜aduersitie - 1 freq
aduersitie - 1 freq
adreich - 1 freq
authorjla - 7 freq
authoricrats - 1 freq
audreyjarvis - 1 freq
addressinglife - 2 freq
'author's - 1 freq
athrockmorton - 3 freq
'authors - 2 freq
audreys - 1 freq
autoricht - 1 freq
MetaPhone code - A0RS
authors - 16 freq
aithers - 7 freq
aathor's - 1 freq
author's - 6 freq
authorise - 1 freq
'author's - 1 freq
'authors - 2 freq
AUTHORS
Time to execute Levenshtein function - 0.285164 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.473453 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027592 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.036706 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000774 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.