A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

Similar words to autoreise in Corpus

Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated
Levenshtein
autoreise (0) - 1 freq
authorise (2) - 1 freq
surpreise (3) - 6 freq
authorised (3) - 4 freq
atomise (3) - 1 freq
auctorite (3) - 3 freq
autoritie (3) - 3 freq
authorite (3) - 1 freq
futures (4) - 7 freq
altrive (4) - 1 freq
cautrine (4) - 3 freq
atheist (4) - 1 freq
authoritie (4) - 5 freq
futrets (4) - 2 freq
fautors (4) - 1 freq
antoinine (4) - 1 freq
authority (4) - 55 freq
arreist (4) - 3 freq
storeys' (4) - 1 freq
actors' (4) - 1 freq
storie (4) - 28 freq
wunorse (4) - 2 freq
actress (4) - 23 freq
surpreises (4) - 1 freq
surpreised (4) - 5 freq
Double Levenshtein
autoreise (0) - 1 freq
authorise (3) - 1 freq
autoritie (4) - 3 freq
atomise (4) - 1 freq
treis (4) - 9 freq
trais (5) - 1 freq
tutors (5) - 5 freq
tires (5) - 2 freq
uterine (5) - 1 freq
arise (5) - 10 freq
acoorse (5) - 7 freq
autopsy (5) - 2 freq
tortoise (5) - 26 freq
adores (5) - 2 freq
trees (5) - 430 freq
atore (5) - 2 freq
tirse (5) - 3 freq
turse (5) - 1 freq
toryism (5) - 1 freq
araise (5) - 1 freq
azores (5) - 1 freq
antares (5) - 1 freq
tooers (5) - 6 freq
storeys (5) - 3 freq
tories (5) - 115 freq
SoundEx code - A362
address - 141 freq
addressed - 38 freq
addressin - 19 freq
attracted - 5 freq
addresses - 16 freq
attraction - 17 freq
attractive - 11 freq
authors - 16 freq
aithers - 7 freq
aathor's - 1 freq
adders - 1 freq
atrocities - 3 freq
authorised - 4 freq
attrection - 1 freq
'address - 1 freq
attrack - 4 freq
attractit - 6 freq
attracks - 1 freq
address'll - 1 freq
author's - 6 freq
attrac - 1 freq
addresst - 4 freq
attractin - 5 freq
adressin - 1 freq
attractions - 7 freq
addresg - 1 freq
attrakkit - 2 freq
atrack - 1 freq
attractiveness - 2 freq
adores - 2 freq
attracts - 2 freq
attract - 6 freq
addressin' - 2 freq
attractin' - 1 freq
attractive' - 1 freq
addressees - 2 freq
addressan - 1 freq
adressed - 2 freq
attracktin - 1 freq
addressee - 11 freq
a-dressin - 2 freq
addreesed - 1 freq
attrackit - 1 freq
aid-wirkers - 1 freq
autoreise - 1 freq
attrak - 1 freq
addressing - 2 freq
atrocious - 3 freq
authorise - 1 freq
addressit - 4 freq
aathorised - 1 freq
attersome - 1 freq
attercap - 1 freq
authorship - 4 freq
‘aduersitie - 1 freq
aduersitie - 1 freq
adreich - 1 freq
authorjla - 7 freq
authoricrats - 1 freq
audreyjarvis - 1 freq
addressinglife - 2 freq
'author's - 1 freq
athrockmorton - 3 freq
'authors - 2 freq
audreys - 1 freq
autoricht - 1 freq
MetaPhone code - ATRS
address - 141 freq
adders - 1 freq
'address - 1 freq
adores - 2 freq
addressee - 11 freq
autoreise - 1 freq
audreys - 1 freq
Manually curated - AUTOREISE
Time to execute Levenshtein function - 0.217649 milliseconds
The Levenshtein distance is the minimum number of single-character replacements, insertions or deletions needed to transform one word into another. It's useful for detecting typos and alternative spellings.
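The definition above can be sketched in a few lines of Python. This is a minimal illustration of the standard dynamic-programming approach, not the code this site runs; the function name `levenshtein` is an assumption made for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Return the minimum number of single-character edits turning a into b."""
    # Keep the shorter word in the inner loop so the rows stay small.
    if len(a) < len(b):
        a, b = b, a
    # previous[j] holds the distance between the processed prefix of a and b[:j].
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(
                previous[j] + 1,         # delete char_a
                current[j - 1] + 1,      # insert char_b
                previous[j - 1] + cost,  # replace char_a with char_b (or keep it)
            ))
        previous = current
    return previous[-1]


print(levenshtein("autoreise", "authorise"))  # 2
print(levenshtein("autoreise", "atomise"))    # 3
```

The two printed values agree with the distances shown in brackets in the Levenshtein results for autoreise.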
Time to execute Double Levenshtein function - 0.404167 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
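As a rough sketch of the idea, assuming that "vowels" means the letters a, e, i, o and u (the page does not say exactly which letters are stripped), the combined score could be computed like this. The names `strip_vowels` and `double_levenshtein` are invented for the example.

```python
VOWELS = set("aeiou")  # assumption: the site's exact vowel set is not documented


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance (row-by-row dynamic programming)."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop the assumed vowel letters, leaving the consonant skeleton."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Full-word distance plus the distance between consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("autoreise", "authorise"))  # 2 + 1 = 3
print(double_levenshtein("autoreise", "treis"))      # 4 + 0 = 4
```

Under this assumption, treis scores 4 because its consonant skeleton (trs) is identical to that of autoreise, so only the full-word distance counts.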
Time to execute SoundEx function - 0.032979 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
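The encoding can be illustrated with a short Python sketch of the commonly described American Soundex rules: keep the first letter, map the remaining consonants to digits, collapse adjacent repeats, and pad to four characters. This is an assumption about the variant in use; the site's own function may differ in edge cases, and the name `soundex` is chosen for the example.

```python
# Digit assigned to each consonant group; vowels, h, w and y get no digit.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for letter in letters:
        CODES[letter] = digit


def soundex(word: str) -> str:
    """Return a four-character Soundex code, or "" if the word has no letters."""
    letters = [ch for ch in word.lower() if ch.isascii() and ch.isalpha()]
    if not letters:
        return ""
    result = letters[0].upper()
    previous = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters that share a code
        code = CODES.get(ch, "")
        if code and code != previous:
            result += code
            if len(result) == 4:
                break
        previous = code  # a vowel resets this, so a repeated code counts again
    return result.ljust(4, "0")


print(soundex("autoreise"))  # A362
print(soundex("address"))    # A362
```

Both words reduce to A362, which is why address appears among the SoundEx matches for autoreise even though the spellings are far apart.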
Time to execute MetaPhone function - 0.038085 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000928 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
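A lookup of this kind amounts to a dictionary from word forms to lemmas. The sketch below is illustrative only: the autoreise entry mirrors the result shown above, while the other entries, the `LEXICON` name and the `lookup_lemma` function are invented placeholders rather than the site's actual data or code.

```python
from typing import Optional

# Only the "autoreise" entry reflects the result shown on this page.
# The remaining entries are hypothetical placeholders for illustration.
LEXICON = {
    "autoreise": "AUTOREISE",
    "exampel": "EXAMPLE",    # hypothetical typo entry
    "examples": "EXAMPLE",   # hypothetical inflected form
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the curated lemma for a word, or None if it is not covered."""
    return LEXICON.get(word.lower())


print(lookup_lemma("autoreise"))  # AUTOREISE
print(lookup_lemma("unlisted"))   # None
```

Because the lookup is a single dictionary access, it is very cheap compared with computing distances against a whole word list, at the cost of covering only the words someone has entered by hand.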