A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to address in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
address (0) - 143 freq
addresst (1) - 4 freq
addresg (1) - 1 freq
'address (1) - 1 freq
addressin (2) - 19 freq
addresses (2) - 18 freq
audreys (2) - 1 freq
redress (2) - 3 freq
andrews (2) - 45 freq
addressit (2) - 4 freq
adders (2) - 1 freq
addressed (2) - 38 freq
ardross (2) - 1 freq
duress (2) - 4 freq
adores (2) - 2 freq
addressan (2) - 1 freq
actress (2) - 24 freq
undress (2) - 3 freq
addressee (2) - 11 freq
dress (2) - 98 freq
taddies (3) - 1 freq
repress (3) - 6 freq
impress (3) - 23 freq
dores (3) - 1 freq
redness (3) - 1 freq
address (0) - 143 freq
'address (2) - 1 freq
addressee (2) - 11 freq
addresg (2) - 1 freq
addresst (2) - 4 freq
addresses (3) - 18 freq
duress (3) - 4 freq
addressan (3) - 1 freq
addressin (3) - 19 freq
undress (3) - 3 freq
ardross (3) - 1 freq
addressed (3) - 38 freq
redress (3) - 3 freq
dress (3) - 98 freq
addressit (3) - 4 freq
adders (3) - 1 freq
dross (4) - 18 freq
idders (4) - 40 freq
digress (4) - 3 freq
depress (4) - 1 freq
heiddress (4) - 1 freq
addressees (4) - 2 freq
addreesed (4) - 1 freq
dresse (4) - 1 freq
udders (4) - 4 freq
SoundEx code - A362
address - 143 freq
addressed - 38 freq
addressin - 19 freq
attracted - 6 freq
addresses - 18 freq
attraction - 17 freq
attractive - 11 freq
authors - 16 freq
aithers - 7 freq
aathor's - 1 freq
adders - 1 freq
attract - 7 freq
atrocities - 3 freq
authorised - 4 freq
attrection - 1 freq
'address - 1 freq
attrack - 4 freq
attractit - 6 freq
attracks - 1 freq
address'll - 1 freq
author's - 6 freq
attrac - 1 freq
addresst - 4 freq
attractin - 5 freq
adressin - 1 freq
attractions - 7 freq
addresg - 1 freq
attrakkit - 2 freq
atrack - 1 freq
attractiveness - 2 freq
adores - 2 freq
attracts - 2 freq
addressin' - 2 freq
attractin' - 1 freq
attractive' - 1 freq
addressees - 2 freq
addressan - 1 freq
adressed - 2 freq
attracktin - 1 freq
addressee - 11 freq
a-dressin - 2 freq
addreesed - 1 freq
attrackit - 1 freq
aid-wirkers - 1 freq
autoreise - 1 freq
attrak - 1 freq
addressing - 2 freq
atrocious - 3 freq
authorise - 1 freq
addressit - 4 freq
aathorised - 1 freq
attersome - 1 freq
attercap - 1 freq
authorship - 4 freq
€˜aduersitie - 1 freq
aduersitie - 1 freq
adreich - 1 freq
authorjla - 7 freq
authoricrats - 1 freq
audreyjarvis - 1 freq
addressinglife - 2 freq
'author's - 1 freq
athrockmorton - 3 freq
'authors - 2 freq
audreys - 1 freq
autoricht - 1 freq
MetaPhone code - ATRS
address - 143 freq
adders - 1 freq
'address - 1 freq
adores - 2 freq
addressee - 11 freq
autoreise - 1 freq
audreys - 1 freq
ADDRESS
Time to execute Levenshtein function - 0.209804 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.356518 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027718 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037047 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000908 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.