A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to attract in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
attract (0) - 7 freq
attrac (1) - 1 freq
attracts (1) - 2 freq
attrack (1) - 4 freq
attach (2) - 7 freq
extract (2) - 14 freq
cataract (2) - 5 freq
attrackit (2) - 1 freq
attrak (2) - 1 freq
abstract (2) - 16 freq
attracks (2) - 1 freq
attack (2) - 79 freq
astrict (2) - 3 freq
attracted (2) - 6 freq
tract (2) - 1 freq
attractin (2) - 5 freq
attacht (2) - 1 freq
atrack (2) - 1 freq
attractit (2) - 6 freq
detract (2) - 2 freq
matthat (3) - 4 freq
attracktin (3) - 1 freq
attent (3) - 1 freq
tact (3) - 8 freq
pattrent (3) - 1 freq
attract (0) - 7 freq
attracts (2) - 2 freq
attrack (2) - 4 freq
attrac (2) - 1 freq
attractin (3) - 5 freq
tract (3) - 1 freq
astrict (3) - 3 freq
attractit (3) - 6 freq
cataract (3) - 5 freq
detract (3) - 2 freq
extract (3) - 14 freq
attracted (3) - 6 freq
attrackit (3) - 1 freq
interact (4) - 9 freq
strict (4) - 16 freq
ettrick (4) - 10 freq
attraction (4) - 17 freq
attractive (4) - 11 freq
atrack (4) - 1 freq
attrak (4) - 1 freq
abstract (4) - 16 freq
attracks (4) - 1 freq
attack (4) - 79 freq
attacht (4) - 1 freq
attach (4) - 7 freq
SoundEx code - A362
address - 143 freq
addressed - 38 freq
addressin - 19 freq
attracted - 6 freq
addresses - 18 freq
attraction - 17 freq
attractive - 11 freq
authors - 16 freq
aithers - 7 freq
aathor's - 1 freq
adders - 1 freq
attract - 7 freq
atrocities - 3 freq
authorised - 4 freq
attrection - 1 freq
'address - 1 freq
attrack - 4 freq
attractit - 6 freq
attracks - 1 freq
address'll - 1 freq
author's - 6 freq
attrac - 1 freq
addresst - 4 freq
attractin - 5 freq
adressin - 1 freq
attractions - 7 freq
addresg - 1 freq
attrakkit - 2 freq
atrack - 1 freq
attractiveness - 2 freq
adores - 2 freq
attracts - 2 freq
addressin' - 2 freq
attractin' - 1 freq
attractive' - 1 freq
addressees - 2 freq
addressan - 1 freq
adressed - 2 freq
attracktin - 1 freq
addressee - 11 freq
a-dressin - 2 freq
addreesed - 1 freq
attrackit - 1 freq
aid-wirkers - 1 freq
autoreise - 1 freq
attrak - 1 freq
addressing - 2 freq
atrocious - 3 freq
authorise - 1 freq
addressit - 4 freq
aathorised - 1 freq
attersome - 1 freq
attercap - 1 freq
authorship - 4 freq
€˜aduersitie - 1 freq
aduersitie - 1 freq
adreich - 1 freq
authorjla - 7 freq
authoricrats - 1 freq
audreyjarvis - 1 freq
addressinglife - 2 freq
'author's - 1 freq
athrockmorton - 3 freq
'authors - 2 freq
audreys - 1 freq
autoricht - 1 freq
MetaPhone code - ATRKT
attract - 7 freq
attrakkit - 2 freq
attrackit - 1 freq
ATTRACT
Time to execute Levenshtein function - 0.180002 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.337337 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027484 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037346 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000961 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.