A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to attracks in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
attracks (0) - 1 freq
attracts (1) - 2 freq
attrack (1) - 4 freq
attacks (1) - 13 freq
attecks (2) - 1 freq
extracks (2) - 3 freq
tracks (2) - 49 freq
attract (2) - 7 freq
attrac (2) - 1 freq
attrackit (2) - 1 freq
atrack (2) - 1 freq
attrak (2) - 1 freq
attack (2) - 79 freq
cataracts (3) - 1 freq
straiks (3) - 11 freq
paitricks (3) - 7 freq
cattrick (3) - 1 freq
attractit (3) - 6 freq
astricts (3) - 1 freq
strack (3) - 9 freq
trucks (3) - 11 freq
mattrass (3) - 5 freq
attacked (3) - 13 freq
abstrack (3) - 4 freq
wracks (3) - 11 freq
attracks (0) - 1 freq
attacks (2) - 13 freq
attrack (2) - 4 freq
attracts (2) - 2 freq
attrackit (3) - 1 freq
tracks (3) - 49 freq
attecks (3) - 1 freq
extracks (3) - 3 freq
trucks (4) - 11 freq
trecks (4) - 1 freq
eattocks (4) - 1 freq
ettrick (4) - 10 freq
paitricks (4) - 7 freq
stricks (4) - 1 freq
tricks (4) - 39 freq
attrak (4) - 1 freq
attrac (4) - 1 freq
attract (4) - 7 freq
attack (4) - 79 freq
atrack (4) - 1 freq
etterick (5) - 1 freq
straks (5) - 1 freq
attackit (5) - 2 freq
tirricks (5) - 5 freq
extrack (5) - 2 freq
SoundEx code - A362
address - 143 freq
addressed - 38 freq
addressin - 19 freq
attracted - 6 freq
addresses - 18 freq
attraction - 17 freq
attractive - 11 freq
authors - 16 freq
aithers - 7 freq
aathor's - 1 freq
adders - 1 freq
attract - 7 freq
atrocities - 3 freq
authorised - 4 freq
attrection - 1 freq
'address - 1 freq
attrack - 4 freq
attractit - 6 freq
attracks - 1 freq
address'll - 1 freq
author's - 6 freq
attrac - 1 freq
addresst - 4 freq
attractin - 5 freq
adressin - 1 freq
attractions - 7 freq
addresg - 1 freq
attrakkit - 2 freq
atrack - 1 freq
attractiveness - 2 freq
adores - 2 freq
attracts - 2 freq
addressin' - 2 freq
attractin' - 1 freq
attractive' - 1 freq
addressees - 2 freq
addressan - 1 freq
adressed - 2 freq
attracktin - 1 freq
addressee - 11 freq
a-dressin - 2 freq
addreesed - 1 freq
attrackit - 1 freq
aid-wirkers - 1 freq
autoreise - 1 freq
attrak - 1 freq
addressing - 2 freq
atrocious - 3 freq
authorise - 1 freq
addressit - 4 freq
aathorised - 1 freq
attersome - 1 freq
attercap - 1 freq
authorship - 4 freq
€˜aduersitie - 1 freq
aduersitie - 1 freq
adreich - 1 freq
authorjla - 7 freq
authoricrats - 1 freq
audreyjarvis - 1 freq
addressinglife - 2 freq
'author's - 1 freq
athrockmorton - 3 freq
'authors - 2 freq
audreys - 1 freq
autoricht - 1 freq
MetaPhone code - ATRKS
attracks - 1 freq
ATTRACKS
Time to execute Levenshtein function - 0.291266 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.375585 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027864 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.038893 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001120 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.