A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

Similar words to aldersley-williams in Corpus

Methods: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated
Levenshtein (distance in brackets)
aldersley-williams (0) - 1 freq
ethwilliams (9) - 1 freq
peely-wallie (10) - 1 freq
alleluias (10) - 1 freq
double-bills (10) - 1 freq
ill-willie (10) - 4 freq
huwdwilliams (10) - 1 freq
duncanwilliam (10) - 1 freq
weirdy-willie (10) - 1 freq
alderslowe (10) - 1 freq
alleyways (10) - 1 freq
allwullie's (10) - 2 freq
williams (10) - 32 freq
concrete-pillart (11) - 1 freq
sleep-walking (11) - 1 freq
dirl-dirlin (11) - 1 freq
lesleyfletche (11) - 1 freq
andrewwilson (11) - 2 freq
neer-de-weels (11) - 2 freq
neer-dee-weels (11) - 1 freq
williams' (11) - 3 freq
tirli-wirli (11) - 3 freq
alsweill (11) - 1 freq
inverclydelibs (11) - 1 freq
caterpillars (11) - 2 freq
Double Levenshtein (distance in brackets)
aldersley-williams (0) - 1 freq
allwullie's (15) - 2 freq
alderslowe (15) - 1 freq
double-bills (15) - 1 freq
ill-willie (15) - 4 freq
ethwilliams (15) - 1 freq
eild-ills (16) - 1 freq
ill-will (16) - 6 freq
fare-ye-weills (16) - 1 freq
treacle-well (16) - 2 freq
treacle-well-eh (16) - 2 freq
williams (16) - 32 freq
huwdwilliams (16) - 1 freq
duncanwilliam (16) - 1 freq
weirdy-willie (16) - 1 freq
ill-willy (16) - 3 freq
peely-wallie (16) - 1 freq
peely-wally (17) - 19 freq
will-willan (17) - 1 freq
middle-class (17) - 11 freq
marseillaise (17) - 1 freq
marseilles (17) - 1 freq
water-lilies (17) - 1 freq
traicle-wall (17) - 1 freq
hollywills (17) - 2 freq
SoundEx code - A436
aulder - 245 freq
alternative - 46 freq
alternatively - 3 freq
aulder'n - 2 freq
altar - 23 freq
altered - 17 freq
alliteration - 5 freq
alternatives - 8 freq
aalder - 24 freq
auld-warld - 2 freq
'aulder - 1 freq
'alternative - 1 freq
alternate - 3 freq
alternately - 1 freq
althar - 5 freq
altars - 2 freq
aaltar - 2 freq
aalter - 1 freq
alder - 1 freq
alter - 15 freq
alterin - 2 freq
alt-arkaeolojist - 1 freq
althered - 4 freq
alther - 1 freq
aaltir - 1 freq
auldearn - 1 freq
alterations - 1 freq
aldersley-williams - 1 freq
alleiterative - 1 freq
altrive - 1 freq
auldrife - 1 freq
alteration - 1 freq
alliterative - 2 freq
altruism - 6 freq
aulder-anes - 2 freq
altruistic - 1 freq
aalternative - 1 freq
aleeteration - 1 freq
alleeteration - 1 freq
'altar' - 1 freq
alluterlie - 1 freq
alteran - 1 freq
auld-warldy - 1 freq
aaldwarld - 1 freq
alternatin - 1 freq
“alternatin - 1 freq
alternativet - 1 freq
alt-richters - 1 freq
auldearnbadger - 1 freq
alderslowe - 1 freq
alternatecelt - 1 freq
MetaPhone code - ALTRSLWL
ALDERSLEY-WILLIAMS
Time to execute Levenshtein function - 0.244561 milliseconds
The Levenshtein distance is the minimum number of single-character replacements, insertions or deletions needed to transform one word into another. It's useful for detecting typos and alternative spellings.
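The definition above can be sketched in a few lines of Python. This is a minimal illustration using the standard dynamic-programming approach, not the site's own code; the function name `levenshtein` is chosen here for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits turning a into b."""
    # Keep the shorter word in the inner loop so the rows stay small.
    if len(a) < len(b):
        a, b = b, a
    # previous[j] holds the distance between the first i-1 characters
    # of a and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # deleting all i characters of a gives the empty word
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (or keep it)
            ))
        previous = current
    return previous[-1]


if __name__ == "__main__":
    # Both values agree with the scores listed for these words.
    print(levenshtein("aldersley-williams", "ethwilliams"))  # 9
    print(levenshtein("aldersley-williams", "williams"))     # 10
```

Only two rows of the table are kept at a time, so memory use grows with the shorter word rather than with the product of both lengths.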
Time to execute Double Levenshtein function - 0.423043 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
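A rough sketch of the double-weighting idea follows. The page does not say exactly which characters count as vowels, so the set used here (a, e, i, o, u and y, with hyphens kept) is an assumption; it happens to reproduce the scores listed for the words checked below. The names `strip_vowels` and `double_levenshtein` are illustrative only.

```python
VOWELS = set("aeiouy")  # assumption: y is treated as a vowel


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance, computed row by row."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop vowels, keeping consonants and punctuation such as hyphens."""
    return "".join(c for c in word if c.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


if __name__ == "__main__":
    target = "aldersley-williams"
    for word in ("ethwilliams", "williams", "alderslowe"):
        print(word, double_levenshtein(target, word))
```

Because consonants are counted in both passes and vowels in only one, two spellings that differ mainly in their vowels score closer together than under the plain distance.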
Time to execute SoundEx function - 0.027983 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
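To show how different spellings end up with the same code, here is a minimal sketch of the classic (American) Soundex rules. It is not the site's implementation, and it makes one assumption: non-letter characters such as hyphens and apostrophes are simply ignored.

```python
# Consonant groups that share a Soundex digit.
_GROUPS = (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
           ("l", "4"), ("mn", "5"), ("r", "6"))
_CODES = {ch: digit for letters, digit in _GROUPS for ch in letters}


def soundex(word: str) -> str:
    """Return the four-character Soundex code, e.g. 'A436'."""
    # Assumption: hyphens, apostrophes and other non-letters are ignored.
    letters = [c for c in word.lower() if "a" <= c <= "z"]
    if not letters:
        return ""
    result = letters[0].upper()
    prev = _CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters with the same code
        digit = _CODES.get(ch, "")
        if digit and digit != prev:
            result += digit
            if len(result) == 4:
                break
        prev = digit  # vowels reset prev, so a repeated code counts again
    return result.ljust(4, "0")  # pad short codes with zeros


if __name__ == "__main__":
    for word in ("aldersley-williams", "aulder", "altar", "alliteration"):
        print(word, soundex(word))  # each prints the code A436
```

Only the first letter and the next three coded consonants matter, which is why words as different as aulder and alliteration share a code with aldersley-williams.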
Time to execute MetaPhone function - 0.037758 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000937 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.