A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to auld-warld in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
auld-warld (0) - 2 freq
auld-warldy (1) - 1 freq
aaldwarld (2) - 1 freq
auld-ails (3) - 1 freq
aff-warld (3) - 13 freq
aalwarld (3) - 3 freq
auld-maid (3) - 1 freq
auld-deid (4) - 1 freq
aukward (4) - 1 freq
auldearn (4) - 1 freq
auld-farrand (4) - 2 freq
cauld-kail (4) - 1 freq
aul-man (5) - 4 freq
award (5) - 59 freq
durward (5) - 2 freq
airn-hard (5) - 1 freq
auldhill (5) - 26 freq
wullyard (5) - 1 freq
aakward (5) - 7 freq
guldered (5) - 10 freq
hold-all (5) - 3 freq
awald (5) - 3 freq
cauld-watter (5) - 1 freq
tuim-wamed (5) - 1 freq
widd-wark (5) - 1 freq
auld-warld (0) - 2 freq
auld-warldy (1) - 1 freq
aaldwarld (3) - 1 freq
aalwarld (5) - 3 freq
aff-warld (5) - 13 freq
auld-maid (6) - 1 freq
auld-ails (6) - 1 freq
auld-farrand (7) - 2 freq
ill-waured (7) - 1 freq
auld-deid (7) - 1 freq
itherwarld (8) - 3 freq
ill-wulled (8) - 1 freq
ull-wull (8) - 2 freq
unnerwarld (8) - 4 freq
weel-wared (8) - 2 freq
edward (8) - 33 freq
quakeworld (8) - 2 freq
ae-word (8) - 1 freq
out-harled (8) - 1 freq
landward (8) - 6 freq
launward (8) - 1 freq
post-world (8) - 2 freq
auld-angles (8) - 3 freq
yammer-warld (8) - 1 freq
deid-cauld (8) - 1 freq
SoundEx code - A436
aulder - 245 freq
alternative - 46 freq
alternatively - 3 freq
aulder'n - 2 freq
altar - 23 freq
altered - 17 freq
alliteration - 5 freq
alternatives - 8 freq
aalder - 24 freq
auld-warld - 2 freq
'aulder - 1 freq
'alternative - 1 freq
alternate - 3 freq
alternately - 1 freq
althar - 5 freq
altars - 2 freq
aaltar - 2 freq
aalter - 1 freq
alder - 1 freq
alter - 15 freq
alterin - 2 freq
alt-arkaeolojist - 1 freq
althered - 4 freq
alther - 1 freq
aaltir - 1 freq
auldearn - 1 freq
alterations - 1 freq
aldersley-williams - 1 freq
alleiterative - 1 freq
altrive - 1 freq
auldrife - 1 freq
alteration - 1 freq
alliterative - 2 freq
altruism - 6 freq
aulder-anes - 2 freq
altruistic - 1 freq
aalternative - 1 freq
aleeteration - 1 freq
alleeteration - 1 freq
'altar' - 1 freq
alluterlie - 1 freq
alteran - 1 freq
auld-warldy - 1 freq
aaldwarld - 1 freq
alternatin - 1 freq
€œalternatin - 1 freq
alternativet - 1 freq
alt-richters - 1 freq
auldearnbadger - 1 freq
alderslowe - 1 freq
alternatecelt - 1 freq
MetaPhone code - ALTWRLT
auld-warld - 2 freq
auld-warldy - 1 freq
aaldwarld - 1 freq
AULD-WARLD
Time to execute Levenshtein function - 0.218246 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.354988 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028180 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037475 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000969 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.