A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to bbceducation in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
bbceducation (0) - 1 freq
eaceducation (2) - 17 freq
'education (3) - 2 freq
neducation (3) - 4 freq
re-education (3) - 3 freq
education (3) - 415 freq
educatioun (4) - 7 freq
abdication (4) - 1 freq
dedication (4) - 8 freq
seduction (4) - 2 freq
educatin (4) - 4 freq
edication (4) - 3 freq
benediction (4) - 4 freq
medication (4) - 8 freq
predication (4) - 4 freq
abduction (4) - 1 freq
reduction (4) - 3 freq
uofgeducation (4) - 2 freq
deductions (5) - 1 freq
abductin (5) - 1 freq
graduation (5) - 2 freq
steemulation (5) - 2 freq
meditation (5) - 12 freq
dedicatin (5) - 2 freq
indication (5) - 4 freq
bbceducation (0) - 1 freq
eaceducation (4) - 17 freq
abdication (6) - 1 freq
education (6) - 415 freq
benediction (6) - 4 freq
abduction (6) - 1 freq
neducation (6) - 4 freq
'education (6) - 2 freq
re-education (6) - 3 freq
uofgeducation (7) - 2 freq
abductin (7) - 1 freq
reduction (7) - 3 freq
educatioun (7) - 7 freq
predication (7) - 4 freq
seduction (7) - 2 freq
dedication (7) - 8 freq
edication (7) - 3 freq
educatin (7) - 4 freq
medication (7) - 8 freq
abbreviation (8) - 1 freq
bbccin (8) - 1 freq
benedictine (8) - 1 freq
vindication (8) - 4 freq
dedicatioun (8) - 1 freq
objection (8) - 3 freq
SoundEx code - B232
beasts - 144 freq
baskets - 8 freq
bissett's - 1 freq
buckets - 25 freq
besides - 35 freq
biscuits - 41 freq
'backstage - 1 freq
buchts - 6 freq
busts - 2 freq
beastie's - 3 freq
beasties - 51 freq
baist's - 4 freq
bisides - 1 freq
'beasts - 5 freq
bests - 2 freq
baests - 13 freq
begets - 4 freq
beastis - 1 freq
baisties - 3 freq
beest's - 1 freq
basket's - 1 freq
bestows - 1 freq
beast's - 7 freq
baists - 22 freq
best-sellin - 1 freq
bestkennt - 1 freq
best-kennt - 1 freq
backstage - 5 freq
bestest - 5 freq
boasts - 3 freq
beckwith's - 2 freq
bastes - 4 freq
bust's - 1 freq
bucket's - 1 freq
best-kent - 6 freq
bukkits - 1 freq
biests - 1 freq
beists - 1 freq
baesties - 1 freq
best-seller - 2 freq
bestseller - 1 freq
bouchts - 1 freq
boosts - 1 freq
'biscuits' - 1 freq
besyds - 2 freq
buists - 1 freq
bukkets - 1 freq
best-keepit - 1 freq
bouquets - 1 freq
backsides - 2 freq
€œbesides - 1 freq
€˜besides - 1 freq
basket-swords - 1 freq
bigots - 4 freq
bbcdouglasf - 1 freq
baists' - 1 freq
bbcthesocial - 10 freq
bbcscotcomms - 1 freq
besties - 1 freq
bizquits - 1 freq
bustage - 1 freq
bgstxyfwtn - 1 freq
bigotsureejits - 1 freq
biscuits' - 1 freq
bfkthsjgf - 1 freq
bycatch - 1 freq
bpqtg - 1 freq
bbcsouthscot - 8 freq
bzggedyk - 1 freq
bkotg - 1 freq
bbceducation - 1 freq
bestcanton - 1 freq
bbckitchencafe - 2 freq
bctgb - 1 freq
biscuitsgod - 1 freq
MetaPhone code - BSTKXN
bbceducation - 1 freq
BBCEDUCATION
Time to execute Levenshtein function - 0.444072 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.694633 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.055474 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037458 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000785 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.