A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to boost in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
boost (0) - 21 freq
noost (1) - 4 freq
loost (1) - 8 freq
boo't (1) - 2 freq
joost (1) - 230 freq
toost (1) - 1 freq
boot (1) - 112 freq
roost (1) - 41 freq
boos (1) - 3 freq
yoost (1) - 1 freq
boose (1) - 2 freq
boolt (1) - 1 freq
coost (1) - 1 freq
boosts (1) - 1 freq
foost (1) - 5 freq
boast (1) - 7 freq
bookt (1) - 3 freq
bos (2) - 3 freq
bros (2) - 1 freq
mosst (2) - 53 freq
hosst (2) - 1 freq
moosh (2) - 2 freq
booze (2) - 33 freq
'lost (2) - 1 freq
booin (2) - 17 freq
boost (0) - 21 freq
boast (1) - 7 freq
buist (2) - 5 freq
bookt (2) - 3 freq
beest (2) - 11 freq
bust (2) - 10 freq
best (2) - 1613 freq
beast (2) - 141 freq
baest (2) - 27 freq
biest (2) - 1 freq
bst (2) - 2 freq
beist (2) - 6 freq
baist (2) - 80 freq
foost (2) - 5 freq
joost (2) - 230 freq
toost (2) - 1 freq
boo't (2) - 2 freq
loost (2) - 8 freq
noost (2) - 4 freq
boosts (2) - 1 freq
boot (2) - 112 freq
boolt (2) - 1 freq
coost (2) - 1 freq
boose (2) - 2 freq
yoost (2) - 1 freq
SoundEx code - B230
bizzed - 3 freq
basket - 62 freq
beast - 141 freq
beekit - 2 freq
biscuit - 36 freq
best - 1613 freq
baked - 28 freq
buskit - 33 freq
bakst - 1 freq
bocht - 219 freq
bashed - 8 freq
begood - 11 freq
biggit - 245 freq
begged - 30 freq
beast- - 1 freq
bossed - 6 freq
'best - 4 freq
buzzed - 1 freq
booked - 31 freq
beside - 63 freq
biscuity - 1 freq
bucket - 75 freq
based - 84 freq
begoud - 104 freq
beastie - 38 freq
bust - 10 freq
bought - 83 freq
backed - 27 freq
boakit - 4 freq
big-ee'd - 1 freq
boked - 2 freq
b'goad - 1 freq
biased - 4 freq
baist - 80 freq
beukit - 3 freq
baest - 27 freq
backside - 26 freq
begat - 2 freq
busked - 5 freq
boucht - 5 freq
backseat - 6 freq
buist - 5 freq
baukit - 12 freq
begot - 1 freq
bucht - 4 freq
bochte - 1 freq
beset - 4 freq
busied - 1 freq
beast' - 2 freq
bakt - 1 freq
bouquet - 3 freq
bakside - 2 freq
baisket - 1 freq
beachit - 6 freq
baaket - 1 freq
beestie - 5 freq
beest - 11 freq
boxed - 7 freq
boacht - 1 freq
bestow - 1 freq
bussed - 3 freq
beached - 9 freq
boost - 21 freq
buget - 1 freq
boughtie - 1 freq
bigged - 30 freq
bushido - 1 freq
best' - 2 freq
bassett - 1 freq
bagged - 4 freq
backchat - 4 freq
'baked - 1 freq
boast - 7 freq
bookit - 3 freq
busty - 1 freq
bexit - 1 freq
baised - 3 freq
backid - 1 freq
baeside - 1 freq
boxt - 1 freq
baakt - 1 freq
backit - 8 freq
baistie - 3 freq
buckhead - 2 freq
besta - 2 freq
beaked - 1 freq
behest - 2 freq
bakit - 8 freq
bst - 2 freq
beist - 6 freq
backeth - 1 freq
boakt - 1 freq
backt - 1 freq
bouquet' - 1 freq
big-shot - 2 freq
backhaud - 1 freq
back-seat - 1 freq
bockid - 1 freq
boaked - 3 freq
beskit - 3 freq
becked - 1 freq
'biggit - 1 freq
boukit - 5 freq
bakkit - 2 freq
baste - 8 freq
biggid - 4 freq
buskid - 1 freq
bosied - 7 freq
begude - 2 freq
busta - 6 freq
bukksed - 1 freq
boght - 2 freq
beget - 2 freq
baguette - 2 freq
bow-hoched - 1 freq
bigot - 4 freq
bicht - 2 freq
begoot - 2 freq
bogged - 3 freq
begouth - 2 freq
biest - 1 freq
baessed - 5 freq
biwast - 1 freq
bisooth - 1 freq
baggit - 4 freq
baestie - 1 freq
bisset - 4 freq
biegt - 1 freq
bash't - 1 freq
backet - 8 freq
bycht - 1 freq
bigget - 1 freq
bigg't - 1 freq
beckett - 3 freq
€˜best - 1 freq
besty - 2 freq
biked - 1 freq
bekked - 1 freq
bauxyte - 1 freq
bekeit - 1 freq
beistie - 1 freq
bouchtie - 1 freq
bekkit - 1 freq
back-sate - 1 freq
€œbest - 4 freq
bookt - 3 freq
biskit - 1 freq
bisto - 1 freq
baikit - 1 freq
baesed - 1 freq
€˜beukit - 1 freq
€˜booked - 1 freq
bestowe - 1 freq
buckt - 1 freq
bikit - 1 freq
boggit - 1 freq
beestee - 1 freq
back-chat - 1 freq
bwisd - 1 freq
bgt - 1 freq
bbcqt - 1 freq
boycott - 2 freq
boxset - 2 freq
bbsit - 1 freq
bbctwo - 1 freq
biscotti - 1 freq
bissett - 3 freq
bsqt - 1 freq
bag'd - 1 freq
MetaPhone code - BST
bizzed - 3 freq
beast - 141 freq
best - 1613 freq
beast- - 1 freq
bossed - 6 freq
'best - 4 freq
buzzed - 1 freq
beside - 63 freq
based - 84 freq
beastie - 38 freq
bust - 10 freq
biased - 4 freq
baist - 80 freq
baest - 27 freq
buist - 5 freq
beset - 4 freq
busied - 1 freq
beast' - 2 freq
beestie - 5 freq
beest - 11 freq
bestow - 1 freq
bussed - 3 freq
boost - 21 freq
best' - 2 freq
bassett - 1 freq
boast - 7 freq
busty - 1 freq
baised - 3 freq
baeside - 1 freq
baistie - 3 freq
besta - 2 freq
bst - 2 freq
beist - 6 freq
baste - 8 freq
bosied - 7 freq
busta - 6 freq
biest - 1 freq
baessed - 5 freq
baestie - 1 freq
bisset - 4 freq
€˜best - 1 freq
besty - 2 freq
beistie - 1 freq
€œbest - 4 freq
bisto - 1 freq
baesed - 1 freq
beestee - 1 freq
bbsit - 1 freq
wwbst - 1 freq
bissett - 3 freq
BOOST
Time to execute Levenshtein function - 0.214242 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.771175 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.067674 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.043824 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001131 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.