A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

Similar words to bailiff in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
bailiff (0) - 5 freq
bailiffs (1) - 1 freq
bailie (2) - 6 freq
bailies (2) - 2 freq
bailin (2) - 3 freq
gliff (3) - 35 freq
kailwife (3) - 2 freq
baillie (3) - 17 freq
hailit (3) - 1 freq
failins (3) - 1 freq
bluff (3) - 4 freq
pailin (3) - 8 freq
sailin (3) - 37 freq
baalin (3) - 3 freq
railing (3) - 6 freq
karloff (3) - 2 freq
ailins (3) - 1 freq
beilin (3) - 3 freq
tailie (3) - 7 freq
bali (3) - 1 freq
bawlin (3) - 19 freq
bailey (3) - 6 freq
cardiff (3) - 1 freq
dailin (3) - 1 freq
railins (3) - 22 freq
Double Levenshtein
bailiff (0) - 5 freq
bailiffs (2) - 1 freq
beleiff (3) - 1 freq
blaff (3) - 2 freq
bluff (3) - 4 freq
balefu (4) - 2 freq
bailies (4) - 2 freq
bailie (4) - 6 freq
liff (4) - 2 freq
balf (4) - 1 freq
banff (4) - 45 freq
gliff (4) - 35 freq
belief (4) - 60 freq
cliff (4) - 27 freq
baff (4) - 3 freq
bailin (4) - 3 freq
bailie's (5) - 1 freq
baffe (5) - 2 freq
bellyfu (5) - 1 freq
baling (5) - 1 freq
bailliol (5) - 1 freq
belf (5) - 3 freq
cliffy (5) - 1 freq
bayliss (5) - 1 freq
flaff (5) - 5 freq
SoundEx code - B410
believe - 596 freq
bellpou - 1 freq
belyve - 33 freq
bluff - 4 freq
'believe - 3 freq
bulb - 10 freq
belief - 60 freq
blip - 4 freq
blyave - 2 freq
blab - 3 freq
bileeve - 4 freq
bileev - 5 freq
blub - 1 freq
behalf - 18 freq
belive - 2 freq
blype - 1 freq
beleive - 1 freq
baelieve - 6 freq
bealieve - 1 freq
bailiff - 5 freq
blob - 4 freq
belly-up - 2 freq
boolfu - 1 freq
beleiff - 1 freq
bleep - 1 freq
bolivia - 2 freq
blaff - 2 freq
bulfie - 1 freq
bellyfou - 1 freq
balefu - 2 freq
beleve - 6 freq
“beleve - 1 freq
bellyfu - 1 freq
boleivia - 1 freq
bowlfae - 1 freq
bowelfae - 1 freq
“believe - 2 freq
blow-up - 1 freq
balboa - 1 freq
balf - 1 freq
bilbao - 3 freq
ballboy - 2 freq
baleev - 1 freq
bowlfu - 1 freq
beleeve - 5 freq
belf - 3 freq
MetaPhone code - BLF
believe - 596 freq
belyve - 33 freq
bluff - 4 freq
'believe - 3 freq
belief - 60 freq
bileeve - 4 freq
bileev - 5 freq
belive - 2 freq
beleive - 1 freq
baelieve - 6 freq
bealieve - 1 freq
bailiff - 5 freq
boolfu - 1 freq
beleiff - 1 freq
bolivia - 2 freq
blaff - 2 freq
bulfie - 1 freq
bellyfou - 1 freq
balefu - 2 freq
beleve - 6 freq
“beleve - 1 freq
bellyfu - 1 freq
boleivia - 1 freq
bowlfae - 1 freq
“believe - 2 freq
balf - 1 freq
baleev - 1 freq
bowlfu - 1 freq
beleeve - 5 freq
belf - 3 freq
Manually curated - BAILIFF
Time to execute Levenshtein function - 0.306501 milliseconds
The Levenshtein distance is the number of single-character replacements, insertions or deletions needed to transform one word into another. It is useful for detecting typos and alternative spellings.
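As an illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming calculation. It is not the code this site runs; the function name `levenshtein` is chosen here for the example only.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character insertions, deletions or replacements."""
    # previous[j] = distance between the processed prefix of a and the first j chars of b
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]  # turning i characters of a into an empty string takes i deletions
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # delete ch_a
                current[j - 1] + 1,      # insert ch_b
                previous[j - 1] + cost,  # replace ch_a with ch_b, or keep a match
            ))
        previous = current
    return previous[-1]


print(levenshtein("bailiff", "bailiffs"))  # 1, as in the listing
print(levenshtein("bailiff", "bailie"))    # 2, as in the listing
```

Only two rows of the table are kept at a time, so memory use grows with the length of one word rather than with the product of both lengths.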
Time to execute Double Levenshtein function - 0.524875 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the whole words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
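A rough Python sketch of that idea follows. The helper names are made up for this example, and the vowel set is an assumption: the page does not say which letters are dropped. Treating y as a vowel is a guess that happens to reproduce the listed score of 5 for bayliss.

```python
def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance (insert, delete, replace each cost 1)."""
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


# Assumption: these are the letters removed for the second pass.
VOWELS = set("aeiouy")


def strip_vowels(word: str) -> str:
    """Drop every vowel, keeping consonants and any other characters."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the whole words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("bailiff", "bailiffs"))  # 1 + 1 = 2
print(double_levenshtein("bailiff", "bailie"))    # 2 + 2 = 4
```

A consonant change is counted in both passes and a vowel change in only one, which is why bailie moves further from bailiff here than under the plain distance.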
Time to execute SoundEx function - 0.064580 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
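To show how a code such as B410 comes about, here is a small Python sketch of the commonly described American Soundex rules. It is an illustration only and may differ in edge cases from whatever implementation the site calls; the names `SOUNDEX_CODES` and `soundex` are chosen for this example.

```python
# Consonant groups and their digits; vowels, h, w and y carry no digit.
SOUNDEX_CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for letter in letters:
        SOUNDEX_CODES[letter] = digit


def soundex(word: str) -> str:
    """Return the first letter plus up to three digits, padded with zeros."""
    letters = [ch for ch in word.lower() if ch.isalpha()]
    if not letters:
        return ""
    code = letters[0].upper()
    last = SOUNDEX_CODES.get(letters[0], "")  # the first letter's digit also blocks repeats
    for ch in letters[1:]:
        digit = SOUNDEX_CODES.get(ch, "")
        if digit:
            if digit != last:      # adjacent letters with the same digit count once
                code += digit
            last = digit
        elif ch not in "hw":       # a vowel separates repeats; h and w do not
            last = ""
        if len(code) == 4:
            break
    return code.ljust(4, "0")


print(soundex("bailiff"))  # B410
print(soundex("believe"))  # B410
```

Because only the first letter and the consonant classes survive, words as different as bailiff, believe and behalf end up under the same code.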
Time to execute MetaPhone function - 0.039460 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.000833 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
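A lookup of this kind can be sketched as a plain dictionary. The table below is hypothetical: only the bailiff to BAILIFF link appears in the results above, and the other entry and the names `LEMMA_LEXICON` and `lookup_lemma` are placeholders for illustration.

```python
from typing import Optional

# Hypothetical hand-made lexicon: surface form -> lemma.
# Only "bailiff" -> "BAILIFF" is taken from the page; the rest is a placeholder.
LEMMA_LEXICON = {
    "bailiff": "BAILIFF",
    "bailiffs": "BAILIFF",  # assumed entry, for illustration only
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the curated lemma, or None when the word is not covered."""
    return LEMMA_LEXICON.get(word.lower())


print(lookup_lemma("bailiff"))  # BAILIFF
print(lookup_lemma("sonsie"))   # None, because this sketch has no entry for it
```

A dictionary lookup does no string comparison across the corpus, which fits this method having the shortest execution time of the five.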