A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to boastin in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
boastin (0) - 3 freq
biastin (1) - 2 freq
toastin (1) - 2 freq
bastin (1) - 1 freq
blastin (1) - 11 freq
hoastin (1) - 18 freq
coastin (1) - 1 freq
boastit (1) - 1 freq
boastin' (1) - 1 freq
roastin (1) - 23 freq
boatin (1) - 1 freq
baskin (2) - 10 freq
beastie (2) - 38 freq
roostin (2) - 5 freq
beastis (2) - 1 freq
hoastit (2) - 11 freq
baitin (2) - 1 freq
roustin (2) - 1 freq
soartin (2) - 5 freq
postin (2) - 21 freq
boasted (2) - 1 freq
boattie (2) - 1 freq
hoastun (2) - 1 freq
poatin (2) - 1 freq
oustin (2) - 1 freq
boastin (0) - 3 freq
bastin (1) - 1 freq
biastin (1) - 2 freq
roastin (2) - 23 freq
boatin (2) - 1 freq
bastion (2) - 2 freq
boston (2) - 2 freq
boastin' (2) - 1 freq
buistin (2) - 1 freq
blastin (2) - 11 freq
hoastin (2) - 18 freq
boastit (2) - 1 freq
coastin (2) - 1 freq
toastin (2) - 2 freq
boltin (3) - 3 freq
basin (3) - 22 freq
boast (3) - 7 freq
castin (3) - 48 freq
bloatin (3) - 1 freq
pastin (3) - 2 freq
baetin (3) - 4 freq
lastin (3) - 9 freq
hostin (3) - 16 freq
bustan (3) - 1 freq
hoistin (3) - 2 freq
SoundEx code - B235
bogton - 1 freq
boston - 2 freq
boastin' - 1 freq
buchtin - 1 freq
biastin - 2 freq
boastin - 3 freq
big-time - 1 freq
bogie-eatin - 1 freq
bastions - 1 freq
besotten - 1 freq
bastion - 2 freq
bak-hauddin - 1 freq
bustan - 1 freq
backtaen - 1 freq
bog-cotton - 3 freq
bake-doon - 1 freq
bucketin - 2 freq
bakhtin's - 1 freq
bigton - 1 freq
bystaunner - 1 freq
besettin - 1 freq
besetting - 1 freq
bigtime - 1 freq
bystander - 1 freq
bastin - 1 freq
buistin - 1 freq
bystanders - 1 freq
bog-standard - 1 freq
bystauner - 1 freq
bxsednlsgb - 1 freq
bbcscotnine - 4 freq
bbsitting - 1 freq
bbsittin - 1 freq
boycotting - 1 freq
bbcdn - 1 freq
bbcdomc - 2 freq
bucktonbirder - 1 freq
bigtonfarm - 1 freq
bhoysscouting - 1 freq
bigthunderlips - 1 freq
bestiememories - 1 freq
bequietmedia - 2 freq
backtaenormailty - 1 freq
MetaPhone code - BSTN
boston - 2 freq
boastin' - 1 freq
biastin - 2 freq
boastin - 3 freq
besotten - 1 freq
bustan - 1 freq
besettin - 1 freq
bastin - 1 freq
buistin - 1 freq
bbsittin - 1 freq
BOASTIN
Time to execute Levenshtein function - 0.343097 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.453117 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.031532 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.039119 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000940 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.