A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to buistin in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
buistin (0) - 1 freq
ruistin (1) - 1 freq
cuistin (1) - 3 freq
burstin (1) - 43 freq
twistin (2) - 14 freq
bustan (2) - 1 freq
buntin (2) - 4 freq
uisin (2) - 105 freq
buist (2) - 5 freq
bursten (2) - 1 freq
ruisin (2) - 2 freq
buttin (2) - 6 freq
bristlin (2) - 3 freq
heistin (2) - 1 freq
reistin (2) - 1 freq
caistin (2) - 1 freq
bruistit (2) - 1 freq
fistin (2) - 1 freq
luistit (2) - 1 freq
burstin' (2) - 5 freq
buidin (2) - 1 freq
ruistit (2) - 1 freq
suitin (2) - 1 freq
baistie (2) - 3 freq
ristin (2) - 3 freq
buistin (0) - 1 freq
bustan (2) - 1 freq
bastin (2) - 1 freq
boastin (2) - 3 freq
biastin (2) - 2 freq
burstin (2) - 43 freq
cuistin (2) - 3 freq
ruistin (2) - 1 freq
bastion (3) - 2 freq
deistin (3) - 3 freq
boston (3) - 2 freq
brustin (3) - 5 freq
listin (3) - 4 freq
oustin (3) - 1 freq
bustit (3) - 2 freq
austin (3) - 5 freq
cuisten (3) - 2 freq
blastin (3) - 11 freq
dustin (3) - 12 freq
tistin (3) - 1 freq
kistin (3) - 3 freq
beistie (3) - 1 freq
buists (3) - 1 freq
lustin (3) - 1 freq
buskin (3) - 11 freq
SoundEx code - B235
bogton - 1 freq
boston - 2 freq
boastin' - 1 freq
buchtin - 1 freq
biastin - 2 freq
boastin - 3 freq
big-time - 1 freq
bogie-eatin - 1 freq
bastions - 1 freq
besotten - 1 freq
bastion - 2 freq
bak-hauddin - 1 freq
bustan - 1 freq
backtaen - 1 freq
bog-cotton - 3 freq
bake-doon - 1 freq
bucketin - 2 freq
bakhtin's - 1 freq
bigton - 1 freq
bystaunner - 1 freq
besettin - 1 freq
besetting - 1 freq
bigtime - 1 freq
bystander - 1 freq
bastin - 1 freq
buistin - 1 freq
bystanders - 1 freq
bog-standard - 1 freq
bystauner - 1 freq
bxsednlsgb - 1 freq
bbcscotnine - 4 freq
bbsitting - 1 freq
bbsittin - 1 freq
boycotting - 1 freq
bbcdn - 1 freq
bbcdomc - 2 freq
bucktonbirder - 1 freq
bigtonfarm - 1 freq
bhoysscouting - 1 freq
bigthunderlips - 1 freq
bestiememories - 1 freq
bequietmedia - 2 freq
backtaenormailty - 1 freq
MetaPhone code - BSTN
boston - 2 freq
boastin' - 1 freq
biastin - 2 freq
boastin - 3 freq
besotten - 1 freq
bustan - 1 freq
besettin - 1 freq
bastin - 1 freq
buistin - 1 freq
bbsittin - 1 freq
BUISTIN
Time to execute Levenshtein function - 0.222009 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.400968 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.060388 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.048977 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000970 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.