A Corpus of 21st Century Scots Texts


Levenshtein Distance



Similar words to bastion in Corpus

Methods: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated
Levenshtein (distance in parentheses)
bastion (0) - 2 freq
bastin (1) - 1 freq
bastions (1) - 1 freq
baetin (2) - 4 freq
passion (2) - 75 freq
baitin (2) - 1 freq
wastin (2) - 32 freq
bastils (2) - 1 freq
pastin (2) - 2 freq
boastin (2) - 2 freq
barton (2) - 3 freq
aston (2) - 4 freq
blastin (2) - 9 freq
faction (2) - 3 freq
ration (2) - 4 freq
fashion (2) - 71 freq
caption (2) - 1 freq
baskin (2) - 9 freq
basin (2) - 22 freq
naition (2) - 1 freq
bash-on (2) - 1 freq
baton (2) - 3 freq
fastin (2) - 14 freq
battin (2) - 1 freq
quastion (2) - 2 freq
Double Levenshtein (distance in parentheses)
bastion (0) - 2 freq
bastin (1) - 1 freq
boastin (2) - 2 freq
boston (2) - 2 freq
bastions (2) - 1 freq
lastin (3) - 9 freq
castin (3) - 48 freq
battin (3) - 1 freq
baton (3) - 3 freq
fastin (3) - 14 freq
bastarn (3) - 16 freq
bashin (3) - 7 freq
bastern (3) - 2 freq
buistin (3) - 1 freq
easton (3) - 3 freq
tastin (3) - 13 freq
batin (3) - 4 freq
basin (3) - 22 freq
quastion (3) - 2 freq
baitin (3) - 1 freq
aston (3) - 4 freq
barton (3) - 3 freq
pastin (3) - 2 freq
blastin (3) - 9 freq
wastin (3) - 32 freq
SoundEx code - B235
bogton - 1 freq
boston - 2 freq
boastin' - 1 freq
buchtin - 1 freq
big-time - 1 freq
bogie-eatin - 1 freq
bastions - 1 freq
besotten - 1 freq
bastion - 2 freq
bak-hauddin - 1 freq
bustan - 1 freq
backtaen - 1 freq
bog-cotton - 3 freq
bake-doon - 1 freq
bucketin - 2 freq
bakhtin's - 1 freq
bigton - 1 freq
bystaunner - 1 freq
besettin - 1 freq
besetting - 1 freq
bigtime - 1 freq
bystander - 1 freq
bastin - 1 freq
buistin - 1 freq
bystanders - 1 freq
bog-standard - 1 freq
boastin - 2 freq
bystauner - 1 freq
bxsednlsgb - 1 freq
bbcscotnine - 4 freq
bbsitting - 1 freq
bbsittin - 1 freq
boycotting - 1 freq
bbcdn - 1 freq
bbcdomc - 2 freq
bucktonbirder - 1 freq
bigtonfarm - 1 freq
bhoysscouting - 1 freq
bigthunderlips - 1 freq
bestiememories - 1 freq
bequietmedia - 2 freq
backtaenormailty - 1 freq
MetaPhone code - BSXN
beseechin - 2 freq
bastion - 2 freq
Manually curated lemma - BASTION
Time to execute Levenshtein function - 0.363887 milliseconds
The Levenshtein distance is the minimum number of single-character replacements, insertions or deletions needed to transform one word into another. It's useful for detecting typos and alternative spellings.
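As a rough illustration of the definition above, the distance can be computed with a standard dynamic-programming table. This is a minimal sketch in Python, not the code this page runs; the function name `levenshtein` and the unit cost for every edit are assumptions.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character insertions, deletions or replacements."""
    # Keep the shorter word in the inner loop so each row stays small.
    if len(a) < len(b):
        a, b = b, a
    # previous[j] holds the distance between the processed prefix of a and b[:j].
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (free if equal)
            ))
        previous = current
    return previous[-1]


if __name__ == "__main__":
    for word in ("bastion", "bastin", "bastions", "passion"):
        print(word, levenshtein("bastion", word))
```

Run against the query word, this sketch gives the same distances as the list above for these four words: 0, 1, 1 and 2.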
Time to execute Double Levenshtein function - 0.460826 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
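The idea can be sketched in a few lines of Python. This is an illustration under stated assumptions rather than the page's own implementation: the vowel set is assumed to be `a e i o u` only, and the names `levenshtein`, `strip_vowels` and `double_levenshtein` are hypothetical.

```python
VOWELS = set("aeiou")  # assumption: the page may treat other letters as vowels too


def levenshtein(a: str, b: str) -> int:
    """Plain edit distance with unit costs (row-by-row dynamic programming)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + (ca != cb)))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop vowels so only the consonant skeleton (and any punctuation) remains."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the vowel-free words."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


if __name__ == "__main__":
    for word in ("bastin", "boston", "bastions", "lastin"):
        print(word, double_levenshtein("bastion", word))
```

Under these assumptions, "boston" scores 2 (two edits on the full word, none on the shared skeleton "bstn"), while "lastin" scores 3 because the changed consonant is counted in both passes.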
Time to execute SoundEx function - 0.030665 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
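To show how a word ends up with a code such as B235, here is a minimal Python sketch of the commonly described American Soundex rules. It is an assumption that the page follows exactly these rules; the function name `soundex` and the choice to skip non-letters such as hyphens are illustrative.

```python
# Map each consonant to its Soundex digit; vowels, y, h and w have no digit.
CODES = {}
for digit, letters in {"1": "bfpv", "2": "cgjkqsxz", "3": "dt",
                       "4": "l", "5": "mn", "6": "r"}.items():
    for letter in letters:
        CODES[letter] = digit


def soundex(word: str) -> str:
    """Return the first letter plus three digits, padded with zeros."""
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]  # skip hyphens etc.
    if not letters:
        return ""
    result = letters[0].upper()
    previous = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters that share a digit
        code = CODES.get(ch, "")
        if code and code != previous:
            result += code
            if len(result) == 4:
                break
        previous = code  # a vowel resets this, so a repeated digit can recur
    return result.ljust(4, "0")


if __name__ == "__main__":
    for word in ("bastion", "boston", "big-time", "bystander"):
        print(word, soundex(word))
```

All four example words share the code B235 in this sketch, which is why such different-looking words appear together in the SoundEx list.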
Time to execute MetaPhone function - 0.041009 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000912 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
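A lookup of this kind amounts to a dictionary from word forms to lemmas. The Python sketch below is illustrative only: the table name `LEXICON`, the function `lookup_lemma` and the "bastions" entry are hypothetical, and only the mapping of "bastion" to BASTION is taken from the results above.

```python
from typing import Optional

# Hypothetical hand-built lexicon: word form -> lemma.
# Only "bastion" -> "BASTION" comes from the results shown on this page.
LEXICON = {
    "bastion": "BASTION",
    "bastions": "BASTION",  # assumed entry, for illustration
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the curated lemma for a word, or None if the word is not covered."""
    return LEXICON.get(word.lower())


if __name__ == "__main__":
    print(lookup_lemma("bastion"))  # BASTION
    print(lookup_lemma("sonsie"))   # None: not in this toy table
```

A plain dictionary lookup does no computation on the word itself, which is consistent with this method having the shortest execution time of the five.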