A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to blog in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
blog (0) - 38 freq
blok (1) - 1 freq
blot (1) - 4 freq
log (1) - 36 freq
glog (1) - 3 freq
blag (1) - 1 freq
clog (1) - 2 freq
blob (1) - 4 freq
flog (1) - 1 freq
blogs (1) - 9 freq
blo (1) - 2 freq
bloc (1) - 6 freq
'log (1) - 1 freq
vlog (1) - 1 freq
bog (1) - 54 freq
blow (1) - 48 freq
bloo (1) - 11 freq
brom (2) - 1 freq
zlo (2) - 1 freq
bod (2) - 2 freq
bob (2) - 213 freq
clos (2) - 14 freq
blyd (2) - 1 freq
brok (2) - 6 freq
blood (2) - 134 freq
blog (0) - 38 freq
blag (1) - 1 freq
'log (2) - 1 freq
blo (2) - 2 freq
bloc (2) - 6 freq
vlog (2) - 1 freq
blow (2) - 48 freq
bog (2) - 54 freq
blogs (2) - 9 freq
bloo (2) - 11 freq
blot (2) - 4 freq
flog (2) - 1 freq
blok (2) - 1 freq
log (2) - 36 freq
blob (2) - 4 freq
glog (2) - 3 freq
clog (2) - 2 freq
lug (3) - 297 freq
brag (3) - 9 freq
gloag (3) - 8 freq
slag (3) - 16 freq
below (3) - 84 freq
gleg (3) - 127 freq
logo (3) - 19 freq
burg (3) - 1 freq
SoundEx code - B420
'black - 4 freq
black - 734 freq
balls - 19 freq
blocks - 17 freq
bleck - 179 freq
bellows - 6 freq
bleeze - 46 freq
blaws - 54 freq
bells - 66 freq
bellies - 21 freq
bill's - 22 freq
-bill's - 2 freq
blaik - 80 freq
bliss - 59 freq
bell's - 6 freq
belike - 5 freq
bowls - 20 freq
bleach - 13 freq
billies - 14 freq
blaas - 3 freq
'belike - 3 freq
block - 55 freq
bullies - 4 freq
blues - 24 freq
blouse - 18 freq
blows - 12 freq
bulls - 10 freq
belly's - 7 freq
blackie - 21 freq
bill-wha's - 1 freq
bleak - 24 freq
bless - 42 freq
bowels - 9 freq
bull's - 2 freq
bolshoi' - 1 freq
bella's - 5 freq
bloke - 24 freq
billie's - 1 freq
blush - 10 freq
blaes - 4 freq
blek - 10 freq
bawls - 5 freq
bools - 23 freq
blaik's - 1 freq
bills - 18 freq
blacks - 3 freq
baileys - 3 freq
belle's - 1 freq
bulk - 6 freq
billy's - 11 freq
blaze - 11 freq
blak - 85 freq
biles - 14 freq
bleckie - 1 freq
black's - 3 freq
blaak - 4 freq
bayl's - 1 freq
bail's - 1 freq
blaw's - 1 freq
bullock's - 1 freq
ballsae - 1 freq
bloack - 4 freq
boils - 4 freq
bollocks - 7 freq
bloacks - 1 freq
bullseye - 2 freq
bollick - 2 freq
bileq - 1 freq
belloch - 3 freq
bales - 17 freq
blaa's - 1 freq
blak' - 1 freq
boolies - 1 freq
bulge - 4 freq
bloggs - 1 freq
bouls - 2 freq
balgay - 1 freq
baillies - 3 freq
biology - 11 freq
bla'k - 1 freq
buhls - 4 freq
blashy - 6 freq
blogs - 9 freq
bolsa' - 1 freq
blois - 5 freq
blois' - 1 freq
bealach - 1 freq
belch - 1 freq
blok - 1 freq
bolas - 4 freq
bleusk - 1 freq
billows - 1 freq
bloose - 5 freq
blakk - 2 freq
blugga - 3 freq
baals - 2 freq
bluish - 6 freq
blue's - 3 freq
blecks - 3 freq
baloos - 1 freq
blocs - 2 freq
bill-heuk - 1 freq
bolshie - 2 freq
buhl's - 1 freq
boolik - 1 freq
blæc - 1 freq
blacc- - 1 freq
black' - 1 freq
bluisk - 1 freq
byles - 5 freq
blowsy - 1 freq
blash - 1 freq
blake - 8 freq
belisha - 1 freq
balsa - 1 freq
€˜black - 3 freq
bellas - 1 freq
bailie's - 1 freq
bull's-ee - 1 freq
bullock - 3 freq
bleize - 1 freq
bailies - 2 freq
blog - 38 freq
€˜blues - 1 freq
billys - 1 freq
blaick - 2 freq
baulk - 1 freq
bells' - 1 freq
bullish - 1 freq
bull's-eye - 1 freq
belies - 1 freq
blag - 1 freq
by-whyles - 10 freq
buls - 1 freq
blowze - 1 freq
blaise - 1 freq
€œballs - 1 freq
blaickie - 2 freq
bilge - 1 freq
boles - 1 freq
blasé - 1 freq
bloc - 6 freq
blackhaw - 1 freq
blis - 1 freq
bullocks - 2 freq
beals - 1 freq
belles - 1 freq
bellyÂ’s - 1 freq
bleck' - 1 freq
bulky - 1 freq
ballys - 1 freq
bellackeee - 1 freq
bollox - 2 freq
belic - 1 freq
bols - 1 freq
blck - 1 freq
bleuch - 2 freq
bluesey - 1 freq
“black - 1 freq
blk - 1 freq
bwlx - 1 freq
bellys - 2 freq
bayliss - 1 freq
bollocks” - 1 freq
MetaPhone code - BLK
'black - 4 freq
black - 734 freq
bleck - 179 freq
blaik - 80 freq
belike - 5 freq
'belike - 3 freq
block - 55 freq
blackie - 21 freq
bleak - 24 freq
bloke - 24 freq
blek - 10 freq
bulk - 6 freq
blak - 85 freq
bleckie - 1 freq
blaak - 4 freq
bloack - 4 freq
bollick - 2 freq
bileq - 1 freq
blak' - 1 freq
balgay - 1 freq
bla'k - 1 freq
blok - 1 freq
blakk - 2 freq
blugga - 3 freq
boolik - 1 freq
blæc - 1 freq
black' - 1 freq
blake - 8 freq
€˜black - 3 freq
bullock - 3 freq
blog - 38 freq
blaick - 2 freq
baulk - 1 freq
blag - 1 freq
blaickie - 2 freq
bloc - 6 freq
bleck' - 1 freq
bulky - 1 freq
bellackeee - 1 freq
belic - 1 freq
blck - 1 freq
“black - 1 freq
blk - 1 freq
BLOG
Time to execute Levenshtein function - 0.493421 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.801866 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.087734 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.099794 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000771 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.