A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

Similar words to bloke in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
Levenshtein
bloke (0) - 24 freq
blokes (1) - 8 freq
blok (1) - 1 freq
boke (1) - 30 freq
blake (1) - 8 freq
cloke (1) - 3 freq
broke (1) - 136 freq
blone (1) - 2 freq
loke (1) - 2 freq
blocked (2) - 40 freq
blak' (2) - 1 freq
looke (2) - 1 freq
brok (2) - 6 freq
hoke (2) - 11 freq
blonde (2) - 42 freq
slope (2) - 29 freq
flowe (2) - 10 freq
lke (2) - 20 freq
slokk (2) - 1 freq
bike (2) - 161 freq
woke (2) - 90 freq
boose (2) - 2 freq
blood (2) - 134 freq
bore (2) - 42 freq
yoke (2) - 30 freq
Double Levenshtein
bloke (0) - 24 freq
blake (1) - 8 freq
blok (1) - 1 freq
belike (2) - 5 freq
blek (2) - 10 freq
blk (2) - 1 freq
blak (2) - 85 freq
loke (2) - 2 freq
blone (2) - 2 freq
boke (2) - 30 freq
blokes (2) - 8 freq
broke (2) - 136 freq
cloke (2) - 3 freq
flok (3) - 1 freq
fluke (3) - 1 freq
cluke (3) - 1 freq
slake (3) - 2 freq
brooke (3) - 11 freq
bloose (3) - 5 freq
blude (3) - 8 freq
loki (3) - 54 freq
bok (3) - 1 freq
bookie (3) - 25 freq
blade (3) - 55 freq
blowy (3) - 1 freq
SoundEx code - B420
'black - 4 freq
black - 734 freq
balls - 19 freq
blocks - 17 freq
bleck - 179 freq
bellows - 6 freq
bleeze - 46 freq
blaws - 54 freq
bells - 66 freq
bellies - 21 freq
bill's - 22 freq
-bill's - 2 freq
blaik - 80 freq
bliss - 59 freq
bell's - 6 freq
belike - 5 freq
bowls - 20 freq
bleach - 13 freq
billies - 14 freq
blaas - 3 freq
'belike - 3 freq
block - 55 freq
bullies - 4 freq
blues - 24 freq
blouse - 18 freq
blows - 12 freq
bulls - 10 freq
belly's - 7 freq
blackie - 21 freq
bill-wha's - 1 freq
bleak - 24 freq
bless - 42 freq
bowels - 9 freq
bull's - 2 freq
bolshoi' - 1 freq
bella's - 5 freq
bloke - 24 freq
billie's - 1 freq
blush - 10 freq
blaes - 4 freq
blek - 10 freq
bawls - 5 freq
bools - 23 freq
blaik's - 1 freq
bills - 18 freq
blacks - 3 freq
baileys - 3 freq
belle's - 1 freq
bulk - 6 freq
billy's - 11 freq
blaze - 11 freq
blak - 85 freq
biles - 14 freq
bleckie - 1 freq
black's - 3 freq
blaak - 4 freq
bayl's - 1 freq
bail's - 1 freq
blaw's - 1 freq
bullock's - 1 freq
ballsae - 1 freq
bloack - 4 freq
boils - 4 freq
bollocks - 7 freq
bloacks - 1 freq
bullseye - 2 freq
bollick - 2 freq
bileq - 1 freq
belloch - 3 freq
bales - 17 freq
blaa's - 1 freq
blak' - 1 freq
boolies - 1 freq
bulge - 4 freq
bloggs - 1 freq
bouls - 2 freq
balgay - 1 freq
baillies - 3 freq
biology - 11 freq
bla'k - 1 freq
buhls - 4 freq
blashy - 6 freq
blogs - 9 freq
bolsa' - 1 freq
blois - 5 freq
blois' - 1 freq
bealach - 1 freq
belch - 1 freq
blok - 1 freq
bolas - 4 freq
bleusk - 1 freq
billows - 1 freq
bloose - 5 freq
blakk - 2 freq
blugga - 3 freq
baals - 2 freq
bluish - 6 freq
blue's - 3 freq
blecks - 3 freq
baloos - 1 freq
blocs - 2 freq
bill-heuk - 1 freq
bolshie - 2 freq
buhl's - 1 freq
boolik - 1 freq
blæc - 1 freq
blacc- - 1 freq
black' - 1 freq
bluisk - 1 freq
byles - 5 freq
blowsy - 1 freq
blash - 1 freq
blake - 8 freq
belisha - 1 freq
balsa - 1 freq
‘black - 3 freq
bellas - 1 freq
bailie's - 1 freq
bull's-ee - 1 freq
bullock - 3 freq
bleize - 1 freq
bailies - 2 freq
blog - 38 freq
‘blues - 1 freq
billys - 1 freq
blaick - 2 freq
baulk - 1 freq
bells' - 1 freq
bullish - 1 freq
bull's-eye - 1 freq
belies - 1 freq
blag - 1 freq
by-whyles - 10 freq
buls - 1 freq
blowze - 1 freq
blaise - 1 freq
“balls - 1 freq
blaickie - 2 freq
bilge - 1 freq
boles - 1 freq
blasé - 1 freq
bloc - 6 freq
blackhaw - 1 freq
blis - 1 freq
bullocks - 2 freq
beals - 1 freq
belles - 1 freq
belly’s - 1 freq
bleck' - 1 freq
bulky - 1 freq
ballys - 1 freq
bellackeee - 1 freq
bollox - 2 freq
belic - 1 freq
bols - 1 freq
blck - 1 freq
bleuch - 2 freq
bluesey - 1 freq
“black - 1 freq
blk - 1 freq
bwlx - 1 freq
bellys - 2 freq
bayliss - 1 freq
bollocks” - 1 freq
MetaPhone code - BLK
'black - 4 freq
black - 734 freq
bleck - 179 freq
blaik - 80 freq
belike - 5 freq
'belike - 3 freq
block - 55 freq
blackie - 21 freq
bleak - 24 freq
bloke - 24 freq
blek - 10 freq
bulk - 6 freq
blak - 85 freq
bleckie - 1 freq
blaak - 4 freq
bloack - 4 freq
bollick - 2 freq
bileq - 1 freq
blak' - 1 freq
balgay - 1 freq
bla'k - 1 freq
blok - 1 freq
blakk - 2 freq
blugga - 3 freq
boolik - 1 freq
blæc - 1 freq
black' - 1 freq
blake - 8 freq
‘black - 3 freq
bullock - 3 freq
blog - 38 freq
blaick - 2 freq
baulk - 1 freq
blag - 1 freq
blaickie - 2 freq
bloc - 6 freq
bleck' - 1 freq
bulky - 1 freq
bellackeee - 1 freq
belic - 1 freq
blck - 1 freq
“black - 1 freq
blk - 1 freq
Manually curated
BLOKE
Time to execute Levenshtein function - 0.304908 milliseconds
The Levenshtein distance is the number of single-character replacements, insertions or deletions needed to transform one word into another. It is useful for detecting typos and alternative spellings.
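As a rough illustration of the definition above, the distance can be computed with the standard dynamic-programming approach. This is a minimal sketch, not the site's own implementation; the function name `levenshtein` is chosen here for illustration.

```python
def levenshtein(a: str, b: str) -> int:
    """Count the single-character edits needed to turn a into b."""
    # Keep the shorter word in the inner loop to use less memory.
    if len(a) < len(b):
        a, b = b, a
    # previous[j] holds the distance between the processed prefix of a
    # and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            substitution_cost = 0 if char_a == char_b else 1
            current.append(min(
                previous[j] + 1,                      # delete char_a
                current[j - 1] + 1,                   # insert char_b
                previous[j - 1] + substitution_cost,  # replace or keep
            ))
        previous = current
    return previous[-1]


print(levenshtein("bloke", "blokes"))   # one insertion
print(levenshtein("bloke", "blocked"))  # two insertions
```

These two pairs match the scores of 1 and 2 shown in the Levenshtein results for bloke.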
Time to execute Double Levenshtein function - 0.496395 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
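The two-pass idea can be sketched as follows. This is an illustrative reconstruction, not the site's code: the helper names are hypothetical, and treating y as a vowel is an assumption, although it is consistent with the score of 3 listed for blowy.

```python
VOWELS = set("aeiouy")  # assumption: y is counted as a vowel


def levenshtein(a: str, b: str) -> int:
    """Standard edit distance (insert, delete, replace)."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop vowels so that only the consonant skeleton remains."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Full-word distance plus distance between the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("bloke", "blake"))   # vowel change only: counted once
print(double_levenshtein("bloke", "blokes"))  # consonant change: counted twice
```

A vowel-only difference such as blake contributes to the first pass alone, while a consonant difference such as blokes is counted in both passes, which is why blokes scores 2 here but 1 under plain Levenshtein.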
Time to execute SoundEx function - 0.027470 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
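For readers who want to see how such a code is derived, here is a minimal sketch of the classic American Soundex rules. It is not necessarily the routine used by this site; the function name and the handling of non-letters (they are simply ignored) are assumptions.

```python
# Consonant groups and their Soundex digits.
SOUNDEX_CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for letter in letters:
        SOUNDEX_CODES[letter] = digit


def soundex(word: str) -> str:
    """Return a four-character code: first letter plus up to three digits."""
    # Assumption: anything outside a-z (apostrophes, hyphens, accents) is ignored.
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    encoded = letters[0].upper()
    previous = SOUNDEX_CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters with the same code
        digit = SOUNDEX_CODES.get(ch, "")
        if digit and digit != previous:
            encoded += digit
        previous = digit  # vowels reset this, so a repeated code counts again
        if len(encoded) == 4:
            break
    return encoded.ljust(4, "0")  # pad short codes with zeros


print(soundex("bloke"))  # B420
print(soundex("black"))  # B420
```

Both words reduce to B, then 4 for l, then 2 for the k sound, padded with a zero, which is why black appears in the SoundEx results for bloke.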
Time to execute MetaPhone function - 0.072676 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000784 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
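A lookup of this kind can be pictured as a plain dictionary from spelling to lemma. The sketch below is illustrative only: apart from bloke mapping to BLOKE, which is the result shown on this page, the entries and the names `LEXICON` and `lemma_for` are hypothetical and do not reproduce the corpus's real lexicon.

```python
from typing import Optional

# Hypothetical sample entries; only "bloke" -> "BLOKE" is taken from the page.
LEXICON = {
    "bloke": "BLOKE",
    "blokes": "BLOKE",  # assumed inflected form
    "blok": "BLOKE",    # assumed spelling variant
}


def lemma_for(word: str) -> Optional[str]:
    """Return the curated lemma, or None when the word is not covered."""
    return LEXICON.get(word.lower())


print(lemma_for("bloke"))   # BLOKE
print(lemma_for("sonsie"))  # None, because this sample table has no entry for it
```

Because the lookup is a single dictionary access with no distance calculation, it is plausible that it would be the fastest of the five methods, as the timings above suggest.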