A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie


Similar words to bum in Corpus

Methods: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated
Levenshtein
bum (0) - 44 freq
bur (1) - 8 freq
bom (1) - 3 freq
rum (1) - 32 freq
dum (1) - 16 freq
buk (1) - 2 freq
bgm (1) - 1 freq
bus (1) - 361 freq
baum (1) - 2 freq
buy (1) - 385 freq
but (1) - 13379 freq
mum (1) - 181 freq
kum (1) - 8 freq
bm (1) - 5 freq
bim (1) - 2 freq
fum (1) - 1 freq
eum (1) - 1 freq
um (1) - 151 freq
bam (1) - 39 freq
sum (1) - 416 freq
hum (1) - 45 freq
lum (1) - 140 freq
wum (1) - 1 freq
bump (1) - 23 freq
bue (1) - 1 freq
Double Levenshtein
bum (0) - 44 freq
bem (1) - 1 freq
bim (1) - 2 freq
bam (1) - 39 freq
baum (1) - 2 freq
bm (1) - 5 freq
bom (1) - 3 freq
baim (2) - 1 freq
bu (2) - 13 freq
tum (2) - 17 freq
bumt (2) - 1 freq
gum (2) - 19 freq
bun (2) - 59 freq
blm (2) - 1 freq
bums (2) - 8 freq
bul (2) - 3 freq
jum (2) - 3 freq
bug (2) - 57 freq
beam (2) - 19 freq
beem (2) - 5 freq
bame (2) - 2 freq
bud (2) - 36 freq
bur (2) - 8 freq
'um (2) - 35 freq
bmi (2) - 1 freq
SoundEx code - B500
bein - 1776 freq
been - 5175 freq
bonnie - 811 freq
behin - 41 freq
bin - 971 freq
ben - 609 freq
be-in - 29 freq
baun - 18 freq
bonny - 501 freq
bowin - 8 freq
bone - 46 freq
boon - 7 freq
beeame - 1 freq
bane - 81 freq
binna - 29 freq
bien - 19 freq
bony - 6 freq
beam - 19 freq
bammy - 2 freq
'bonnie - 2 freq
bunn - 1 freq
beena - 1 freq
bum - 44 freq
bouin - 5 freq
'bam - 1 freq
'bin - 1 freq
bein' - 59 freq
boney - 9 freq
buin - 12 freq
booin - 17 freq
baain - 2 freq
bmw - 4 freq
buyin - 84 freq
bine - 11 freq
bainie - 1 freq
bun - 59 freq
banie - 1 freq
bain - 77 freq
ban - 48 freq
bayan - 4 freq
baney - 12 freq
bany - 1 freq
bean - 43 freq
buyin' - 5 freq
baim - 1 freq
beem - 5 freq
beein - 14 freq
buyen - 1 freq
boom - 30 freq
bohun - 2 freq
bon - 14 freq
beanie - 2 freq
bawiin - 1 freq
bam - 39 freq
beano - 6 freq
bim - 2 freq
bunny - 4 freq
bono - 3 freq
'bein - 2 freq
bonie - 13 freq
binnae - 3 freq
ban' - 26 freq
boney' - 1 freq
'bonny' - 1 freq
baein - 20 freq
baehin - 11 freq
bee-en - 2 freq
behun - 1 freq
been' - 1 freq
bonme - 1 freq
behin' - 2 freq
bann - 9 freq
baen - 8 freq
bo'm - 1 freq
be'n - 2 freq
bayin - 2 freq
baimie - 1 freq
boannie - 59 freq
boy-an - 1 freq
'bonnie' - 2 freq
bön - 23 freq
bom - 3 freq
buyan - 3 freq
'ben' - 1 freq
bum' - 1 freq
benny - 2 freq
bene - 106 freq
be'in - 3 freq
boanny - 4 freq
buena - 1 freq
bame - 2 freq
boun' - 1 freq
bn - 7 freq
bohemia - 2 freq
byn - 2 freq
boun - 10 freq
boanie - 2 freq
boany - 1 freq
bena - 1 freq
baum - 2 freq
beyon - 1 freq
be-an - 17 freq
'bonny - 2 freq
bowan - 1 freq
bane- - 1 freq
beenie - 5 freq
bøn - 18 freq
beinn - 2 freq
beween - 1 freq
byne - 1 freq
‘binna - 1 freq
bowen - 1 freq
beame - 1 freq
“been - 1 freq
beean - 1 freq
‘bene - 1 freq
boonie - 2 freq
‘bonnie - 2 freq
“bonnie - 1 freq
bean' - 1 freq
binne - 1 freq
beannie - 26 freq
bannie - 1 freq
boyne - 4 freq
“bein - 2 freq
buon - 1 freq
bai-yun - 8 freq
beina - 1 freq
baiyun - 1 freq
'boon - 1 freq
benn - 1 freq
bhain - 24 freq
bem - 1 freq
beeän - 1 freq
b’in - 3 freq
bm - 5 freq
bpn - 1 freq
bone' - 1 freq
bonn - 2 freq
bmi - 1 freq
bein’ - 2 freq
beenawa - 1 freq
bfn - 1 freq
bœn - 1 freq
“bun - 1 freq
bnw - 1 freq
bunno - 1 freq
MetaPhone code - BM
bumbee - 4 freq
bamboo - 9 freq
beeame - 1 freq
beam - 19 freq
bammy - 2 freq
bum - 44 freq
'bam - 1 freq
bomb - 31 freq
bmw - 4 freq
bombay - 5 freq
baim - 1 freq
beem - 5 freq
boom - 30 freq
bam - 39 freq
bim - 2 freq
bo'm - 1 freq
baimie - 1 freq
bom - 3 freq
bum' - 1 freq
bame - 2 freq
baum - 2 freq
bombe - 1 freq
beame - 1 freq
bambi - 1 freq
bem - 1 freq
bm - 5 freq
bmi - 1 freq
bimbo - 1 freq
Manually curated
BUM
Time to execute Levenshtein function - 0.317157 milliseconds
The Levenshtein distance is the minimum number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
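As a rough illustration of the definition above, here is a minimal Python sketch of the distance and of a neighbour lookup. It is not the code behind this page: the function names `levenshtein` and `nearest` and the `max_distance` parameter are assumptions made for the example, and the sample words are simply copied from the listing.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits that turn a into b."""
    # previous[j] is the distance between the prefix of a processed so far
    # and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # delete ch_a
                current[j - 1] + 1,      # insert ch_b
                previous[j - 1] + cost,  # replace ch_a with ch_b (or keep it)
            ))
        previous = current
    return previous[-1]


def nearest(word, vocabulary, max_distance=1):
    """Return (neighbour, distance) pairs within max_distance, closest first."""
    scored = [(w, levenshtein(word, w)) for w in vocabulary]
    kept = [pair for pair in scored if pair[1] <= max_distance]
    return sorted(kept, key=lambda pair: (pair[1], pair[0]))


# Sample vocabulary copied from the listing; "beam" is two edits away from "bum".
sample = ["bum", "bur", "baum", "bump", "rum", "beam"]
print(nearest("bum", sample))
```

Only two rows of the table are kept in memory at a time, which is enough because each cell depends on the row above and the cell to its left.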
Time to execute Double Levenshtein function - 0.521380 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
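The idea can be sketched in Python as below. This is an illustration under assumptions, not the site's code: the page does not say which letters count as vowels, so the sketch assumes a, e, i, o and u, and the helper names `strip_vowels` and `double_levenshtein` are made up for the example. With that assumption the sketch reproduces the scores in the listing, such as bem (1) and bur (2).

```python
VOWELS = set("aeiouAEIOU")  # assumption: the page does not define its vowel set


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance, computed row by row."""
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop the assumed vowels, leaving the consonant skeleton."""
    return "".join(ch for ch in word if ch not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


for other in ["bem", "bur", "baim", "bu"]:
    print(other, double_levenshtein("bum", other))
```

A vowel swap such as bum/bem costs 1 because the skeletons "bm" and "bm" match, while a consonant swap such as bum/bur is counted in both passes and costs 2.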
Time to execute SoundEx function - 0.073454 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
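For readers who want to see how a code such as B500 comes about, here is a minimal Python sketch of the common American Soundex rules. Which Soundex variant this page actually uses is an assumption; the sketch is only known to agree with the listing on the words checked below. The names `CODES` and `soundex` are chosen for the example.

```python
# Letter groups of American Soundex; vowels, h, w and y carry no digit.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for letter in letters:
        CODES[letter] = digit


def soundex(word: str) -> str:
    """Return the four-character Soundex code, or "" if there are no ASCII letters."""
    letters = [ch for ch in word.lower() if ch.isascii() and ch.isalpha()]
    if not letters:
        return ""
    result = letters[0].upper()
    previous = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters that share a digit
        code = CODES.get(ch, "")
        if code and code != previous:
            result += code
            if len(result) == 4:
                break
        previous = code  # a vowel resets the run, so a repeated digit may follow
    return result.ljust(4, "0")


for word in ["bum", "bonnie", "been", "bohemia"]:
    print(word, soundex(word))
```

Because m and n share the digit 5 and vowels are dropped, words as different as bum, bonnie and bohemia all collapse to B500, which explains why the SoundEx list is so much longer than the others.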
Time to execute MetaPhone function - 0.038236 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000857 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
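A lookup of this kind can be sketched as a plain dictionary in Python. The entries below are hypothetical and exist only to show the shape of the table; the real lexicon is not reproduced on this page, and the names `LEXICON` and `lookup_lemma` are assumptions made for the example.

```python
from typing import Dict, Optional

# Hypothetical entries for illustration only; not taken from the real lexicon.
LEXICON: Dict[str, str] = {
    "bum": "BUM",
    "bums": "BUM",
    "bum'": "BUM",
}


def lookup_lemma(word: str, lexicon: Dict[str, str] = LEXICON) -> Optional[str]:
    """Return the curated lemma for a word, or None if the word is not covered."""
    return lexicon.get(word.lower())


print(lookup_lemma("bum"))     # found in the hypothetical table
print(lookup_lemma("sonsie"))  # not covered here, so the result is None
```

A dictionary lookup does no string comparison across the vocabulary, which fits the very small execution time reported for this method.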