A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to buns in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
buns (0) - 22 freq
bund (1) - 12 freq
bun (1) - 59 freq
runs (1) - 75 freq
bugs (1) - 21 freq
burns (1) - 381 freq
cuns (1) - 1 freq
bunt (1) - 6 freq
buds (1) - 16 freq
bunk (1) - 28 freq
yuns (1) - 1 freq
duns (1) - 2 freq
buys (1) - 13 freq
bins (1) - 34 freq
bus (1) - 361 freq
bens (1) - 47 freq
wuns (1) - 3 freq
buts (1) - 2 freq
bunc (1) - 1 freq
funs (1) - 7 freq
bunks (1) - 4 freq
bauns (1) - 4 freq
bunn (1) - 1 freq
bunts (1) - 1 freq
huns (1) - 10 freq
buns (0) - 22 freq
bauns (1) - 4 freq
bans (1) - 35 freq
bens (1) - 47 freq
bins (1) - 34 freq
bouns (1) - 1 freq
bums (2) - 8 freq
buls (2) - 1 freq
bung (2) - 2 freq
bungs (2) - 1 freq
nuns (2) - 8 freq
buss (2) - 34 freq
puns (2) - 4 freq
guns (2) - 73 freq
banes (2) - 205 freq
beens (2) - 35 freq
abins (2) - 1 freq
bones (2) - 61 freq
banis (2) - 1 freq
baens (2) - 3 freq
beins (2) - 14 freq
buenos (2) - 1 freq
bynes (2) - 1 freq
bines (2) - 1 freq
bonus (2) - 21 freq
SoundEx code - B520
bank - 189 freq
banes - 205 freq
bens - 47 freq
banks - 132 freq
bins - 34 freq
bing' - 2 freq
bink - 26 freq
bannocks - 42 freq
bannock - 18 freq
bang - 100 freq
bunch - 111 freq
bennachie - 43 freq
binch - 6 freq
bank' - 2 freq
bing - 38 freq
banns - 3 freq
being - 306 freq
beans - 66 freq
bones - 61 freq
bench - 58 freq
bungs - 1 freq
banksia - 1 freq
buns - 22 freq
boonce - 61 freq
bingo - 38 freq
banshee - 15 freq
beens - 35 freq
baunk - 30 freq
bunk - 28 freq
bings - 16 freq
bounce - 23 freq
bangs - 13 freq
bams - 28 freq
baying - 1 freq
baunks - 6 freq
beams - 8 freq
banjo - 17 freq
ben-i-hoose - 1 freq
buying - 14 freq
bum's - 2 freq
booms - 3 freq
binks - 7 freq
bonny's - 1 freq
binns - 2 freq
bonshaw - 1 freq
bynes - 1 freq
bouncy - 7 freq
boeing - 1 freq
beins - 14 freq
ben's - 2 freq
bmc - 7 freq
been's - 1 freq
binkee - 1 freq
boonies - 1 freq
bunnies - 4 freq
beings - 4 freq
being' - 1 freq
ban's - 3 freq
bans - 35 freq
banzai - 1 freq
bahamas - 2 freq
bams' - 1 freq
baimns - 1 freq
bannacks - 2 freq
bonus - 21 freq
bank- - 1 freq
bunks - 4 freq
bonnie's - 6 freq
benche - 1 freq
buncha - 2 freq
bane's - 1 freq
banes' - 2 freq
boneys' - 1 freq
boyne's - 1 freq
bawns - 4 freq
booncie - 1 freq
ban'k - 1 freq
bung - 2 freq
buoyancy - 2 freq
baims - 2 freq
boannie-wyes - 3 freq
bunchy - 1 freq
buncy - 1 freq
binnack - 2 freq
binnacks - 1 freq
baim's - 1 freq
bonxie - 8 freq
bums - 8 freq
baank - 1 freq
bonhoga - 1 freq
booncy - 11 freq
'booncy - 1 freq
bonns - 2 freq
benks - 2 freq
'boonce - 1 freq
bouyancy - 1 freq
binnock - 1 freq
bannos - 1 freq
bans' - 1 freq
bannock's - 1 freq
bank-e - 1 freq
bo'ness - 3 freq
'bannock' - 2 freq
bonk - 2 freq
bouns - 1 freq
biomass - 2 freq
boyng - 1 freq
banis - 1 freq
'banes- - 1 freq
bing's - 1 freq
bonsai - 1 freq
benk - 1 freq
be-ins - 2 freq
bankie - 2 freq
bianco - 3 freq
benachie - 2 freq
benjy - 8 freq
bauns - 4 freq
bng - 1 freq
bmg - 1 freq
bungee - 1 freq
€˜bans - 1 freq
binge - 4 freq
bonhoaga - 1 freq
baens - 3 freq
beenge - 1 freq
binkie - 1 freq
bmx - 1 freq
bmxs - 1 freq
€œbingo - 1 freq
boones - 1 freq
€œbang - 1 freq
beange - 1 freq
bines - 1 freq
bynack - 1 freq
beyonce - 1 freq
bong - 3 freq
‘bunksy’ - 1 freq
bunksy - 1 freq
bonce - 1 freq
boneys - 1 freq
bowing - 1 freq
baines - 1 freq
bunc - 1 freq
bungy - 1 freq
bun's - 1 freq
buenos - 1 freq
beyoncé’s - 2 freq
bangu - 1 freq
bnqo - 1 freq
bmjj - 1 freq
bfumx - 1 freq
bmq - 1 freq
'being - 1 freq
bbmjzg - 1 freq
benosey - 1 freq
boness - 3 freq
MetaPhone code - BNS
banes - 205 freq
bens - 47 freq
bins - 34 freq
banns - 3 freq
beans - 66 freq
bones - 61 freq
buns - 22 freq
boonce - 61 freq
beens - 35 freq
bounce - 23 freq
bonny's - 1 freq
binns - 2 freq
bynes - 1 freq
bouncy - 7 freq
beins - 14 freq
ben's - 2 freq
been's - 1 freq
boonies - 1 freq
bunnies - 4 freq
ban's - 3 freq
bans - 35 freq
banzai - 1 freq
bonus - 21 freq
bonnie's - 6 freq
bane's - 1 freq
banes' - 2 freq
boneys' - 1 freq
boyne's - 1 freq
bawns - 4 freq
booncie - 1 freq
buncy - 1 freq
booncy - 11 freq
'booncy - 1 freq
bonns - 2 freq
'boonce - 1 freq
bunghsie - 2 freq
bannos - 1 freq
bans' - 1 freq
banghs - 1 freq
bo'ness - 3 freq
bouns - 1 freq
banis - 1 freq
'banes- - 1 freq
bonsai - 1 freq
be-ins - 2 freq
bauns - 4 freq
€˜bans - 1 freq
baens - 3 freq
boones - 1 freq
bines - 1 freq
bonce - 1 freq
boneys - 1 freq
baines - 1 freq
bun's - 1 freq
buenos - 1 freq
benosey - 1 freq
boness - 3 freq
BUNS
Time to execute Levenshtein function - 0.199969 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.351221 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027501 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037335 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000921 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.