A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to being in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
being (0) - 296 freq
keing (1) - 140 freq
beig (1) - 6 freq
beins (1) - 14 freq
bein (1) - 1744 freq
beinn (1) - 2 freq
'being (1) - 1 freq
bing (1) - 38 freq
boeing (1) - 1 freq
reing (1) - 2 freq
beings (1) - 3 freq
bling (1) - 3 freq
bring (1) - 550 freq
beina (1) - 1 freq
bein' (1) - 59 freq
breing (1) - 1 freq
geing (1) - 1 freq
reings (2) - 2 freq
nein (2) - 4 freq
buin (2) - 12 freq
aewing (2) - 1 freq
daeing (2) - 2 freq
berns (2) - 10 freq
benny (2) - 2 freq
neig (2) - 1 freq
being (0) - 296 freq
boeing (1) - 1 freq
bing (1) - 38 freq
binge (2) - 4 freq
beange (2) - 1 freq
buying (2) - 13 freq
baying (2) - 1 freq
geing (2) - 1 freq
bang (2) - 97 freq
bng (2) - 1 freq
bingo (2) - 38 freq
bong (2) - 3 freq
beenge (2) - 1 freq
breing (2) - 1 freq
boyng (2) - 1 freq
bung (2) - 2 freq
beinn (2) - 2 freq
'being (2) - 1 freq
bein' (2) - 59 freq
bein (2) - 1744 freq
beins (2) - 14 freq
keing (2) - 140 freq
beig (2) - 6 freq
beings (2) - 3 freq
reing (2) - 2 freq
SoundEx code - B520
bank - 186 freq
banes - 202 freq
bens - 43 freq
banks - 130 freq
bins - 32 freq
bing' - 2 freq
bink - 26 freq
bannocks - 42 freq
bannock - 18 freq
bang - 97 freq
bunch - 105 freq
bennachie - 43 freq
binch - 6 freq
bank' - 2 freq
bing - 38 freq
banns - 3 freq
being - 296 freq
beans - 64 freq
bones - 61 freq
bench - 54 freq
bungs - 1 freq
banksia - 1 freq
buns - 22 freq
boonce - 61 freq
bingo - 38 freq
banshee - 15 freq
beens - 35 freq
baunk - 30 freq
bunk - 28 freq
bings - 16 freq
bounce - 23 freq
bangs - 13 freq
bams - 19 freq
baying - 1 freq
baunks - 6 freq
beams - 8 freq
banjo - 14 freq
ben-i-hoose - 1 freq
buying - 13 freq
bum's - 2 freq
booms - 3 freq
binks - 7 freq
bonny's - 1 freq
binns - 2 freq
bonshaw - 1 freq
bynes - 1 freq
bouncy - 7 freq
boeing - 1 freq
beins - 14 freq
ben's - 1 freq
bmc - 7 freq
been's - 1 freq
binkee - 1 freq
boonies - 1 freq
bunnies - 4 freq
bans - 35 freq
banzai - 1 freq
bahamas - 2 freq
bams' - 1 freq
ban's - 2 freq
baimns - 1 freq
bannacks - 2 freq
bonus - 21 freq
bank- - 1 freq
bunks - 4 freq
bonnie's - 6 freq
benche - 1 freq
buncha - 2 freq
bane's - 1 freq
banes' - 2 freq
boneys' - 1 freq
boyne's - 1 freq
bawns - 4 freq
booncie - 1 freq
ban'k - 1 freq
bung - 2 freq
buoyancy - 2 freq
baims - 2 freq
boannie-wyes - 3 freq
bunchy - 1 freq
buncy - 1 freq
binnack - 2 freq
binnacks - 1 freq
baim's - 1 freq
bonxie - 8 freq
bums - 8 freq
baank - 1 freq
bonhoga - 1 freq
booncy - 11 freq
'booncy - 1 freq
bonns - 2 freq
benks - 2 freq
'boonce - 1 freq
bouyancy - 1 freq
binnock - 1 freq
bannos - 1 freq
bans' - 1 freq
bannock's - 1 freq
bank-e - 1 freq
bo'ness - 3 freq
'bannock' - 2 freq
bonk - 2 freq
bouns - 1 freq
biomass - 2 freq
boyng - 1 freq
banis - 1 freq
'banes- - 1 freq
bing's - 1 freq
bonsai - 1 freq
benk - 1 freq
be-ins - 2 freq
bankie - 2 freq
bianco - 3 freq
benachie - 2 freq
benjy - 8 freq
bauns - 4 freq
bng - 1 freq
beings - 3 freq
bmg - 1 freq
bungee - 1 freq
€˜bans - 1 freq
binge - 4 freq
bonhoaga - 1 freq
baens - 3 freq
beenge - 1 freq
binkie - 1 freq
bmx - 1 freq
bmxs - 1 freq
€œbingo - 1 freq
boones - 1 freq
€œbang - 1 freq
beange - 1 freq
bines - 1 freq
bynack - 1 freq
beyonce - 1 freq
bong - 3 freq
‘bunksy’ - 1 freq
bunksy - 1 freq
bonce - 1 freq
boneys - 1 freq
bowing - 1 freq
baines - 1 freq
bunc - 1 freq
bungy - 1 freq
bun's - 1 freq
buenos - 1 freq
beyoncé’s - 2 freq
bangu - 1 freq
bnqo - 1 freq
bmjj - 1 freq
bfumx - 1 freq
bmq - 1 freq
'being - 1 freq
bbmjzg - 1 freq
benosey - 1 freq
boness - 3 freq
MetaPhone code - BNK
bank - 186 freq
bing' - 2 freq
bink - 26 freq
bannock - 18 freq
bang - 97 freq
bank' - 2 freq
bing - 38 freq
being - 296 freq
bingo - 38 freq
baunk - 30 freq
bunk - 28 freq
boeing - 1 freq
binkee - 1 freq
bank- - 1 freq
ban'k - 1 freq
bung - 2 freq
binnack - 2 freq
baank - 1 freq
binnock - 1 freq
bank-e - 1 freq
'bannock' - 2 freq
bonk - 2 freq
boyng - 1 freq
benk - 1 freq
bankie - 2 freq
bianco - 3 freq
bng - 1 freq
binkie - 1 freq
€œbingo - 1 freq
€œbang - 1 freq
bynack - 1 freq
bong - 3 freq
bunc - 1 freq
bangu - 1 freq
bnqo - 1 freq
'being - 1 freq
BEING
am - 1527 freq
be - 14795 freq
wis - 27947 freq
is - 18023 freq
been - 5087 freq
bön - 23 freq
are - 5053 freq
were - 4054 freq
wir - 2162 freq
was - 3361 freq
wus - 1426 freq
wur - 1596 freq
bein - 1744 freq
being - 296 freq
wiz - 1272 freq
wes - 1816 freq
war - 1438 freq
bin - 954 freq
bes - 315 freq
ir - 1540 freq
re - 48 freq
isnae - 544 freq
wisnae - 999 freq
wisna - 1037 freq
ur - 541 freq
isna - 390 freq
Time to execute Levenshtein function - 0.201604 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.402224 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.029178 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.036861 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001197 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.