A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to being in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
being (0) - 306 freq
beig (1) - 6 freq
beings (1) - 4 freq
bring (1) - 554 freq
being' (1) - 1 freq
beinn (1) - 2 freq
keing (1) - 140 freq
bling (1) - 3 freq
bein' (1) - 59 freq
bein (1) - 1776 freq
breing (1) - 1 freq
beins (1) - 14 freq
bing (1) - 38 freq
beina (1) - 1 freq
'being (1) - 1 freq
reing (1) - 2 freq
geing (1) - 1 freq
boeing (1) - 1 freq
belting (2) - 2 freq
lying (2) - 25 freq
begins (2) - 83 freq
bearing (2) - 6 freq
brig (2) - 266 freq
heinz (2) - 2 freq
swing (2) - 70 freq
being (0) - 306 freq
bing (1) - 38 freq
boeing (1) - 1 freq
boyng (2) - 1 freq
bng (2) - 1 freq
beange (2) - 1 freq
bingo (2) - 38 freq
bang (2) - 100 freq
beenge (2) - 1 freq
bung (2) - 2 freq
baying (2) - 1 freq
binge (2) - 4 freq
buying (2) - 14 freq
bong (2) - 3 freq
reing (2) - 2 freq
geing (2) - 1 freq
beinn (2) - 2 freq
keing (2) - 140 freq
being' (2) - 1 freq
bring (2) - 554 freq
beig (2) - 6 freq
beings (2) - 4 freq
bein' (2) - 59 freq
bling (2) - 3 freq
beina (2) - 1 freq
SoundEx code - B520
bank - 189 freq
banes - 205 freq
bens - 47 freq
banks - 132 freq
bins - 34 freq
bing' - 2 freq
bink - 26 freq
bannocks - 42 freq
bannock - 18 freq
bang - 100 freq
bunch - 111 freq
bennachie - 43 freq
binch - 6 freq
bank' - 2 freq
bing - 38 freq
banns - 3 freq
being - 306 freq
beans - 66 freq
bones - 61 freq
bench - 58 freq
bungs - 1 freq
banksia - 1 freq
buns - 22 freq
boonce - 61 freq
bingo - 38 freq
banshee - 15 freq
beens - 35 freq
baunk - 30 freq
bunk - 28 freq
bings - 16 freq
bounce - 23 freq
bangs - 13 freq
bams - 28 freq
baying - 1 freq
baunks - 6 freq
beams - 8 freq
banjo - 17 freq
ben-i-hoose - 1 freq
buying - 14 freq
bum's - 2 freq
booms - 3 freq
binks - 7 freq
bonny's - 1 freq
binns - 2 freq
bonshaw - 1 freq
bynes - 1 freq
bouncy - 7 freq
boeing - 1 freq
beins - 14 freq
ben's - 2 freq
bmc - 7 freq
been's - 1 freq
binkee - 1 freq
boonies - 1 freq
bunnies - 4 freq
beings - 4 freq
being' - 1 freq
ban's - 3 freq
bans - 35 freq
banzai - 1 freq
bahamas - 2 freq
bams' - 1 freq
baimns - 1 freq
bannacks - 2 freq
bonus - 21 freq
bank- - 1 freq
bunks - 4 freq
bonnie's - 6 freq
benche - 1 freq
buncha - 2 freq
bane's - 1 freq
banes' - 2 freq
boneys' - 1 freq
boyne's - 1 freq
bawns - 4 freq
booncie - 1 freq
ban'k - 1 freq
bung - 2 freq
buoyancy - 2 freq
baims - 2 freq
boannie-wyes - 3 freq
bunchy - 1 freq
buncy - 1 freq
binnack - 2 freq
binnacks - 1 freq
baim's - 1 freq
bonxie - 8 freq
bums - 8 freq
baank - 1 freq
bonhoga - 1 freq
booncy - 11 freq
'booncy - 1 freq
bonns - 2 freq
benks - 2 freq
'boonce - 1 freq
bouyancy - 1 freq
binnock - 1 freq
bannos - 1 freq
bans' - 1 freq
bannock's - 1 freq
bank-e - 1 freq
bo'ness - 3 freq
'bannock' - 2 freq
bonk - 2 freq
bouns - 1 freq
biomass - 2 freq
boyng - 1 freq
banis - 1 freq
'banes- - 1 freq
bing's - 1 freq
bonsai - 1 freq
benk - 1 freq
be-ins - 2 freq
bankie - 2 freq
bianco - 3 freq
benachie - 2 freq
benjy - 8 freq
bauns - 4 freq
bng - 1 freq
bmg - 1 freq
bungee - 1 freq
€˜bans - 1 freq
binge - 4 freq
bonhoaga - 1 freq
baens - 3 freq
beenge - 1 freq
binkie - 1 freq
bmx - 1 freq
bmxs - 1 freq
€œbingo - 1 freq
boones - 1 freq
€œbang - 1 freq
beange - 1 freq
bines - 1 freq
bynack - 1 freq
beyonce - 1 freq
bong - 3 freq
‘bunksy’ - 1 freq
bunksy - 1 freq
bonce - 1 freq
boneys - 1 freq
bowing - 1 freq
baines - 1 freq
bunc - 1 freq
bungy - 1 freq
bun's - 1 freq
buenos - 1 freq
beyoncé’s - 2 freq
bangu - 1 freq
bnqo - 1 freq
bmjj - 1 freq
bfumx - 1 freq
bmq - 1 freq
'being - 1 freq
bbmjzg - 1 freq
benosey - 1 freq
boness - 3 freq
MetaPhone code - BNK
bank - 189 freq
bing' - 2 freq
bink - 26 freq
bannock - 18 freq
bang - 100 freq
bank' - 2 freq
bing - 38 freq
being - 306 freq
bingo - 38 freq
baunk - 30 freq
bunk - 28 freq
boeing - 1 freq
binkee - 1 freq
being' - 1 freq
bank- - 1 freq
ban'k - 1 freq
bung - 2 freq
binnack - 2 freq
baank - 1 freq
binnock - 1 freq
bank-e - 1 freq
'bannock' - 2 freq
bonk - 2 freq
boyng - 1 freq
benk - 1 freq
bankie - 2 freq
bianco - 3 freq
bng - 1 freq
binkie - 1 freq
€œbingo - 1 freq
€œbang - 1 freq
bynack - 1 freq
bong - 3 freq
bunc - 1 freq
bangu - 1 freq
bnqo - 1 freq
'being - 1 freq
BEING
am - 1545 freq
be - 15063 freq
wis - 28134 freq
is - 18321 freq
iz - 434 freq
been - 5175 freq
bön - 23 freq
are - 5138 freq
were - 4095 freq
wir - 2249 freq
was - 3407 freq
wus - 1426 freq
wur - 1618 freq
bein - 1776 freq
being - 306 freq
wiz - 1293 freq
wes - 1883 freq
war - 1446 freq
bin - 971 freq
bes - 315 freq
ir - 1541 freq
re - 48 freq
isnae - 558 freq
wisnae - 1002 freq
wisna - 1039 freq
ur - 604 freq
isna - 390 freq
Time to execute Levenshtein function - 0.185907 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.368929 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027688 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.058428 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001052 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.