A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

Words similar to "begins" in the corpus

Methods: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated
Levenshtein (edit distance in brackets)
begins (0) - 83 freq
begin' (1) - 1 freq
be-ins (1) - 2 freq
beins (1) - 14 freq
baegins (1) - 1 freq
begin's (1) - 1 freq
begin (1) - 104 freq
beggs (2) - 1 freq
begunk (2) - 18 freq
beans (2) - 66 freq
behind (2) - 261 freq
becis (2) - 40 freq
behint (2) - 5 freq
bearins (2) - 9 freq
bogies (2) - 3 freq
eins (2) - 2 freq
leains (2) - 3 freq
leggins (2) - 3 freq
begun (2) - 79 freq
beginin (2) - 15 freq
ingins (2) - 15 freq
beinn (2) - 2 freq
be'in (2) - 3 freq
begs (2) - 11 freq
belies (2) - 1 freq
Double Levenshtein (combined distance in brackets)
begins (0) - 83 freq
baegins (1) - 1 freq
begin' (2) - 1 freq
begin (2) - 104 freq
begin's (2) - 1 freq
beins (2) - 14 freq
be-ins (2) - 2 freq
beatins (3) - 3 freq
regions (3) - 20 freq
biggins (3) - 54 freq
bens (3) - 47 freq
berns (3) - 10 freq
began (3) - 298 freq
begets (3) - 4 freq
baegin (3) - 2 freq
begs (3) - 11 freq
begen (3) - 11 freq
beens (3) - 35 freq
bins (3) - 34 freq
barins (3) - 4 freq
vegans (3) - 1 freq
bogils (3) - 2 freq
begunks (3) - 2 freq
basins (3) - 1 freq
byganes (3) - 2 freq
SoundEx code - B252
business - 295 freq
begins - 83 freq
biggins - 54 freq
bogging - 2 freq
begunked - 4 freq
becomes - 44 freq
businessman - 11 freq
business-fowk - 1 freq
businesslike - 1 freq
bees-knees - 2 freq
big-yins - 2 freq
begunkit - 8 freq
bosons - 1 freq
beckons - 5 freq
biggans - 6 freq
buzzing - 11 freq
biggings - 3 freq
buchan's - 2 freq
byganes - 2 freq
bisniss - 4 freq
begin's - 1 freq
buchanness - 2 freq
bisoms - 1 freq
bigone's - 1 freq
begunk - 18 freq
bashans - 1 freq
buzness - 7 freq
bookings - 2 freq
boak-makkin - 3 freq
businesses - 19 freq
'business - 1 freq
businessforscotland - 2 freq
businessmen - 3 freq
bisoms- - 1 freq
baegins - 1 freq
becums - 9 freq
booking - 5 freq
buckingham - 4 freq
bïzness - 2 freq
busineass - 1 freq
beisnes - 5 freq
bisness - 3 freq
biggeens - 3 freq
bisnesg - 1 freq
boggin's - 4 freq
'boggin's - 2 freq
beacons - 4 freq
bigsiniss - 1 freq
beseems - 1 freq
busens - 1 freq
bizzness - 1 freq
beijing - 1 freq
baking - 9 freq
backing - 3 freq
bygang - 1 freq
besoms - 2 freq
becumms - 1 freq
bacon's - 1 freq
beisines - 5 freq
basins - 1 freq
biggin-site - 1 freq
beisnessis - 1 freq
beisenes - 2 freq
“business - 1 freq
bosniak - 1 freq
begunks - 2 freq
boaking - 1 freq
beeching - 1 freq
bazonga - 1 freq
bikinis - 1 freq
beezing - 1 freq
backins - 2 freq
boking - 1 freq
business-like - 2 freq
businesslik - 1 freq
business-lik - 1 freq
buisness - 1 freq
bbcnews - 3 freq
beaujangles - 1 freq
bossing - 2 freq
bbcnewsline - 1 freq
basking - 1 freq
bazmcalister - 1 freq
biking - 2 freq
beckyannshaw - 4 freq
buchanstrypes - 1 freq
bbcmicrobot - 1 freq
bigging - 1 freq
bo’shunky - 1 freq
bbcjamescook - 2 freq
bqmzj - 1 freq
beejaymcgee - 1 freq
beijingpalmer - 1 freq
busking - 1 freq
bokin’s - 1 freq
bbcnewsnight - 1 freq
boxing - 4 freq
bissness - 1 freq
bissniss - 1 freq
bfgmoezk - 1 freq
be-gunked - 1 freq
MetaPhone code - BJNS
begins - 83 freq
begin's - 1 freq
baegins - 1 freq
Manually curated - BEGINS
Time to execute Levenshtein function - 0.182479 milliseconds
The Levenshtein distance is the minimum number of single-character replacements, insertions or deletions needed to transform one word into another. It's useful for detecting typos and alternative spellings.
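The edit distance described here can be sketched in a few lines of Python. This is a minimal illustration using the standard dynamic-programming approach, not the code behind this page; the names `levenshtein`, `nearest` and `SAMPLE` are made up for the example, and `SAMPLE` is just a handful of words taken from the listing above.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits turning a into b."""
    if len(a) < len(b):
        a, b = b, a  # keep the shorter word in the inner loop
    previous = list(range(len(b) + 1))  # distances from "" to prefixes of b
    for i, ca in enumerate(a, start=1):
        current = [i]  # deleting i characters turns a[:i] into ""
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (or keep it)
            ))
        previous = current
    return previous[-1]


def nearest(query, vocabulary, limit=5):
    """Rank vocabulary by distance to query, closest first (ties alphabetical)."""
    scored = ((word, levenshtein(query, word)) for word in vocabulary)
    return sorted(scored, key=lambda pair: (pair[1], pair[0]))[:limit]


# Illustrative sample only: a few words from the listing above.
SAMPLE = ["begin", "beins", "behind", "begun", "began", "business"]

if __name__ == "__main__":
    for word, distance in nearest("begins", SAMPLE):
        print(f"{word} ({distance})")
```

Sorting by distance and then alphabetically is an arbitrary tie-break chosen for the sketch; the page itself does not say how ties are ordered.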
Time to execute Double Levenshtein function - 0.334082 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
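A rough sketch of that idea in Python follows. It is an assumption-laden illustration rather than the page's own code: the function names are invented, and the vowel set `aeiouy` is a guess, chosen because treating y as a vowel reproduces scores shown in the listing above, such as byganes (3).

```python
VOWELS = "aeiouy"  # assumption: the page does not say which letters count as vowels


def levenshtein(a: str, b: str) -> int:
    """Plain edit distance (replace, insert, delete), row by row."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop vowels so only the consonant skeleton (and punctuation) remains."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the vowel-less words."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


if __name__ == "__main__":
    for word in ["baegins", "begin", "began", "byganes"]:
        print(word, double_levenshtein("begins", word))
```

Because a vowel change only affects the first pass while a consonant change affects both, baegins stays at distance 1 but begin moves from 1 to 2.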
Time to execute SoundEx function - 0.027509 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
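As a hedged illustration, the classic American Soundex rules can be written out as below. This is a sketch of the textbook algorithm, not necessarily the exact function this page calls; the name `soundex` and the decision to ignore non-ASCII letters and punctuation are assumptions of the example.

```python
# Consonant groups and their Soundex digits; vowels, h, w and y have no digit.
CODES = {}
for group, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                     ("l", "4"), ("mn", "5"), ("r", "6")):
    for letter in group:
        CODES[letter] = digit


def soundex(word: str) -> str:
    """Return the 4-character Soundex code, or "" if the word has no letters."""
    # Assumption: keep only ASCII letters, so apostrophes and hyphens are skipped.
    letters = [ch for ch in word.lower() if ch.isascii() and ch.isalpha()]
    if not letters:
        return ""
    result = letters[0].upper()
    previous = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate two letters with the same digit
        digit = CODES.get(ch, "")
        if digit and digit != previous:
            result += digit
            if len(result) == 4:
                break
        previous = digit  # a vowel resets this, so a repeated digit is kept
    return result.ljust(4, "0")  # pad short codes with zeros


if __name__ == "__main__":
    for word in ["begins", "business", "bbcnews", "beijing"]:
        print(word, soundex(word))
```

Only the first three consonant digits survive, which is why words as different as begins, business and bbcnews all land on B252 in the listing above.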
Time to execute MetaPhone function - 0.037166 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.000920 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
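A lookup of this kind might look something like the sketch below. The table contents are purely illustrative: only the begins to BEGINS pairing is suggested by the output above, and the other entries, along with the names `LEXICON` and `lemma_for`, are invented for the example.

```python
from typing import Optional

# Hypothetical hand-built lexicon: spelling variant -> lemma.
# Entries other than "begins" are made up to show the shape of the table.
LEXICON = {
    "begins": "BEGINS",
    "begin's": "BEGINS",
    "baegins": "BEGINS",
}


def lemma_for(word: str) -> Optional[str]:
    """Return the curated lemma for a word, or None if it is not covered."""
    return LEXICON.get(word.strip().lower())


if __name__ == "__main__":
    for word in ["Begins", "baegins", "ablow"]:
        print(word, "->", lemma_for(word))
```

A plain dictionary lookup does no computation on the word itself, which fits with this method having the shortest execution time reported above.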