A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to begunk in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
begunk (0) - 18 freq
begunks (1) - 2 freq
begun (1) - 79 freq
begeck (2) - 14 freq
begane (2) - 1 freq
begowk (2) - 2 freq
begin' (2) - 1 freq
begunkit (2) - 8 freq
begins (2) - 83 freq
gunk (2) - 9 freq
behun (2) - 1 freq
began (2) - 298 freq
begin (2) - 104 freq
benk (2) - 1 freq
baegun (2) - 6 freq
begunked (2) - 4 freq
begude (2) - 2 freq
bunk (2) - 28 freq
begen (2) - 11 freq
beuk (2) - 108 freq
baunk (2) - 30 freq
aegin (3) - 1 freq
melgund (3) - 1 freq
beuch (3) - 2 freq
rerun (3) - 1 freq
begunk (0) - 18 freq
begun (2) - 79 freq
begunks (2) - 2 freq
baegun (3) - 6 freq
benk (3) - 1 freq
begunked (3) - 4 freq
begen (3) - 11 freq
baunk (3) - 30 freq
began (3) - 298 freq
bunk (3) - 28 freq
begin (3) - 104 freq
begane (3) - 1 freq
begeck (3) - 14 freq
begin' (3) - 1 freq
begowk (3) - 2 freq
begunkit (3) - 8 freq
gunk (3) - 9 freq
begins (3) - 83 freq
blenk (4) - 12 freq
bink (4) - 26 freq
bonk (4) - 2 freq
bygang (4) - 1 freq
bygane (4) - 45 freq
blink (4) - 91 freq
baegin (4) - 2 freq
SoundEx code - B252
business - 295 freq
begins - 83 freq
biggins - 54 freq
bogging - 2 freq
begunked - 4 freq
becomes - 44 freq
businessman - 11 freq
business-fowk - 1 freq
businesslike - 1 freq
bees-knees - 2 freq
big-yins - 2 freq
begunkit - 8 freq
bosons - 1 freq
beckons - 5 freq
biggans - 6 freq
buzzing - 11 freq
biggings - 3 freq
buchan's - 2 freq
byganes - 2 freq
bisniss - 4 freq
begin's - 1 freq
buchanness - 2 freq
bisoms - 1 freq
bigone's - 1 freq
begunk - 18 freq
bashans - 1 freq
buzness - 7 freq
bookings - 2 freq
boak-makkin - 3 freq
businesses - 19 freq
'business - 1 freq
businessforscotland - 2 freq
businessmen - 3 freq
bisoms- - 1 freq
baegins - 1 freq
becums - 9 freq
booking - 5 freq
buckingham - 4 freq
bïzness - 2 freq
busineass - 1 freq
beisnes - 5 freq
bisness - 3 freq
biggeens - 3 freq
bisnesg - 1 freq
boggin's - 4 freq
'boggin's - 2 freq
beacons - 4 freq
bigsiniss - 1 freq
beseems - 1 freq
busens - 1 freq
bizzness - 1 freq
beijing - 1 freq
baking - 9 freq
backing - 3 freq
bygang - 1 freq
besoms - 2 freq
becumms - 1 freq
bacon's - 1 freq
beisines - 5 freq
basins - 1 freq
biggin-site - 1 freq
beisnessis - 1 freq
beisenes - 2 freq
€œbusiness - 1 freq
bosniak - 1 freq
begunks - 2 freq
boaking - 1 freq
beeching - 1 freq
bazonga - 1 freq
bikinis - 1 freq
beezing - 1 freq
backins - 2 freq
boking - 1 freq
business-like - 2 freq
businesslik - 1 freq
business-lik - 1 freq
buisness - 1 freq
bbcnews - 3 freq
beaujangles - 1 freq
bossing - 2 freq
bbcnewsline - 1 freq
basking - 1 freq
bazmcalister - 1 freq
biking - 2 freq
beckyannshaw - 4 freq
buchanstrypes - 1 freq
bbcmicrobot - 1 freq
bigging - 1 freq
boÂ’shunky - 1 freq
bbcjamescook - 2 freq
bqmzj - 1 freq
beejaymcgee - 1 freq
beijingpalmer - 1 freq
busking - 1 freq
bokinÂ’s - 1 freq
bbcnewsnight - 1 freq
boxing - 4 freq
bissness - 1 freq
bissniss - 1 freq
bfgmoezk - 1 freq
be-gunked - 1 freq
MetaPhone code - BKNK
bogging - 2 freq
begunk - 18 freq
booking - 5 freq
baking - 9 freq
backing - 3 freq
bygang - 1 freq
boaking - 1 freq
boking - 1 freq
biking - 2 freq
bigging - 1 freq
BEGUNK
Time to execute Levenshtein function - 0.173534 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.361685 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027346 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.036835 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000876 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.