A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

Similar words to broker in Corpus

Methods: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated
Levenshtein (distance in brackets)
broker (0) - 1 freq
proker (1) - 1 freq
brokers (1) - 2 freq
troker (1) - 1 freq
broken (1) - 174 freq
broke (1) - 139 freq
'broken (2) - 1 freq
hooker (2) - 4 freq
broader (2) - 1 freq
brok (2) - 6 freq
proper (2) - 184 freq
grocer (2) - 3 freq
boked (2) - 2 freq
raker (2) - 2 freq
blokes (2) - 8 freq
stoker (2) - 4 freq
brokk (2) - 8 freq
boke' (2) - 1 freq
cooker (2) - 21 freq
basker (2) - 1 freq
boer (2) - 3 freq
troke (2) - 17 freq
joker (2) - 7 freq
biker (2) - 3 freq
brouser (2) - 4 freq
Double Levenshtein (distance in brackets)
broker (0) - 1 freq
breaker (2) - 4 freq
burker (2) - 2 freq
birker (2) - 1 freq
broke (2) - 139 freq
barker (2) - 1 freq
broken (2) - 174 freq
proker (2) - 1 freq
brokers (2) - 2 freq
troker (2) - 1 freq
bulker (3) - 1 freq
brakes (3) - 17 freq
braer (3) - 1 freq
brer (3) - 2 freq
beaker (3) - 7 freq
breer (3) - 11 freq
bicker (3) - 12 freq
braked (3) - 1 freq
trooker (3) - 2 freq
brokin (3) - 3 freq
braver (3) - 4 freq
brodir (3) - 10 freq
bunker (3) - 23 freq
brewer (3) - 1 freq
brookes (3) - 1 freq
SoundEx code - B626
bruiser - 4 freq
braggarts - 1 freq
braggart's - 1 freq
brickwark - 1 freq
berserk - 3 freq
brochure - 8 freq
bergère - 1 freq
bergerac - 6 freq
berkshire - 4 freq
breakers - 2 freq
brekker - 2 freq
burgauer - 3 freq
bureaucrat - 1 freq
braggart - 2 freq
berserker - 1 freq
berserker's - 1 freq
berserkers - 1 freq
berserks - 1 freq
breaker - 4 freq
broker - 1 freq
barcardi - 1 freq
breezer - 5 freq
brochires - 3 freq
brochire - 1 freq
burger - 13 freq
burkart - 1 freq
'burkers' - 1 freq
burkers - 2 freq
burker - 2 freq
burgers - 11 freq
brekkers - 1 freq
boaregarde - 1 freq
'breezers' - 1 freq
breezers - 1 freq
bursaries - 1 freq
braziers - 2 freq
burghers - 2 freq
birsier - 1 freq
beaurocrats - 1 freq
bureaucracy - 6 freq
bureaucratic - 1 freq
brucker - 1 freq
berrickshire - 1 freq
brouser - 4 freq
brousers - 1 freq
barker - 1 freq
brucre - 1 freq
braegarr - 2 freq
bruisers - 1 freq
bursary - 1 freq
brokerage - 1 freq
braisert - 1 freq
brakkar - 2 freq
birker - 1 freq
brokers - 2 freq
brocher - 2 freq
brazier - 5 freq
bureaucrats - 1 freq
browser - 4 freq
burger-meat - 1 freq
brougher - 3 freq
brsquirrel - 2 freq
barrykerr - 1 freq
brookranger - 1 freq
burgerking - 1 freq
MetaPhone code - BRKR
bergère - 1 freq
brekker - 2 freq
burgauer - 3 freq
breaker - 4 freq
broker - 1 freq
burker - 2 freq
brucker - 1 freq
barker - 1 freq
brucre - 1 freq
braegarr - 2 freq
brakkar - 2 freq
birker - 1 freq
barrykerr - 1 freq
Manually curated - BROKER
Time to execute Levenshtein function - 0.200552 milliseconds
The Levenshtein distance is the minimum number of single-character replacements, insertions or deletions needed to transform one word into another. It is useful for detecting typos and alternative spellings.
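As an illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming calculation. The function name `levenshtein` is chosen for this example; it is not the corpus site's own code.

```python
def levenshtein(a: str, b: str) -> int:
    """Return the edit distance between a and b (replace, insert, delete)."""
    # Keep the shorter word in the inner loop so each row stays small.
    if len(a) < len(b):
        a, b = b, a
    # previous[j] holds the distance between the processed prefix of a and b[:j].
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(
                previous[j] + 1,         # delete char_a
                current[j - 1] + 1,      # insert char_b
                previous[j - 1] + cost,  # replace, or keep if equal
            ))
        previous = current
    return previous[-1]
```

With this sketch, `levenshtein("broker", "proker")` gives 1 and `levenshtein("broker", "hooker")` gives 2, which agrees with the distances listed for those words.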
Time to execute Double Levenshtein function - 0.374678 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
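One possible reading of this description, sketched in Python: compute the distance on the full words, compute it again on the words with vowels stripped, and sum the two. The vowel set (a, e, i, o, u) and the names `strip_vowels` and `double_levenshtein` are assumptions made for this example; the site's actual implementation is not shown.

```python
VOWELS = set("aeiou")  # assumed vowel set; the site may treat other letters as vowels


def levenshtein(a: str, b: str) -> int:
    """Standard edit distance (replace, insert, delete)."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop the assumed vowels, leaving the consonant skeleton."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Full-word distance plus the distance between consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))
```

Under these assumptions, `double_levenshtein("broker", "breaker")` is 2 (the consonant skeletons are both `brkr`) and `double_levenshtein("broker", "bunker")` is 3, which is consistent with the scores listed for those words.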
Time to execute SoundEx function - 0.035240 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
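To show how differently spelled words end up with the same code, here is a minimal Python sketch of the commonly described American Soundex rules: keep the first letter, map later consonants to digits, collapse adjacent repeats, and pad or truncate to four characters. The function name `soundex` and the handling of non-English letters (treated like vowels) are choices made for this example, not necessarily what the site does.

```python
# Consonant groups and their Soundex digits.
_GROUPS = (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
           ("l", "4"), ("mn", "5"), ("r", "6"))
_CODES = {ch: digit for letters, digit in _GROUPS for ch in letters}


def soundex(word: str) -> str:
    """Return a four-character Soundex code, or '' if word has no letters."""
    letters = [ch for ch in word.lower() if ch.isalpha()]
    if not letters:
        return ""
    code = letters[0].upper()
    previous = _CODES.get(letters[0], "")
    for ch in letters[1:]:
        digit = _CODES.get(ch, "")
        if digit:
            # Adjacent letters with the same digit are coded only once.
            if digit != previous:
                code += digit
            previous = digit
        elif ch not in "hw":
            # Vowels (and any other uncoded letter) separate repeated digits;
            # h and w do not.
            previous = ""
        if len(code) == 4:
            break
    return code.ljust(4, "0")
```

With this sketch, `soundex("broker")`, `soundex("burger")` and `soundex("brsquirrel")` all return `B626`, the code shown in the results.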
Time to execute MetaPhone function - 0.039190 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000903 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
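A lookup of this kind can be sketched as a plain dictionary. The entries below are hypothetical and only illustrate the shape of the table (the `broker` entry mirrors the result shown on this page; the `brokers` entry is an assumption); `LEXICON` and `curated_lemma` are names invented for this example.

```python
from typing import Optional

# Hypothetical hand-built lexicon: surface form -> lemma.
LEXICON = {
    "broker": "BROKER",   # mirrors the curated result shown for "broker"
    "brokers": "BROKER",  # assumed entry, for illustration only
}


def curated_lemma(word: str) -> Optional[str]:
    """Return the curated lemma for word, or None if the word is not covered."""
    return LEXICON.get(word.lower())
```

Because the table is built by hand, a miss is expected for many words: here `curated_lemma("ahint")` returns `None`, while `curated_lemma("broker")` returns `"BROKER"`.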