A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to brokken in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
brokken (0) - 17 freq
brakken (1) - 3 freq
brokkan (1) - 1 freq
brocken (1) - 1 freq
brukken (1) - 30 freq
broken (1) - 174 freq
breuken (2) - 4 freq
brokk (2) - 8 freq
braaken (2) - 2 freq
drouken (2) - 1 freq
brookin (2) - 1 freq
lokken (2) - 1 freq
spokken (2) - 47 freq
brucken (2) - 5 freq
broaden (2) - 2 freq
brakked (2) - 1 freq
'broken (2) - 1 freq
broke (2) - 139 freq
bruken (2) - 1 freq
braiken (2) - 1 freq
browden (2) - 4 freq
brakkin (2) - 35 freq
braken (2) - 16 freq
drukken (2) - 12 freq
slokken (2) - 1 freq
brokken (0) - 17 freq
brukken (1) - 30 freq
brokkan (1) - 1 freq
brakken (1) - 3 freq
brekkin (2) - 24 freq
brakkan (2) - 9 freq
brakkin (2) - 35 freq
broken (2) - 174 freq
brocken (2) - 1 freq
blokkin (3) - 1 freq
drukken (3) - 12 freq
braken (3) - 16 freq
brisken (3) - 1 freq
brokin (3) - 3 freq
bracken (3) - 15 freq
bruken (3) - 1 freq
brekker (3) - 2 freq
crakken (3) - 1 freq
braiken (3) - 1 freq
brookin (3) - 1 freq
braaken (3) - 2 freq
brucken (3) - 5 freq
brakked (3) - 1 freq
brokk (3) - 8 freq
breuken (3) - 4 freq
SoundEx code - B625
brekkin - 24 freq
brawsome - 8 freq
bargain - 33 freq
b'wirkin - 1 freq
bracken - 15 freq
broken - 174 freq
barkin - 27 freq
brushin - 10 freq
breuken - 4 freq
breakin - 21 freq
brazen - 3 freq
break-in - 3 freq
burgundy - 6 freq
burgandy - 2 freq
brakkin - 35 freq
birsken - 1 freq
braken - 16 freq
brachen - 1 freq
brukken - 30 freq
bverycunt's - 1 freq
breakneck - 1 freq
brucken - 5 freq
bergamot - 5 freq
burgeonin - 1 freq
breaking - 12 freq
brakin - 15 freq
break-ins - 2 freq
birkenshaw - 3 freq
braakin - 2 freq
breekums - 1 freq
bracin - 2 freq
bargained - 2 freq
brokkan-herted - 1 freq
berkin - 1 freq
breakin' - 3 freq
bourachin - 1 freq
braisant - 1 freq
brousin - 2 freq
braaken - 2 freq
braiken - 1 freq
brigantine - 2 freq
browsin - 2 freq
brackin - 2 freq
brochan - 3 freq
brochanor - 2 freq
browsing - 2 freq
brocken - 1 freq
brekin - 7 freq
bargin - 2 freq
brickin - 2 freq
bruisin - 6 freq
barkan - 2 freq
broken-hairtit - 4 freq
bargain-basement - 1 freq
broken-herted - 2 freq
bargains - 7 freq
brogan - 1 freq
brokenshire - 1 freq
barjin - 2 freq
brekneck - 1 freq
broughan - 1 freq
brichens - 1 freq
briganer - 2 freq
briganers - 1 freq
brigander - 1 freq
brochen - 2 freq
bairgin - 1 freq
brokken - 17 freq
birzin - 3 freq
birsin - 4 freq
'broken - 1 freq
brokken-herted - 1 freq
bruckeen - 1 freq
brissan - 1 freq
brakkan - 9 freq
brassneck - 1 freq
brass-neck - 2 freq
birssin - 1 freq
braggin - 5 freq
bræsin - 1 freq
bergen - 2 freq
brokkan - 1 freq
brakken - 3 freq
brokin - 3 freq
brukken-doon - 1 freq
burgeon - 1 freq
brookiyn - 1 freq
brosnan - 1 freq
braisentlie - 1 freq
broken-doun - 1 freq
brushan - 1 freq
bursen - 6 freq
brigend - 1 freq
brecham - 5 freq
burssen - 1 freq
bergman - 2 freq
bragança - 1 freq
burgomaister - 1 freq
brookin - 1 freq
bark-an-bowff - 1 freq
braak-in - 1 freq
burgeoned - 1 freq
brookmyre - 1 freq
bargie-in - 1 freq
birzzin - 1 freq
berganin - 1 freq
brig-en - 1 freq
braisent - 2 freq
brigganeir - 1 freq
brigganers - 1 freq
briggin - 4 freq
brass-necked - 1 freq
breezing - 1 freq
broken-hearted - 2 freq
breckin - 2 freq
brisknortherly - 2 freq
brechin - 14 freq
brisken - 1 freq
brigands - 1 freq
€˜brigand - 1 freq
breezin - 1 freq
birsen - 1 freq
boroughmuir - 2 freq
browsin' - 1 freq
berkin--- - 1 freq
€œbrekkin - 1 freq
brakna - 1 freq
briganders - 8 freq
bargainin - 1 freq
bruken - 1 freq
braisantlik - 1 freq
berwick-on-tweed - 2 freq
brogans - 1 freq
brickin' - 1 freq
bhrochan - 1 freq
borisjohnson - 12 freq
bryson - 2 freq
brek-in - 1 freq
brakking - 1 freq
brucemcconachie - 1 freq
bericonforensic - 2 freq
breakneckcomedy - 1 freq
bargyin - 1 freq
barrassinÂ’ - 1 freq
bruising - 1 freq
barking - 1 freq
burgundyninja - 2 freq
'breakin - 1 freq
barcamilton - 1 freq
bergamp - 1 freq
bparkinson - 2 freq
brechincityfc - 2 freq
breaching - 1 freq
brigham - 1 freq
burkeman - 1 freq
MetaPhone code - BRKN
brekkin - 24 freq
bargain - 33 freq
bracken - 15 freq
broken - 174 freq
barkin - 27 freq
breuken - 4 freq
breakin - 21 freq
break-in - 3 freq
brakkin - 35 freq
braken - 16 freq
brukken - 30 freq
brucken - 5 freq
brakin - 15 freq
braakin - 2 freq
berkin - 1 freq
breakin' - 3 freq
braaken - 2 freq
braiken - 1 freq
brackin - 2 freq
brocken - 1 freq
brekin - 7 freq
brickin - 2 freq
barkan - 2 freq
brogan - 1 freq
brokken - 17 freq
'broken - 1 freq
bruckeen - 1 freq
brakkan - 9 freq
braggin - 5 freq
brokkan - 1 freq
brakken - 3 freq
brokin - 3 freq
brookiyn - 1 freq
bragança - 1 freq
brookin - 1 freq
braak-in - 1 freq
brig-en - 1 freq
briggin - 4 freq
breckin - 2 freq
berkin--- - 1 freq
€œbrekkin - 1 freq
brakna - 1 freq
bruken - 1 freq
brickin' - 1 freq
brek-in - 1 freq
'breakin - 1 freq
BROKKEN
Time to execute Levenshtein function - 0.178382 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.337922 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028263 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037583 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000926 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.