A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to breeze-blocks in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
breeze-blocks (0) - 1 freq
breezeblock (2) - 1 freq
breemsticks (6) - 1 freq
beef-links (6) - 1 freq
reeboks (6) - 1 freq
deevilocks (6) - 1 freq
breezers (6) - 1 freq
dreadlocks (6) - 1 freq
grey-black (6) - 1 freq
breek-legs (6) - 1 freq
breezes (6) - 2 freq
blue-black (6) - 2 freq
reek-less (7) - 1 freq
ooie-socks (7) - 1 freq
blocks (7) - 18 freq
freeze-dried (7) - 1 freq
re-stock (7) - 1 freq
bleach-blonde (7) - 1 freq
greeny-broun (7) - 1 freq
rejecks (7) - 1 freq
knee-breeks (7) - 1 freq
begecks (7) - 1 freq
breeze (7) - 80 freq
resembles (7) - 8 freq
freezes (7) - 4 freq
breeze-blocks (0) - 1 freq
breezeblock (4) - 1 freq
blue-black (9) - 2 freq
grey-black (9) - 1 freq
breemsticks (10) - 1 freq
breek-legs (10) - 1 freq
dreadlocks (10) - 1 freq
beef-links (10) - 1 freq
mirk-bleck (11) - 1 freq
breid-pieces (11) - 2 freq
jet-bleck (11) - 1 freq
ball-cocks (11) - 1 freq
bollocks (11) - 7 freq
brocks (11) - 6 freq
grew-back (11) - 1 freq
peat-black (11) - 1 freq
basewbtcks (11) - 1 freq
creativeblock (11) - 2 freq
spam-block (11) - 1 freq
republics (11) - 2 freq
bullocks (11) - 2 freq
reflecks (11) - 8 freq
tear-blobs (11) - 1 freq
crumlocks (11) - 1 freq
roadblock (11) - 3 freq
SoundEx code - B621
breakfast - 145 freq
bracky-bree - 4 freq
bare-as-birkie - 1 freq
brekfast - 13 freq
brakfast - 33 freq
brakfist - 4 freq
braakfist - 13 freq
braxficld - 1 freq
braxfield - 5 freq
braxfield's - 1 freq
breakfast's - 1 freq
brek-up - 1 freq
brekup - 1 freq
brakfaist - 1 freq
brigfoot - 3 freq
breach-birth - 1 freq
bric-a-brac - 2 freq
bargepole - 2 freq
bruckie-plate - 1 freq
brakkfast - 5 freq
breeze-blocks - 1 freq
breakfist - 2 freq
breakfasts - 4 freq
breekbaund - 1 freq
burrygaves - 3 freq
'burrygave' - 1 freq
birk-branch - 1 freq
brisbane - 1 freq
brakkfast's - 1 freq
brakkfaist - 2 freq
brakkfest - 1 freq
brakefast - 6 freq
brockville - 1 freq
brakfasts - 1 freq
brake-fast - 1 freq
brickbats - 2 freq
€œbrakkfast - 1 freq
breezeblock - 1 freq
brak-up - 1 freq
bar-keep - 1 freq
brokebackybrae - 1 freq
barackobama - 1 freq
breakfa - 1 freq
breakfastbirdwatch - 1 freq
brcf - 1 freq
broxburn - 5 freq
broxburnathfc - 6 freq
MetaPhone code - BRSBLKS
breeze-blocks - 1 freq
BREEZE-BLOCKS
Time to execute Levenshtein function - 0.231005 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.403616 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028203 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.038611 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000885 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.