A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to blackcockha in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
blackcockha (0) - 1 freq
blackcock (2) - 2 freq
bleckcocks (3) - 1 freq
blackjack (4) - 1 freq
blackhaw (5) - 1 freq
black-hack (5) - 1 freq
backpacker (5) - 3 freq
backchat (5) - 4 freq
blackwood (5) - 4 freq
black-oot (5) - 1 freq
blackoots (5) - 1 freq
blackoot (5) - 2 freq
blacksmiths (5) - 1 freq
back-chat (5) - 1 freq
backpack (5) - 5 freq
blackcurran (5) - 1 freq
backpackin (5) - 1 freq
blackpool (5) - 16 freq
black-out (5) - 1 freq
black-back (5) - 1 freq
blackcap (5) - 1 freq
blackpoolfc (5) - 1 freq
blackford's (5) - 1 freq
banknockb (5) - 1 freq
black-backed (5) - 1 freq
blackcockha (0) - 1 freq
blackcock (3) - 2 freq
bleckcocks (4) - 1 freq
blackjack (6) - 1 freq
black-backed (8) - 1 freq
blackcap (8) - 1 freq
blackwatch (8) - 1 freq
backpackin (8) - 1 freq
black-back (8) - 1 freq
blackface (8) - 3 freq
blacksmith (8) - 4 freq
blackcurran (8) - 1 freq
backcloth (8) - 1 freq
blackneuk (8) - 1 freq
backpacks (8) - 1 freq
backpacker (8) - 3 freq
backpack (8) - 5 freq
black-hack (8) - 1 freq
knockha (9) - 1 freq
bellyknock (9) - 1 freq
bladenoch (9) - 1 freq
blackford (9) - 6 freq
bladnoch (9) - 3 freq
luckche (9) - 1 freq
ball-cocks (9) - 1 freq
SoundEx code - B422
bellochs - 1 freq
bellises - 1 freq
bulges - 2 freq
bleezes - 6 freq
blackest - 4 freq
bellycastle - 3 freq
bellicose - 1 freq
balzac - 1 freq
blazes - 1 freq
blushes - 3 freq
blokes - 8 freq
blisses - 3 freq
blackie's - 2 freq
blackjack - 1 freq
bluschis - 1 freq
bloosis - 1 freq
blouses - 2 freq
biological - 5 freq
blakaz - 1 freq
byological - 1 freq
blasius - 1 freq
bleckcocks - 1 freq
belches - 1 freq
blackies - 1 freq
ball-cocks - 1 freq
blackcock - 2 freq
blackcockha - 1 freq
black-hack - 1 freq
blogosphere - 1 freq
billykayscot - 341 freq
blxwakw - 1 freq
blazespage - 22 freq
blessesd - 1 freq
blackislepmd - 4 freq
bloke's - 2 freq
ballycastle - 3 freq
MetaPhone code - BLKKKH
blackcockha - 1 freq
BLACKCOCKHA
Time to execute Levenshtein function - 0.192527 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.391640 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028776 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.039098 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000949 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.