A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

Similar words to bearded in Corpus

Methods: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated
Levenshtein - distance in parentheses
bearded (0) - 8 freq
beaded (1) - 1 freq
boarded (1) - 5 freq
rewarded (2) - 2 freq
braided (2) - 2 freq
reared (2) - 24 freq
beande (2) - 1 freq
beard (2) - 57 freq
earned (2) - 19 freq
bearsden (2) - 5 freq
hearted (2) - 5 freq
barred (2) - 15 freq
feared (2) - 35 freq
bended (2) - 4 freq
barked (2) - 9 freq
bearer (2) - 7 freq
branded (2) - 5 freq
herded (2) - 1 freq
guarded (2) - 2 freq
berde (2) - 2 freq
bared (2) - 6 freq
beardy (2) - 5 freq
regarded (2) - 5 freq
heared (2) - 14 freq
learned (2) - 79 freq
Double Levenshtein - distance in parentheses
bearded (0) - 8 freq
boarded (1) - 5 freq
bairded (2) - 4 freq
braided (2) - 2 freq
boorded (2) - 1 freq
beaded (2) - 1 freq
boarder (3) - 5 freq
beardit (3) - 2 freq
barged (3) - 2 freq
awarded (3) - 6 freq
beardy (3) - 5 freq
bedded (3) - 8 freq
beardies (3) - 1 freq
boarden (3) - 1 freq
beards (3) - 12 freq
beardie (3) - 1 freq
bared (3) - 6 freq
barbed (3) - 4 freq
beeried (3) - 14 freq
boarde (3) - 1 freq
bardet (3) - 1 freq
barked (3) - 9 freq
bended (3) - 4 freq
berde (3) - 2 freq
beard (3) - 57 freq
SoundEx code - B633
breathed - 21 freq
birthday - 204 freq
birthday-praisants - 1 freq
berthed - 1 freq
burthday - 9 freq
brodded - 1 freq
boarded - 5 freq
bearded - 8 freq
birthed - 3 freq
braidit - 3 freq
birthit - 3 freq
bruitit - 1 freq
burdwatcher - 1 freq
bairded - 4 freq
bordit - 1 freq
berthet - 1 freq
birthdae's - 1 freq
boorded - 1 freq
birthday-presents - 2 freq
birthday's - 1 freq
braithtakkin - 2 freq
boordit - 2 freq
breidit - 1 freq
burtit - 1 freq
birthdays - 5 freq
braeth-thick - 1 freq
birthday' - 2 freq
braided - 2 freq
berated - 1 freq
breadtht - 3 freq
bardheid - 1 freq
buirdit - 1 freq
brawtith - 1 freq
broddit - 2 freq
braid-oot - 1 freq
bird-watchin - 2 freq
breathtakin - 1 freq
beardit - 2 freq
bird-watchers - 1 freq
briadwood's - 1 freq
broth-time - 1 freq
braith-takin - 1 freq
beirdit - 1 freq
breithit - 1 freq
bardet - 1 freq
brodiedru - 11 freq
breathtaking - 2 freq
MetaPhone code - BRTT
brodded - 1 freq
boarded - 5 freq
bearded - 8 freq
braidit - 3 freq
bruitit - 1 freq
bairded - 4 freq
bordit - 1 freq
boorded - 1 freq
boordit - 2 freq
breidit - 1 freq
burtit - 1 freq
braided - 2 freq
berated - 1 freq
boortd - 1 freq
buirdit - 1 freq
broddit - 2 freq
braid-oot - 1 freq
beardit - 2 freq
beirdit - 1 freq
bardet - 1 freq
Manually curated lemma - BEARDED
Time to execute Levenshtein function - 0.215028 milliseconds
The Levenshtein distance is the minimum number of single-character replacements, insertions or deletions needed to transform one word into another. It is useful for detecting typos and alternative spellings.
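As a rough illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming calculation. It is not the code this site runs; the function name `levenshtein` is chosen here for illustration only.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance where each insert, delete or replace costs 1."""
    if len(a) < len(b):
        a, b = b, a  # keep the shorter word in the inner loop
    # previous[j] = distance between the processed prefix of a and b[:j]
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to the empty string
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (or keep it)
            ))
        previous = current
    return previous[-1]


if __name__ == "__main__":
    for word in ("bearded", "beaded", "boarded", "beard"):
        print(word, levenshtein("bearded", word))
```

Run against the query word bearded, this gives 1 for beaded and boarded and 2 for beard, in line with the distances listed in the results.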
Time to execute Double Levenshtein function - 0.369948 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
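The idea can be sketched in Python as below. This is an illustration under assumptions, not the site's own code: the vowel set is taken to be a, e, i, o, u (so y counts as a consonant), and the names `strip_vowels` and `double_levenshtein` are hypothetical.

```python
VOWELS = set("aeiou")  # assumption: y is treated as a consonant


def levenshtein(a: str, b: str) -> int:
    """Plain edit distance (insert, delete, replace each cost 1)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Return the word with its vowels removed, e.g. bearded -> brdd."""
    return "".join(c for c in word if c.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Full-word distance plus the distance between the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


if __name__ == "__main__":
    for word in ("boarded", "beaded", "bairded", "beard"):
        print(word, double_levenshtein("bearded", word))
```

With these assumptions, boarded scores 1 (one vowel changed, consonants identical), while beaded scores 2 because the missing r is counted in both passes. Both values agree with the Double Levenshtein results listed above.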
Time to execute SoundEx function - 0.029891 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
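A minimal Python sketch of a Soundex-style encoder follows. It implements a simplified variant in which vowels, h, w and y all break up runs of same-coded consonants. That choice is an assumption, made because it reproduces the B633 grouping shown above for words such as birthday; classic American Soundex treats h and w differently, and the implementation used by this site is not known.

```python
# Consonant groups and their digits; every other character is left uncoded.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word: str) -> str:
    """First letter plus up to three digits, padded with zeros to length 4."""
    word = word.lower()
    start = next((i for i, c in enumerate(word) if c.isalpha()), None)
    if start is None:
        return ""
    result = word[start].upper()
    previous = CODES.get(word[start], "")
    for ch in word[start + 1:]:
        code = CODES.get(ch, "")
        if code and code != previous:
            result += code
            if len(result) == 4:
                break
        # Uncoded characters (vowels, h, w, y, hyphens) reset the run,
        # so a repeated consonant code after them is emitted again.
        previous = code
    return result.ljust(4, "0")


if __name__ == "__main__":
    for word in ("bearded", "birthday", "boarded", "beard"):
        print(word, soundex(word))
```

Under this variant bearded, birthday and boarded all encode to B633, which is why they appear together in the SoundEx results, whereas the shorter beard encodes to B630.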
Time to execute MetaPhone function - 0.040359 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000910 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
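A lookup of this kind can be sketched in Python as a plain dictionary. Only the pairing of bearded with BEARDED comes from the results above; the other entries, the name `LEXICON` and both function names are hypothetical and stand in for the real hand-built table.

```python
from collections import defaultdict
from typing import Dict, List, Optional

# Word form -> lemma. A tiny stand-in for a hand-curated lexicon.
LEXICON: Dict[str, str] = {
    "bearded": "BEARDED",  # pairing shown in the results above
    "beardit": "BEARDED",  # hypothetical entry, for illustration only
    "bairded": "BEARDED",  # hypothetical entry, for illustration only
}


def lemma_for(word: str) -> Optional[str]:
    """Return the curated lemma, or None when the word is not covered."""
    return LEXICON.get(word.lower())


def variants_by_lemma(lexicon: Dict[str, str]) -> Dict[str, List[str]]:
    """Invert the table so each lemma lists its known spellings."""
    groups: Dict[str, List[str]] = defaultdict(list)
    for form, lemma in lexicon.items():
        groups[lemma].append(form)
    return dict(groups)


if __name__ == "__main__":
    print(lemma_for("bearded"))
    print(lemma_for("sonsie"))  # not in this toy table, so None
    print(variants_by_lemma(LEXICON))
```

A dictionary lookup does no string comparison across the corpus, which fits with this method having by far the shortest execution time reported above. The trade-off is coverage: a word missing from the table returns nothing.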