A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to bardet in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
bardet (0) - 1 freq
barmet (1) - 1 freq
barnet (1) - 3 freq
cardee (2) - 1 freq
burden (2) - 24 freq
harden (2) - 21 freq
balder (2) - 1 freq
arden (2) - 2 freq
'barde' (2) - 1 freq
garnet (2) - 1 freq
market (2) - 51 freq
barkt (2) - 1 freq
barley (2) - 33 freq
marget (2) - 6 freq
carret (2) - 3 freq
fardel (2) - 1 freq
barter (2) - 3 freq
bidet (2) - 1 freq
basket (2) - 62 freq
barges (2) - 2 freq
bairded (2) - 4 freq
bandit (2) - 7 freq
baaket (2) - 1 freq
barber (2) - 7 freq
bordit (2) - 1 freq
bardet (0) - 1 freq
beardit (2) - 2 freq
bordit (2) - 1 freq
barnet (2) - 3 freq
barmet (2) - 1 freq
beret (3) - 3 freq
lardit (3) - 1 freq
bard (3) - 61 freq
bardic (3) - 5 freq
birden (3) - 1 freq
buirdit (3) - 1 freq
breet (3) - 35 freq
burde (3) - 1 freq
bret (3) - 3 freq
boarde (3) - 1 freq
bardie (3) - 1 freq
burdent (3) - 1 freq
bards (3) - 16 freq
bearded (3) - 8 freq
bardo (3) - 1 freq
buriet (3) - 14 freq
barkit (3) - 8 freq
bared (3) - 6 freq
bardin (3) - 1 freq
berde (3) - 2 freq
SoundEx code - B633
breathed - 21 freq
birthday - 204 freq
birthday-praisants - 1 freq
berthed - 1 freq
burthday - 9 freq
brodded - 1 freq
boarded - 5 freq
bearded - 8 freq
birthed - 3 freq
braidit - 3 freq
birthit - 3 freq
bruitit - 1 freq
burdwatcher - 1 freq
bairded - 4 freq
bordit - 1 freq
berthet - 1 freq
birthdae's - 1 freq
boorded - 1 freq
birthday-presents - 2 freq
birthday's - 1 freq
braithtakkin - 2 freq
boordit - 2 freq
breidit - 1 freq
burtit - 1 freq
birthdays - 5 freq
braeth-thick - 1 freq
birthday' - 2 freq
braided - 2 freq
berated - 1 freq
breadtht - 3 freq
bardheid - 1 freq
buirdit - 1 freq
brawtith - 1 freq
broddit - 2 freq
braid-oot - 1 freq
bird-watchin - 2 freq
breathtakin - 1 freq
beardit - 2 freq
bird-watchers - 1 freq
briadwood's - 1 freq
broth-time - 1 freq
braith-takin - 1 freq
beirdit - 1 freq
breithit - 1 freq
bardet - 1 freq
brodiedru - 11 freq
breathtaking - 2 freq
MetaPhone code - BRTT
brodded - 1 freq
boarded - 5 freq
bearded - 8 freq
braidit - 3 freq
bruitit - 1 freq
bairded - 4 freq
bordit - 1 freq
boorded - 1 freq
boordit - 2 freq
breidit - 1 freq
burtit - 1 freq
braided - 2 freq
berated - 1 freq
boortd - 1 freq
buirdit - 1 freq
broddit - 2 freq
braid-oot - 1 freq
beardit - 2 freq
beirdit - 1 freq
bardet - 1 freq
BARDET
Time to execute Levenshtein function - 0.218664 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.411973 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.029742 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.054547 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001166 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.