A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

Similar words to birthday in Corpus

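Results are listed method by method, in this order: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated. A number in brackets is the distance from birthday; freq is the word's frequency in the corpus.

Levenshtein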
birthday (0) - 199 freq
birthdays (1) - 5 freq
birthday' (1) - 2 freq
burthday (1) - 9 freq
birffday (2) - 1 freq
birthy (2) - 1 freq
birthday's (2) - 1 freq
birthlan (2) - 1 freq
birth (3) - 92 freq
on-birthday (3) - 3 freq
birthers (3) - 1 freq
airthly (3) - 1 freq
births (3) - 6 freq
bethray (3) - 2 freq
birthit (3) - 3 freq
brawday (3) - 9 freq
birthed (3) - 3 freq
birthdae's (3) - 1 freq
birther (3) - 2 freq
riteday (3) - 1 freq
birthin (3) - 6 freq
birthrate (3) - 1 freq
buthlay (3) - 1 freq
wirthy (3) - 7 freq
firstday (3) - 1 freq

Double Levenshtein
birthday (0) - 199 freq
burthday (1) - 9 freq
birthdays (2) - 5 freq
birthday' (2) - 2 freq
birthed (3) - 3 freq
birthy (3) - 1 freq
birther (4) - 2 freq
births (4) - 6 freq
birthit (4) - 3 freq
berthed (4) - 1 freq
birthin (4) - 6 freq
birffday (4) - 1 freq
birthday's (4) - 1 freq
birthlan (4) - 1 freq
birth (4) - 92 freq
barthol (5) - 1 freq
berth (5) - 21 freq
borthw (5) - 2 freq
borthel (5) - 1 freq
burghead (5) - 3 freq
berthet (5) - 1 freq
brithal (5) - 2 freq
berthen (5) - 3 freq
burthen (5) - 6 freq
garthdee (5) - 7 freq

SoundEx code - B633
breathed - 20 freq
birthday - 199 freq
birthday-praisants - 1 freq
berthed - 1 freq
burthday - 9 freq
brodded - 1 freq
boarded - 5 freq
bearded - 7 freq
birthed - 3 freq
braidit - 3 freq
birthit - 3 freq
bruitit - 1 freq
burdwatcher - 1 freq
bairded - 4 freq
bordit - 1 freq
berthet - 1 freq
birthdae's - 1 freq
boorded - 1 freq
birthday-presents - 2 freq
birthday's - 1 freq
braithtakkin - 2 freq
boordit - 2 freq
breidit - 1 freq
burtit - 1 freq
birthdays - 5 freq
braeth-thick - 1 freq
birthday' - 2 freq
braided - 2 freq
berated - 1 freq
breadtht - 3 freq
bardheid - 1 freq
buirdit - 1 freq
brawtith - 1 freq
broddit - 2 freq
braid-oot - 1 freq
bird-watchin - 2 freq
breathtakin - 1 freq
beardit - 2 freq
bird-watchers - 1 freq
briadwood's - 1 freq
broth-time - 1 freq
braith-takin - 1 freq
beirdit - 1 freq
breithit - 1 freq
bardet - 1 freq
brodiedru - 11 freq
breathtaking - 2 freq

MetaPhone code - BR0T
breathed - 20 freq
birthday - 199 freq
berthed - 1 freq
burthday - 9 freq
birthed - 3 freq
birthit - 3 freq
berthet - 1 freq
birthday' - 2 freq
breithit - 1 freq

Manually curated
BIRTHDAY

Time to execute Levenshtein function - 0.195453 milliseconds
The Levenshtein distance is the minimum number of single characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
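To make the definition concrete, here is a minimal Python sketch of the standard dynamic-programming approach. The function name `levenshtein` is chosen for illustration only; the site's own implementation is not shown on this page and may differ.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character insertions, deletions or
    substitutions needed to turn a into b."""
    # previous[j] holds the distance between the processed prefix of a
    # and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]  # turning a[:i] into the empty string takes i deletions
        for j, char_b in enumerate(b, start=1):
            substitution_cost = 0 if char_a == char_b else 1
            current.append(min(
                previous[j] + 1,                      # delete char_a
                current[j - 1] + 1,                   # insert char_b
                previous[j - 1] + substitution_cost,  # substitute or match
            ))
        previous = current
    return previous[-1]


print(levenshtein("birthday", "burthday"))  # 1
print(levenshtein("birthday", "birth"))     # 3
```

Both results agree with the distances shown in the Levenshtein list for burthday and birth.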
Time to execute Double Levenshtein function - 0.366179 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
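The doubled measure can be sketched in Python as follows. This is an illustration under assumptions, not the site's code: the names `levenshtein`, `strip_vowels` and `double_levenshtein` are invented here, and the vowel set `aeiouy` is a guess. Treating y as a vowel happens to reproduce the distances listed for birthed (3) and birth (4), but the page does not state which letters are dropped.

```python
VOWELS = set("aeiouy")  # assumption: the page does not say which letters count as vowels


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance (insert, delete, substitute)."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop every character in VOWELS, keeping consonants and punctuation."""
    return "".join(ch for ch in word if ch not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the vowel-less words."""
    a, b = a.lower(), b.lower()
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("birthday", "burthday"))   # 1
print(double_levenshtein("birthday", "birthdays"))  # 2
```

A vowel-only variation such as burthday keeps its plain distance of 1, while a consonant change such as the added s in birthdays is counted in both passes.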
Time to execute SoundEx function - 0.030485 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
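As an illustration, here is a small Python sketch of a Soundex-style encoder. It is written under an assumption: h and w are treated like vowels, so they separate repeated codes. That choice is inferred from the code B633 shown for birthday (the classic rule, where h does not separate t from d, would give B630); the function the site actually calls is not named on this page. The names `SOUNDEX_DIGITS` and `soundex` are hypothetical.

```python
# Letter groups that share a Soundex digit; every other letter has no digit.
SOUNDEX_DIGITS = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for letter in letters:
        SOUNDEX_DIGITS[letter] = digit


def soundex(word: str) -> str:
    """Four-character code: first letter plus up to three digits.

    Assumption: vowels, h, w and y all reset the 'previous digit', so the
    same digit can repeat across them. Non-letters are ignored entirely.
    """
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    code = letters[0].upper()
    last_digit = SOUNDEX_DIGITS.get(letters[0], "")
    for letter in letters[1:]:
        digit = SOUNDEX_DIGITS.get(letter, "")
        if digit and digit != last_digit:
            code += digit
            if len(code) == 4:
                break
        last_digit = digit
    return code.ljust(4, "0")  # pad short codes with zeros


print(soundex("birthday"))  # B633
print(soundex("breathed"))  # B633
```

Because both words reduce to B633, they land in the same bucket, which is why breathed appears in the SoundEx list for birthday.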
Time to execute MetaPhone function - 0.038501 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000874 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
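A lookup of this kind can be sketched in Python as a plain dictionary. The table contents below are hypothetical examples made up for illustration (only the lemma BIRTHDAY appears in the results above), and the names `LEXICON` and `lemma_for` are likewise assumptions.

```python
from typing import Optional

# Hypothetical hand-curated entries: surface form -> lemma.
LEXICON = {
    "birthday": "BIRTHDAY",
    "birthdays": "BIRTHDAY",
    "burthday": "BIRTHDAY",
}


def lemma_for(word: str) -> Optional[str]:
    """Return the curated lemma for word, or None if the word is not covered."""
    return LEXICON.get(word.lower())


print(lemma_for("Burthday"))  # BIRTHDAY
print(lemma_for("ablow"))     # None (not in this toy table)
```

A dictionary lookup does no string comparison across the corpus, which fits with this method having the shortest execution time reported above.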