A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to breakfast in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
breakfast (0) - 138 freq
breakfasts (1) - 4 freq
brakfast (1) - 33 freq
breakfist (1) - 2 freq
brekfast (1) - 13 freq
brakfaist (2) - 1 freq
brakfasts (2) - 1 freq
breakfast's (2) - 1 freq
brakwast (2) - 4 freq
brakkfast (2) - 5 freq
braakfist (2) - 13 freq
brakfist (2) - 4 freq
brakefast (2) - 6 freq
breakfa (2) - 1 freq
broadcast (3) - 16 freq
breakers (3) - 2 freq
brakkfest (3) - 1 freq
breakage (3) - 1 freq
breaks (3) - 17 freq
steadfast (3) - 8 freq
breast (3) - 12 freq
brakkfaist (3) - 2 freq
i'brakfast (3) - 2 freq
buckfast (3) - 12 freq
brake-fast (3) - 1 freq
breakfast (0) - 138 freq
breakfist (1) - 2 freq
brekfast (1) - 13 freq
brakfast (1) - 33 freq
brakfist (2) - 4 freq
brakefast (2) - 6 freq
brakfaist (2) - 1 freq
braakfist (2) - 13 freq
breakfasts (2) - 4 freq
brakkfast (3) - 5 freq
brakwast (3) - 4 freq
brakfasts (3) - 1 freq
brake-fast (4) - 1 freq
buckfast (4) - 12 freq
brakkfest (4) - 1 freq
breakfast's (4) - 1 freq
breakfa (4) - 1 freq
brakkfaist (4) - 2 freq
i'brakfast (4) - 2 freq
belfast (5) - 46 freq
breast (5) - 12 freq
breakers (5) - 2 freq
breaks (5) - 17 freq
broadcast (5) - 16 freq
brast (6) - 1 freq
SoundEx code - B621
breakfast - 138 freq
bracky-bree - 4 freq
bare-as-birkie - 1 freq
brekfast - 13 freq
brakfast - 33 freq
brakfist - 4 freq
braakfist - 13 freq
breakfast's - 1 freq
brek-up - 1 freq
brekup - 1 freq
brakfaist - 1 freq
brigfoot - 3 freq
braxfield - 3 freq
breach-birth - 1 freq
bric-a-brac - 2 freq
bargepole - 2 freq
bruckie-plate - 1 freq
brakkfast - 5 freq
breeze-blocks - 1 freq
breakfist - 2 freq
breakfasts - 4 freq
breekbaund - 1 freq
burrygaves - 3 freq
'burrygave' - 1 freq
birk-branch - 1 freq
brisbane - 1 freq
brakkfast's - 1 freq
brakkfaist - 2 freq
brakkfest - 1 freq
brakefast - 6 freq
brockville - 1 freq
brakfasts - 1 freq
brake-fast - 1 freq
brickbats - 2 freq
€œbrakkfast - 1 freq
breezeblock - 1 freq
brak-up - 1 freq
bar-keep - 1 freq
brokebackybrae - 1 freq
barackobama - 1 freq
breakfa - 1 freq
breakfastbirdwatch - 1 freq
brcf - 1 freq
broxburn - 5 freq
broxburnathfc - 6 freq
MetaPhone code - BRKFST
breakfast - 138 freq
brekfast - 13 freq
brakfast - 33 freq
brakfist - 4 freq
braakfist - 13 freq
brakfaist - 1 freq
brakkfast - 5 freq
breakfist - 2 freq
brakkfaist - 2 freq
brakkfest - 1 freq
brakefast - 6 freq
brake-fast - 1 freq
€œbrakkfast - 1 freq
BREAKFAST
Time to execute Levenshtein function - 0.182335 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.364343 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027662 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037408 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000846 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.