A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to belated in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
belated (0) - 6 freq
elated (1) - 3 freq
berated (1) - 1 freq
beleted (1) - 1 freq
related (1) - 28 freq
belted (1) - 14 freq
blasted (2) - 7 freq
beaded (2) - 1 freq
bellied (2) - 3 freq
blared (2) - 3 freq
melted (2) - 10 freq
bolted (2) - 19 freq
befaaed (2) - 1 freq
blawed (2) - 9 freq
belatedly (2) - 1 freq
debated (2) - 3 freq
'belated' (2) - 1 freq
bloated (2) - 1 freq
plated (2) - 3 freq
relatet (2) - 2 freq
pelted (2) - 5 freq
behatted (2) - 1 freq
belter (2) - 34 freq
relayed (2) - 1 freq
beaten (2) - 20 freq
belated (0) - 6 freq
belted (1) - 14 freq
beleted (1) - 1 freq
bleated (2) - 1 freq
bloated (2) - 1 freq
bolted (2) - 19 freq
elated (2) - 3 freq
related (2) - 28 freq
berated (2) - 1 freq
boalted (3) - 1 freq
slated (3) - 1 freq
blaaed (3) - 1 freq
deleted (3) - 5 freq
belied (3) - 1 freq
bested (3) - 2 freq
beltel (3) - 4 freq
belten (3) - 1 freq
elted (3) - 1 freq
dilated (3) - 1 freq
beluved (3) - 1 freq
beleved (3) - 2 freq
beloved (3) - 25 freq
blamed (3) - 10 freq
blate (3) - 82 freq
belatedly (3) - 1 freq
SoundEx code - B433
buildit - 1 freq
bieldit - 5 freq
bielded - 3 freq
belted - 14 freq
beltit - 14 freq
bleatit - 2 freq
blaudit - 2 freq
bultit - 1 freq
billeted - 1 freq
bluidied - 2 freq
bolted - 19 freq
blid-wat - 1 freq
blytheheid - 1 freq
bull-heidit - 2 freq
baldy-heided - 1 freq
bloatit - 3 freq
blotted - 5 freq
boltit - 4 freq
bolt-haedit - 1 freq
bloated - 1 freq
bluidwytin-fain - 1 freq
belated - 6 freq
bauld-heidit - 1 freq
blottit - 1 freq
bald-heided - 1 freq
beildit - 4 freq
bladdit - 4 freq
bauldy-heid - 1 freq
belatedly - 1 freq
bloodit - 1 freq
boltet - 1 freq
bltidlink - 1 freq
boalted - 1 freq
bleated - 1 freq
bloodied - 1 freq
bluetooth - 2 freq
beleted - 1 freq
'belated' - 1 freq
MetaPhone code - BLTT
buildit - 1 freq
bieldit - 5 freq
bielded - 3 freq
belted - 14 freq
beltit - 14 freq
bleatit - 2 freq
blaudit - 2 freq
bultit - 1 freq
billeted - 1 freq
bluidied - 2 freq
blighted - 2 freq
blutdie - 1 freq
bolted - 19 freq
bloatit - 3 freq
blotted - 5 freq
boltit - 4 freq
bloated - 1 freq
belated - 6 freq
blottit - 1 freq
beildit - 4 freq
bladdit - 4 freq
bloodit - 1 freq
boltet - 1 freq
boalted - 1 freq
bleated - 1 freq
bloodied - 1 freq
beleted - 1 freq
'belated' - 1 freq
BELATED
Time to execute Levenshtein function - 0.299812 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.603238 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.031375 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.055412 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000836 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.