A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to remind in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
remind (0) - 39 freq
remine (1) - 2 freq
remin (1) - 1 freq
reminds (1) - 22 freq
remand (1) - 2 freq
rewind (1) - 3 freq
remindet (2) - 4 freq
keind (2) - 1 freq
refine (2) - 4 freq
redmond (2) - 2 freq
remixed (2) - 1 freq
remaned (2) - 1 freq
remain (2) - 39 freq
remains (2) - 61 freq
secind (2) - 1 freq
remindit (2) - 7 freq
relied (2) - 4 freq
reine (2) - 2 freq
resend (2) - 1 freq
reins (2) - 21 freq
resins (2) - 1 freq
resin (2) - 6 freq
reminis (2) - 1 freq
mind (2) - 2299 freq
reeined (2) - 1 freq
remind (0) - 39 freq
remand (1) - 2 freq
remaned (2) - 1 freq
remained (2) - 17 freq
rewind (2) - 3 freq
remin (2) - 1 freq
remine (2) - 2 freq
reminds (2) - 22 freq
reined (3) - 2 freq
reminder (3) - 19 freq
remant (3) - 1 freq
remaint (3) - 1 freq
'mind (3) - 17 freq
remead (3) - 2 freq
reminded (3) - 25 freq
remord (3) - 2 freq
remindin (3) - 19 freq
remeid (3) - 7 freq
emond (3) - 1 freq
demand (3) - 51 freq
rhind (3) - 1 freq
raymond (3) - 2 freq
repond (3) - 1 freq
-mind (3) - 1 freq
reamin (3) - 6 freq
SoundEx code - R553
romantic - 42 freq
remained - 17 freq
reminds - 22 freq
reminded - 25 freq
renowned - 4 freq
remanded - 1 freq
remnant - 7 freq
remeent - 2 freq
remounted - 1 freq
reminder - 19 freq
remind - 39 freq
remand - 2 freq
remindin - 19 freq
romanticised - 1 freq
remnants - 11 freq
remindit - 7 freq
rome-nut - 1 freq
remant - 1 freq
remainder - 3 freq
remainit - 1 freq
remindet - 4 freq
raymond - 2 freq
romuntic - 5 freq
raiment - 2 freq
ruminate - 1 freq
remaned - 1 freq
re-naimed - 1 freq
romanticisin - 4 freq
remindan - 1 freq
remaint - 1 freq
€˜romantic - 1 freq
renamed - 5 freq
romantics - 1 freq
€œreminder - 1 freq
reminders - 4 freq
romanticise - 1 freq
romanticism - 1 freq
renownit - 1 freq
rainhandss - 2 freq
raymondsoltysek - 8 freq
reminding - 2 freq
raymond's - 1 freq
raymondbesant - 3 freq
MetaPhone code - RMNT
remained - 17 freq
remeent - 2 freq
remind - 39 freq
remand - 2 freq
rome-nut - 1 freq
remant - 1 freq
remainit - 1 freq
raymond - 2 freq
raiment - 2 freq
ruminate - 1 freq
remaned - 1 freq
remaint - 1 freq
REMIND
mind - 2299 freq
minds - 149 freq
minded - 96 freq
mindet - 2 freq
minder - 4 freq
minde - freq
mynd - 255 freq
mynded - 9 freq
remind - 39 freq
reminds - 22 freq
'mind - 17 freq
mindit - 162 freq
Time to execute Levenshtein function - 0.395541 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.713527 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.031204 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.070671 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000975 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.