A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to reader in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
reader (0) - 89 freq
leader (1) - 58 freq
header (1) - 2 freq
render (1) - 8 freq
reider (1) - 5 freq
redder (1) - 15 freq
reaper (1) - 16 freq
eader (1) - 1 freq
readers (1) - 87 freq
reeder (1) - 3 freq
rendert (2) - 3 freq
leaders (2) - 30 freq
fender (2) - 10 freq
nearer (2) - 72 freq
bender (2) - 4 freq
reiver (2) - 22 freq
broader (2) - 1 freq
beaker (2) - 6 freq
dreamer (2) - 2 freq
ruder (2) - 2 freq
rear (2) - 29 freq
readers' (2) - 2 freq
ridder (2) - 2 freq
leaden (2) - 1 freq
meander (2) - 2 freq
reader (0) - 89 freq
reider (1) - 5 freq
reeder (1) - 3 freq
leader (2) - 58 freq
render (2) - 8 freq
rider (2) - 10 freq
raider (2) - 1 freq
ryder (2) - 1 freq
ruder (2) - 2 freq
readers (2) - 87 freq
header (2) - 2 freq
redder (2) - 15 freq
reaper (2) - 16 freq
eader (2) - 1 freq
radar (2) - 11 freq
neider (3) - 1 freq
herder (3) - 3 freq
roodery (3) - 1 freq
reapir (3) - 1 freq
reiden (3) - 1 freq
rudder (3) - 4 freq
yarder (3) - 1 freq
reeden (3) - 3 freq
trader (3) - 9 freq
rawer (3) - 1 freq
SoundEx code - R360
raither - 273 freq
reader - 89 freq
retour - 32 freq
rotary - 3 freq
rather - 114 freq
rudder - 4 freq
reider - 5 freq
radar - 11 freq
retire - 10 freq
rattray - 2 freq
reeder - 3 freq
rider - 10 freq
rattra - 4 freq
reyther - 1 freq
rether - 53 freq
redder - 15 freq
rae-deer - 1 freq
ridder - 2 freq
reed-raa - 1 freq
redrew - 1 freq
rither - 6 freq
riter - 1 freq
'retro' - 1 freq
reedware - 1 freq
rhethorie - 1 freq
retro - 8 freq
reddir - 6 freq
router - 1 freq
reidware - 1 freq
raether - 1 freq
redraw - 2 freq
reidder - 2 freq
roodery - 1 freq
raider - 1 freq
ruder - 2 freq
ruther - 3 freq
ratier - 1 freq
retir - 1 freq
raitir - 1 freq
ryder - 1 freq
raedir - 1 freq
MetaPhone code - RTR
reader - 89 freq
retour - 32 freq
rotary - 3 freq
rudder - 4 freq
reider - 5 freq
radar - 11 freq
retire - 10 freq
writer - 65 freq
rattray - 2 freq
reeder - 3 freq
rider - 10 freq
rattra - 4 freq
redder - 15 freq
rae-deer - 1 freq
ridder - 2 freq
reed-raa - 1 freq
redrew - 1 freq
riter - 1 freq
'retro' - 1 freq
retro - 8 freq
reddir - 6 freq
router - 1 freq
redraw - 2 freq
reidder - 2 freq
roodery - 1 freq
raider - 1 freq
ruder - 2 freq
ratier - 1 freq
retir - 1 freq
raitir - 1 freq
ryder - 1 freq
raedir - 1 freq
READER
read - 926 freq
reader - 89 freq
reads - 55 freq
reading - 89 freq
readin - 366 freq
Time to execute Levenshtein function - 0.304513 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.518648 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.062924 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.036672 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000962 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.