A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to ruder in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
ruder (0) - 2 freq
ruler (1) - 10 freq
rudder (1) - 4 freq
rider (1) - 10 freq
ryder (1) - 1 freq
rude (1) - 37 freq
cruder (1) - 1 freq
ruders (1) - 1 freq
ouler (2) - 8 freq
bucer (2) - 1 freq
older (2) - 27 freq
ruse (2) - 1 freq
rodger (2) - 4 freq
bude (2) - 14 freq
puer (2) - 1 freq
otder (2) - 1 freq
odder (2) - 1 freq
'under (2) - 2 freq
bunder (2) - 1 freq
rupert (2) - 17 freq
murder (2) - 86 freq
dudes (2) - 1 freq
uner (2) - 2 freq
rymer (2) - 1 freq
rimer (2) - 1 freq
ruder (0) - 2 freq
rider (1) - 10 freq
ryder (1) - 1 freq
reader (2) - 89 freq
order (2) - 277 freq
raider (2) - 1 freq
reeder (2) - 3 freq
ruler (2) - 10 freq
radar (2) - 11 freq
reider (2) - 5 freq
rude (2) - 37 freq
rudder (2) - 4 freq
cruder (2) - 1 freq
ruders (2) - 1 freq
rides (3) - 16 freq
rinder (3) - 1 freq
raedir (3) - 1 freq
redder (3) - 15 freq
der (3) - 303 freq
rower (3) - 2 freq
rounder (3) - 1 freq
edder (3) - 21 freq
idder (3) - 286 freq
owder (3) - 1 freq
lauder (3) - 6 freq
SoundEx code - R360
raither - 273 freq
reader - 89 freq
retour - 32 freq
rotary - 3 freq
rather - 114 freq
rudder - 4 freq
reider - 5 freq
radar - 11 freq
retire - 11 freq
rattray - 2 freq
reeder - 3 freq
rider - 10 freq
rattra - 4 freq
reyther - 1 freq
rether - 53 freq
redder - 15 freq
rae-deer - 1 freq
ridder - 2 freq
reed-raa - 1 freq
redrew - 1 freq
rither - 6 freq
riter - 1 freq
'retro' - 1 freq
reedware - 1 freq
rhethorie - 1 freq
retro - 8 freq
reddir - 6 freq
router - 1 freq
reidware - 1 freq
raether - 1 freq
redraw - 2 freq
reidder - 2 freq
roodery - 1 freq
raider - 1 freq
ruder - 2 freq
ruther - 3 freq
ratier - 1 freq
retir - 1 freq
raitir - 1 freq
ryder - 1 freq
raedir - 1 freq
MetaPhone code - RTR
reader - 89 freq
retour - 32 freq
rotary - 3 freq
rudder - 4 freq
reider - 5 freq
radar - 11 freq
retire - 11 freq
writer - 71 freq
rattray - 2 freq
reeder - 3 freq
rider - 10 freq
rattra - 4 freq
redder - 15 freq
rae-deer - 1 freq
ridder - 2 freq
reed-raa - 1 freq
redrew - 1 freq
riter - 1 freq
'retro' - 1 freq
retro - 8 freq
reddir - 6 freq
router - 1 freq
redraw - 2 freq
reidder - 2 freq
roodery - 1 freq
raider - 1 freq
ruder - 2 freq
ratier - 1 freq
retir - 1 freq
raitir - 1 freq
ryder - 1 freq
raedir - 1 freq
RUDER
Time to execute Levenshtein function - 0.201358 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.426753 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.029000 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.042891 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001315 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.