A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to roses in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
roses (0) - 102 freq
loses (1) - 6 freq
rones (1) - 3 freq
ross (1) - 100 freq
hoses (1) - 1 freq
rowes (1) - 10 freq
roeses (1) - 1 freq
roles (1) - 12 freq
roset (1) - 2 freq
noses (1) - 20 freq
ruses (1) - 1 freq
ropes (1) - 21 freq
roves (1) - 1 freq
rose (1) - 216 freq
rosies (1) - 2 freq
rises (1) - 41 freq
rose's (1) - 2 freq
moses (1) - 68 freq
doses (1) - 9 freq
robes (1) - 7 freq
poses (1) - 2 freq
raises (2) - 20 freq
fores (2) - 1 freq
rushes (2) - 19 freq
tosses (2) - 3 freq
roses (0) - 102 freq
roeses (1) - 1 freq
ruses (1) - 1 freq
rosies (1) - 2 freq
rises (1) - 41 freq
ross (1) - 100 freq
raises (2) - 20 freq
rossi (2) - 2 freq
arses (2) - 6 freq
erses (2) - 23 freq
russ (2) - 2 freq
ryss (2) - 3 freq
riss (2) - 2 freq
ruises (2) - 1 freq
reeses (2) - 1 freq
arises (2) - 5 freq
eross (2) - 1 freq
irises (2) - 1 freq
poses (2) - 2 freq
doses (2) - 9 freq
noses (2) - 20 freq
robes (2) - 7 freq
roset (2) - 2 freq
roles (2) - 12 freq
hoses (2) - 1 freq
SoundEx code - R220
riches - 23 freq
raxes - 28 freq
roses - 102 freq
rises - 41 freq
reaches - 21 freq
rochs - 1 freq
rushes - 19 freq
raises - 20 freq
rashes - 29 freq
raucous - 8 freq
rakes - 6 freq
ruckus - 2 freq
rejoice - 10 freq
rages - 7 freq
rejig - 1 freq
ruses - 1 freq
rizzio's - 17 freq
rejyyce - 1 freq
reekie's - 5 freq
rogues - 15 freq
races - 17 freq
rogueys - 1 freq
rosie's - 4 freq
roughage - 1 freq
rice's - 2 freq
rucksack - 9 freq
recess - 3 freq
rejeck - 2 freq
reeses - 1 freq
rissies - 1 freq
rosies - 2 freq
rehashes - 1 freq
rackwick - 1 freq
rucksacks - 2 freq
rugas - 1 freq
racous - 1 freq
rockies - 1 freq
rogues' - 1 freq
riggies - 1 freq
rose's - 2 freq
rejyce - 1 freq
rosehauch - 1 freq
recaws - 2 freq
rashis - 1 freq
rogie's - 1 freq
ruises - 1 freq
roeses - 1 freq
reassess - 1 freq
rejecks - 1 freq
“rosies - 1 freq
rojas - 1 freq
rogic - 3 freq
rossies - 1 freq
raucouse - 1 freq
rogueish - 1 freq
rkos - 1 freq
MetaPhone code - RSS
roses - 102 freq
rises - 41 freq
raises - 20 freq
ruses - 1 freq
ross's - 3 freq
rizzio's - 17 freq
races - 17 freq
rosie's - 4 freq
rice's - 2 freq
recess - 3 freq
reeses - 1 freq
rissies - 1 freq
rosies - 2 freq
rose's - 2 freq
ruises - 1 freq
roeses - 1 freq
reassess - 1 freq
“rosies - 1 freq
rossies - 1 freq
ROSES
Time to execute Levenshtein function - 0.202385 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.483945 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.031159 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.038379 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.004038 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.