A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to “rosies in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
“rosies (0) - 1 freq
rosies (2) - 2 freq
‘rose (3) - 1 freq
rosie's (3) - 5 freq
cosies (3) - 1 freq
frosties (3) - 1 freq
bosies (3) - 18 freq
birsies (3) - 1 freq
sprosie (3) - 1 freq
trowies (3) - 1 freq
curtsies (3) - 1 freq
rowies (3) - 16 freq
posies (3) - 6 freq
roses (3) - 102 freq
poosies (3) - 9 freq
brosie (3) - 11 freq
tronies (3) - 2 freq
rosines (3) - 1 freq
rossies (3) - 1 freq
phrasies (3) - 1 freq
hoosies (3) - 9 freq
moosies (3) - 22 freq
rosie (3) - 78 freq
proxies (3) - 1 freq
cronies (3) - 41 freq
“rosies (0) - 1 freq
rosies (4) - 2 freq
phrasies (5) - 1 freq
‘rose (5) - 1 freq
birsies (5) - 1 freq
roses (5) - 102 freq
cross (6) - 260 freq
“elvis (6) - 1 freq
roeses (6) - 1 freq
eross (6) - 1 freq
“foo's (6) - 1 freq
“this (6) - 3 freq
crises (6) - 1 freq
brouses (6) - 2 freq
horses (6) - 113 freq
viruses (6) - 3 freq
alresis (6) - 1 freq
airses (6) - 3 freq
gross (6) - 6 freq
verses (6) - 59 freq
horsis (6) - 3 freq
“wis (6) - 1 freq
akross (6) - 2 freq
erses (6) - 23 freq
preses (6) - 15 freq
SoundEx code - R220
riches - 23 freq
raxes - 29 freq
roses - 102 freq
rises - 42 freq
reaches - 24 freq
rochs - 1 freq
rushes - 20 freq
raises - 24 freq
rashes - 30 freq
raucous - 8 freq
rakes - 6 freq
ruckus - 2 freq
rejoice - 10 freq
rages - 9 freq
rejig - 1 freq
ruses - 1 freq
rizzio's - 17 freq
rejyyce - 1 freq
reekie's - 5 freq
rogues - 15 freq
rucksack - 11 freq
rescues - 1 freq
rashees - 3 freq
rucksacks - 3 freq
rosie's - 5 freq
races - 17 freq
rogueys - 1 freq
roughage - 1 freq
rice's - 2 freq
recess - 3 freq
rejeck - 2 freq
reeses - 1 freq
rissies - 1 freq
rosies - 2 freq
rehashes - 1 freq
rackwick - 1 freq
rugas - 1 freq
racous - 1 freq
rockies - 1 freq
rogues' - 1 freq
riggies - 1 freq
rose's - 2 freq
rejyce - 1 freq
rosehauch - 1 freq
recaws - 2 freq
rashis - 1 freq
rogie's - 1 freq
ruises - 1 freq
roeses - 1 freq
reassess - 1 freq
rejecks - 1 freq
“rosies - 1 freq
rojas - 1 freq
rogic - 3 freq
rossies - 1 freq
raucouse - 1 freq
rogueish - 1 freq
rkos - 1 freq
MetaPhone code - RSS
roses - 102 freq
rises - 42 freq
raises - 24 freq
ruses - 1 freq
ross's - 3 freq
rizzio's - 17 freq
rosie's - 5 freq
races - 17 freq
rice's - 2 freq
recess - 3 freq
reeses - 1 freq
rissies - 1 freq
rosies - 2 freq
rose's - 2 freq
ruises - 1 freq
roeses - 1 freq
reassess - 1 freq
“rosies - 1 freq
rossies - 1 freq
“ROSIES
Time to execute Levenshtein function - 0.273901 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.522740 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.033088 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.047590 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001300 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.