A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to rossies in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
rossies (0) - 1 freq
rosies (1) - 2 freq
rissies (1) - 1 freq
rossie (1) - 3 freq
crosses (2) - 11 freq
bosies (2) - 18 freq
bosses (2) - 9 freq
cosies (2) - 1 freq
tossils (2) - 1 freq
dossiers (2) - 1 freq
posies (2) - 6 freq
poesies (2) - 2 freq
posties (2) - 3 freq
fossils (2) - 10 freq
oossie (2) - 1 freq
mossie (2) - 4 freq
rosie's (2) - 5 freq
roadies (2) - 4 freq
rosines (2) - 1 freq
hossie (2) - 1 freq
dosses (2) - 2 freq
rosie (2) - 78 freq
mosses (2) - 6 freq
ross's (2) - 3 freq
hussies (2) - 1 freq
rossies (0) - 1 freq
rissies (1) - 1 freq
rosies (2) - 2 freq
rossie (2) - 3 freq
kissies (3) - 1 freq
vossis (3) - 2 freq
tassies (3) - 3 freq
missies (3) - 1 freq
jassies (3) - 1 freq
bussies (3) - 1 freq
mossis (3) - 1 freq
tosses (3) - 3 freq
lassies (3) - 308 freq
losses (3) - 15 freq
cassies (3) - 6 freq
rissie (3) - 1 freq
roses (3) - 102 freq
roasties (3) - 1 freq
rossi (3) - 2 freq
jessies (3) - 3 freq
pressies (3) - 3 freq
hussies (3) - 1 freq
roeses (3) - 1 freq
aussies (3) - 1 freq
rosaries (3) - 3 freq
SoundEx code - R220
riches - 23 freq
raxes - 29 freq
roses - 102 freq
rises - 42 freq
reaches - 24 freq
rochs - 1 freq
rushes - 20 freq
raises - 24 freq
rashes - 30 freq
raucous - 8 freq
rakes - 6 freq
ruckus - 2 freq
rejoice - 10 freq
rages - 9 freq
rejig - 1 freq
ruses - 1 freq
rizzio's - 17 freq
rejyyce - 1 freq
reekie's - 5 freq
rogues - 15 freq
rucksack - 11 freq
rescues - 1 freq
rashees - 3 freq
rucksacks - 3 freq
rosie's - 5 freq
races - 17 freq
rogueys - 1 freq
roughage - 1 freq
rice's - 2 freq
recess - 3 freq
rejeck - 2 freq
reeses - 1 freq
rissies - 1 freq
rosies - 2 freq
rehashes - 1 freq
rackwick - 1 freq
rugas - 1 freq
racous - 1 freq
rockies - 1 freq
rogues' - 1 freq
riggies - 1 freq
rose's - 2 freq
rejyce - 1 freq
rosehauch - 1 freq
recaws - 2 freq
rashis - 1 freq
rogie's - 1 freq
ruises - 1 freq
roeses - 1 freq
reassess - 1 freq
rejecks - 1 freq
“rosies - 1 freq
rojas - 1 freq
rogic - 3 freq
rossies - 1 freq
raucouse - 1 freq
rogueish - 1 freq
rkos - 1 freq
MetaPhone code - RSS
roses - 102 freq
rises - 42 freq
raises - 24 freq
ruses - 1 freq
ross's - 3 freq
rizzio's - 17 freq
rosie's - 5 freq
races - 17 freq
rice's - 2 freq
recess - 3 freq
reeses - 1 freq
rissies - 1 freq
rosies - 2 freq
rose's - 2 freq
ruises - 1 freq
roeses - 1 freq
reassess - 1 freq
“rosies - 1 freq
rossies - 1 freq
ROSSIES
Time to execute Levenshtein function - 0.223405 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.392180 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028104 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037380 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000874 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.