A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

Similar words to races in Corpus

Methods: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated
Levenshtein
races (0) - 17 freq
aces (1) - 5 freq
laces (1) - 11 freq
race (1) - 139 freq
rapes (1) - 1 freq
raced (1) - 14 freq
rakes (1) - 6 freq
braces (1) - 12 freq
kaces (1) - 1 freq
faces (1) - 230 freq
graces (1) - 16 freq
rates (1) - 18 freq
raes (1) - 2 freq
raxes (1) - 29 freq
racks (1) - 6 freq
traces (1) - 7 freq
racer (1) - 1 freq
paces (1) - 14 freq
raves (1) - 5 freq
racers (1) - 1 freq
maces (1) - 1 freq
rages (1) - 9 freq
grades (2) - 10 freq
rowes (2) - 10 freq
rangs (2) - 2 freq
Double Levenshtein
races (0) - 17 freq
paces (2) - 14 freq
traces (2) - 7 freq
raxes (2) - 29 freq
raves (2) - 5 freq
racks (2) - 6 freq
maces (2) - 1 freq
rcs (2) - 2 freq
racous (2) - 1 freq
rages (2) - 9 freq
raes (2) - 2 freq
racers (2) - 1 freq
racer (2) - 1 freq
rapes (2) - 1 freq
race (2) - 139 freq
rates (2) - 18 freq
laces (2) - 11 freq
rakes (2) - 6 freq
raced (2) - 14 freq
braces (2) - 12 freq
graces (2) - 16 freq
aces (2) - 5 freq
faces (2) - 230 freq
kaces (2) - 1 freq
roses (3) - 102 freq
SoundEx code - R220
riches - 23 freq
raxes - 29 freq
roses - 102 freq
rises - 42 freq
reaches - 24 freq
rochs - 1 freq
rushes - 20 freq
raises - 24 freq
rashes - 30 freq
raucous - 8 freq
rakes - 6 freq
ruckus - 2 freq
rejoice - 10 freq
rages - 9 freq
rejig - 1 freq
ruses - 1 freq
rizzio's - 17 freq
rejyyce - 1 freq
reekie's - 5 freq
rogues - 15 freq
rucksack - 11 freq
rescues - 1 freq
rashees - 3 freq
rucksacks - 3 freq
rosie's - 5 freq
races - 17 freq
rogueys - 1 freq
roughage - 1 freq
rice's - 2 freq
recess - 3 freq
rejeck - 2 freq
reeses - 1 freq
rissies - 1 freq
rosies - 2 freq
rehashes - 1 freq
rackwick - 1 freq
rugas - 1 freq
racous - 1 freq
rockies - 1 freq
rogues' - 1 freq
riggies - 1 freq
rose's - 2 freq
rejyce - 1 freq
rosehauch - 1 freq
recaws - 2 freq
rashis - 1 freq
rogie's - 1 freq
ruises - 1 freq
roeses - 1 freq
reassess - 1 freq
rejecks - 1 freq
“rosies - 1 freq
rojas - 1 freq
rogic - 3 freq
rossies - 1 freq
raucouse - 1 freq
rogueish - 1 freq
rkos - 1 freq
MetaPhone code - RSS
roses - 102 freq
rises - 42 freq
raises - 24 freq
ruses - 1 freq
ross's - 3 freq
rizzio's - 17 freq
rosie's - 5 freq
races - 17 freq
rice's - 2 freq
recess - 3 freq
reeses - 1 freq
rissies - 1 freq
rosies - 2 freq
rose's - 2 freq
ruises - 1 freq
roeses - 1 freq
reassess - 1 freq
“rosies - 1 freq
rossies - 1 freq
Manually curated
RACES
Time to execute Levenshtein function - 0.205433 milliseconds
The Levenshtein distance is the number of single-character replacements, insertions or deletions needed to transform one word into another. It's useful for detecting typos and alternative spellings.
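As a rough illustration of the definition above, the distance can be computed with the usual dynamic-programming table. This is a minimal sketch in Python, not the code behind this page; the function name `levenshtein` is chosen here for illustration.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character insertions, deletions or
    replacements needed to turn a into b."""
    # previous[j] is the distance between the prefix of a seen so far and b[:j]
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning a[:i] into the empty string takes i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb, or keep it
            ))
        previous = current
    return previous[-1]


for word in ("race", "faces", "grades"):
    print(word, levenshtein("races", word))
```

Run against races, this gives 1 for race and faces and 2 for grades, in line with the scores in the listing.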
Time to execute Double Levenshtein function - 0.357804 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
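One reading of this description, which agrees with the scores in the listing (aces scores 1 under Levenshtein and 2 here), is the distance between the full words plus the distance between the words with their vowels stripped. The sketch below assumes that reading, and assumes the vowels are a, e, i, o and u only; `strip_vowels` and `double_levenshtein` are names made up for this example.

```python
def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance, row by row."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


# Assumption: y is not treated as a vowel.
VOWELS = set("aeiou")


def strip_vowels(word: str) -> str:
    """Drop the vowels so that only the consonant skeleton remains."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


for word in ("aces", "racous", "roses"):
    print(word, double_levenshtein("races", word))
```

Because a vowel-only change costs nothing in the second pass, racous stays at 2 while roses, which also swaps a consonant, rises to 3.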
Time to execute SoundEx function - 0.028480 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
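The encoding can be sketched as follows, using the commonly described American Soundex rules: keep the first letter, map the remaining consonants to digits, collapse adjacent repeats, and pad to four characters. This is an illustrative version only. Implementations differ in how they treat h and w, so it may not reproduce every grouping in the listing.

```python
# Letter groups that share a Soundex digit.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word: str) -> str:
    """Four-character Soundex code, for example R220 for 'races'."""
    # Keep ASCII letters only, so apostrophes and quote marks are ignored.
    letters = [ch for ch in word.lower() if ch.isascii() and ch.isalpha()]
    if not letters:
        return ""
    code = letters[0].upper()
    last = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # in this variant h and w do not break a run of equal codes
        digit = CODES.get(ch, "")  # vowels and y have no digit
        if digit and digit != last:
            code += digit
        last = digit  # a vowel resets the run, so a repeated digit can appear
        if len(code) == 4:
            break
    return code.ljust(4, "0")


print(soundex("races"), soundex("roses"), soundex("rice's"))
```

All three words come out as R220, the code shown for races in the listing.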
Time to execute MetaPhone function - 0.037512 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000920 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
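In code, a lookup of this kind amounts to a dictionary from spellings to lemmas, with a miss for anything not yet curated. The sketch below is hypothetical: the entries are illustrative stand-ins rather than the site's actual lexicon, and `LEXICON` and `lemma_for` are invented names.

```python
from typing import Optional

# Hypothetical entries for illustration; the real table is curated by hand.
LEXICON = {
    "races": "RACES",     # mirrors the single result shown in the listing
    "racous": "RAUCOUS",  # assumed example of a spelling variant
    "raucouse": "RAUCOUS",
}


def lemma_for(word: str) -> Optional[str]:
    """Return the curated lemma, or None when the word is not covered."""
    return LEXICON.get(word.lower())


print(lemma_for("races"))
print(lemma_for("ahint"))  # not in this toy table, so the lookup misses
```

A dictionary lookup involves no string comparison across the corpus, which fits with this method reporting the shortest execution time of the five.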