A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to rockies in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
rockies (0) - 1 freq
pockies (1) - 1 freq
rickles (2) - 3 freq
dookies (2) - 1 freq
folkies (2) - 2 freq
rocker (2) - 3 freq
rocking (2) - 2 freq
roadies (2) - 4 freq
rockitet (2) - 1 freq
rockness (2) - 1 freq
roakie (2) - 5 freq
pokkies (2) - 2 freq
rollies (2) - 1 freq
rochles (2) - 1 freq
fowkies (2) - 21 freq
jockie (2) - 28 freq
sickies (2) - 1 freq
porkies (2) - 2 freq
ruckles (2) - 1 freq
cockles (2) - 6 freq
rockin' (2) - 1 freq
rocket (2) - 43 freq
rocks (2) - 85 freq
ronnies (2) - 1 freq
dockins (2) - 1 freq
rockies (0) - 1 freq
rocks (2) - 85 freq
pockies (2) - 1 freq
cookies (3) - 10 freq
yorkies (3) - 1 freq
rock's (3) - 8 freq
luckies (3) - 1 freq
rockin (3) - 18 freq
mackies (3) - 4 freq
brickies (3) - 2 freq
trackies (3) - 8 freq
rocked (3) - 6 freq
rockeen (3) - 1 freq
racks (3) - 6 freq
ruckus (3) - 2 freq
ricks (3) - 1 freq
rucks (3) - 3 freq
lockis (3) - 1 freq
rockit (3) - 4 freq
rockets (3) - 9 freq
buckies (3) - 11 freq
rackie (3) - 1 freq
backies (3) - 5 freq
recks (3) - 20 freq
ruckles (3) - 1 freq
SoundEx code - R220
riches - 23 freq
raxes - 28 freq
roses - 102 freq
rises - 41 freq
reaches - 21 freq
rochs - 1 freq
rushes - 19 freq
raises - 20 freq
rashes - 29 freq
raucous - 8 freq
rakes - 6 freq
ruckus - 2 freq
rejoice - 10 freq
rages - 7 freq
rejig - 1 freq
ruses - 1 freq
rizzio's - 17 freq
rejyyce - 1 freq
reekie's - 5 freq
rogues - 15 freq
races - 17 freq
rogueys - 1 freq
rosie's - 4 freq
roughage - 1 freq
rice's - 2 freq
rucksack - 9 freq
recess - 3 freq
rejeck - 2 freq
reeses - 1 freq
rissies - 1 freq
rosies - 2 freq
rehashes - 1 freq
rackwick - 1 freq
rucksacks - 2 freq
rugas - 1 freq
racous - 1 freq
rockies - 1 freq
rogues' - 1 freq
riggies - 1 freq
rose's - 2 freq
rejyce - 1 freq
rosehauch - 1 freq
recaws - 2 freq
rashis - 1 freq
rogie's - 1 freq
ruises - 1 freq
roeses - 1 freq
reassess - 1 freq
rejecks - 1 freq
“rosies - 1 freq
rojas - 1 freq
rogic - 3 freq
rossies - 1 freq
raucouse - 1 freq
rogueish - 1 freq
rkos - 1 freq
MetaPhone code - RKS
rocks - 85 freq
riggs - 8 freq
rax - 79 freq
racks - 6 freq
rigs - 62 freq
rooks - 6 freq
recks - 20 freq
raucous - 8 freq
rakes - 6 freq
ruckus - 2 freq
rock's - 8 freq
rug's - 1 freq
rugs - 9 freq
rags - 12 freq
reeks - 7 freq
reekie's - 5 freq
rogues - 15 freq
rogueys - 1 freq
roaks - 8 freq
rux - 1 freq
raiks - 5 freq
rucks - 3 freq
wrecks - 2 freq
ruggs - 2 freq
wrack's - 1 freq
wracks - 11 freq
wrecksi - 1 freq
rex - 3 freq
rugas - 1 freq
racous - 1 freq
rockies - 1 freq
ricks - 1 freq
rogues' - 1 freq
riggies - 1 freq
recaws - 2 freq
roogs - 2 freq
reg's - 1 freq
raxx - 4 freq
wraks - 1 freq
roks - 1 freq
recce - 1 freq
rouks - 1 freq
rcs - 2 freq
rxau - 1 freq
rx - 6 freq
raucouse - 1 freq
ruk's - 1 freq
rxe - 1 freq
rkos - 1 freq
ROCKIES
Time to execute Levenshtein function - 0.287319 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.591148 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.030520 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.069639 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000945 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.