A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to rules in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
rules (0) - 179 freq
rule (1) - 164 freq
fules (1) - 2 freq
mules (1) - 1 freq
ruled (1) - 25 freq
jules (1) - 1 freq
runes (1) - 6 freq
rupes (1) - 1 freq
rulers (1) - 8 freq
rues (1) - 2 freq
ruses (1) - 1 freq
ruler (1) - 10 freq
roles (1) - 12 freq
scules (2) - 1 freq
males (2) - 10 freq
races (2) - 17 freq
auler (2) - 17 freq
rile (2) - 3 freq
ruggs (2) - 2 freq
cues (2) - 3 freq
rude (2) - 37 freq
rolls (2) - 94 freq
roubles (2) - 1 freq
muses (2) - 7 freq
rls (2) - 7 freq
rules (0) - 179 freq
roles (1) - 12 freq
rouls (2) - 2 freq
ruler (2) - 10 freq
relies (2) - 1 freq
rls (2) - 7 freq
arles (2) - 5 freq
erles (2) - 1 freq
ruses (2) - 1 freq
reuls (2) - 2 freq
mules (2) - 1 freq
jules (2) - 1 freq
rule (2) - 164 freq
ruled (2) - 25 freq
fules (2) - 2 freq
rues (2) - 2 freq
rulers (2) - 8 freq
runes (2) - 6 freq
rupes (2) - 1 freq
rale (3) - 78 freq
routes (3) - 11 freq
holes (3) - 71 freq
ruse (3) - 1 freq
wales (3) - 37 freq
rulit (3) - 2 freq
SoundEx code - R420
rolls - 94 freq
rules - 179 freq
realize - 6 freq
railways - 7 freq
relic - 11 freq
realise - 107 freq
roles - 12 freq
release - 19 freq
rails - 15 freq
railwyes - 1 freq
reealise - 1 freq
realeese - 1 freq
relax - 41 freq
reel's - 2 freq
reels - 20 freq
rawls - 1 freq
relics - 6 freq
really's - 2 freq
rail's - 1 freq
rollies - 1 freq
relish - 7 freq
royals - 9 freq
'rules' - 2 freq
rolex - 2 freq
rilke - 2 freq
relays - 2 freq
rowley's - 8 freq
rills - 1 freq
rowls - 5 freq
ruills - 2 freq
roll's - 1 freq
raleigh - 2 freq
rouls - 2 freq
rueless - 1 freq
rls - 7 freq
royales - 1 freq
raelly-wes - 1 freq
raelise - 4 freq
rls's - 1 freq
reuls - 2 freq
relies - 1 freq
roweloks - 1 freq
rallies - 7 freq
rollicks - 1 freq
€œrelax - 2 freq
railise - 1 freq
railweys - 1 freq
reealais - 1 freq
roolz - 1 freq
rlq - 1 freq
roals - 1 freq
MetaPhone code - RLS
rolls - 94 freq
rules - 179 freq
realize - 6 freq
realise - 107 freq
roles - 12 freq
release - 19 freq
rails - 15 freq
reealise - 1 freq
realeese - 1 freq
reel's - 2 freq
reels - 20 freq
rawls - 1 freq
really's - 2 freq
rail's - 1 freq
rollies - 1 freq
'rules' - 2 freq
relays - 2 freq
rowley's - 8 freq
rills - 1 freq
rowls - 5 freq
wyreless - 4 freq
ruills - 2 freq
roll's - 1 freq
rouls - 2 freq
rueless - 1 freq
rls - 7 freq
raelise - 4 freq
reuls - 2 freq
relies - 1 freq
rallies - 7 freq
railise - 1 freq
reealais - 1 freq
roolz - 1 freq
roals - 1 freq
RULES
Time to execute Levenshtein function - 0.189634 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.338714 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027733 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.044008 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000882 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.