A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

Similar words to rubbish in Corpus

Levenshtein - Double Levenshtein - SoundEx - MetaPhone - Manually curated
Levenshtein distance
rubbish (0) - 67 freq
rubbits' (2) - 1 freq
rubbit (2) - 49 freq
“rubbish (2) - 1 freq
rubbing (2) - 3 freq
jubish (2) - 2 freq
rubbin (2) - 48 freq
rubbir (2) - 1 freq
publish (2) - 12 freq
rubbits (2) - 14 freq
turkish (3) - 23 freq
rabbit' (3) - 5 freq
gutnish (3) - 1 freq
jutish (3) - 1 freq
gubbin (3) - 3 freq
cubbie (3) - 2 freq
ruaridh (3) - 1 freq
rubban (3) - 3 freq
duskish (3) - 1 freq
snobbish (3) - 1 freq
puneish (3) - 1 freq
juwish (3) - 1 freq
punish (3) - 4 freq
rubbage (3) - 4 freq
rubbed (3) - 42 freq
Double Levenshtein distance
rubbish (0) - 67 freq
rubbir (4) - 1 freq
rubbits (4) - 14 freq
rubbin (4) - 48 freq
robbys (4) - 1 freq
publish (4) - 12 freq
jubish (4) - 2 freq
rubbit (4) - 49 freq
“rubbish (4) - 1 freq
rubbing (4) - 3 freq
rubbits' (4) - 1 freq
grubbiest (5) - 1 freq
rubbery (5) - 7 freq
rubble (5) - 7 freq
rubbled (5) - 1 freq
dubbiest (5) - 1 freq
rubs (5) - 9 freq
biish (5) - 1 freq
rubb (5) - 1 freq
rabbie (5) - 63 freq
rubies (5) - 4 freq
o’rubbish (5) - 1 freq
ribbin (5) - 2 freq
hubbys (5) - 1 freq
reddish (5) - 1 freq
SoundEx code - R120
ruifs - 9 freq
reap's - 1 freq
refuge - 6 freq
'reviews - 2 freq
rubbish - 67 freq
raips - 7 freq
roofs - 11 freq
ribs - 33 freq
robes - 7 freq
rab's - 9 freq
rubs - 9 freq
ropes - 21 freq
rubbage - 4 freq
revise - 6 freq
raves - 5 freq
refuse - 24 freq
revs - 1 freq
rabbie's - 5 freq
robbie's - 5 freq
rivvies - 1 freq
rebuke - 3 freq
rebecca - 16 freq
rubies - 4 freq
raipes - 1 freq
reifs - 1 freq
rbs - 1 freq
rips - 9 freq
reviews - 19 freq
ravish - 1 freq
rfc - 1 freq
rabies - 2 freq
rope's - 1 freq
robbys - 1 freq
robby's - 1 freq
rives - 4 freq
roves - 1 freq
rebekah - 3 freq
refugee - 5 freq
ravsie - 2 freq
ruives - 2 freq
rufus - 5 freq
rufus's - 1 freq
rüfs - 2 freq
reeves - 5 freq
rebigg - 1 freq
reeboks - 1 freq
reives - 2 freq
reefs - 6 freq
rps - 17 freq
'reviews' - 1 freq
rupes - 1 freq
ruffs - 1 freq
refeese - 1 freq
roups - 2 freq
rapes - 1 freq
raps - 1 freq
refaise - 1 freq
ravis - 1 freq
reiffis - 1 freq
rippek - 2 freq
rovies - 1 freq
“rubbish - 1 freq
re-pack - 1 freq
rubik - 1 freq
rufs - 1 freq
refuise - 1 freq
rob's - 1 freq
rvhs - 1 freq
rpcy - 1 freq
reaps - 11 freq
robskeeee - 1 freq
rbjq - 1 freq
rbj - 1 freq
refs - 6 freq
rpjj - 1 freq
rvjo - 1 freq
rvus - 1 freq
reps - 1 freq
repas - 1 freq
rebs - 3 freq
ruffage - 1 freq
rfk - 1 freq
MetaPhone code - RBX
rubbish - 67 freq
“rubbish - 1 freq
Manually curated lemma - RUBBISH
Time to execute Levenshtein function - 0.225009 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It is useful for detecting typos and alternative spellings.
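The definition above can be sketched in Python with the standard dynamic-programming approach. This is only an illustration: the function name `levenshtein` is an assumption, the sample words are taken from the listing above, and the corpus site's own implementation is not shown here.

```python
def levenshtein(a: str, b: str) -> int:
    """Count the single-character insertions, deletions and
    substitutions needed to turn string a into string b."""
    # previous[j] holds the distance between the first i-1 characters
    # of a and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]  # turning a[:i] into "" takes i deletions
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(
                previous[j] + 1,         # delete char_a
                current[j - 1] + 1,      # insert char_b
                previous[j - 1] + cost,  # substitute (or keep) the character
            ))
        previous = current
    return previous[-1]


if __name__ == "__main__":
    for word in ["rubbish", "rubbing", "publish", "rubbed"]:
        print(word, levenshtein("rubbish", word))
```

For these four words the sketch gives 0, 2, 2 and 3, which agrees with the bracketed distances in the Levenshtein listing.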
Time to execute Double Levenshtein function - 0.429005 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
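A rough Python sketch of this idea follows. It assumes that "vowels" means the letters a, e, i, o and u (so y counts as a consonant); the names `strip_vowels` and `double_levenshtein` are hypothetical, and the site's actual code may differ.

```python
VOWELS = set("aeiou")  # assumption: y is treated as a consonant


def levenshtein(a: str, b: str) -> int:
    """Plain edit distance (insertions, deletions, substitutions)."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Return the word with a, e, i, o, u removed (case-insensitive)."""
    return "".join(c for c in word if c.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    full = levenshtein(a, b)
    consonants_only = levenshtein(strip_vowels(a), strip_vowels(b))
    return full + consonants_only


if __name__ == "__main__":
    for word in ["rubbing", "rubbery", "rubs"]:
        print(word, double_levenshtein("rubbish", word))
```

Under these assumptions the sketch gives 4 for rubbing and 5 for rubbery and rubs, the same values as in the Double Levenshtein listing.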
Time to execute SoundEx function - 0.033090 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
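To illustrate how such a code is built, here is a small Python sketch of the common American Soundex rules: keep the first letter, map the remaining consonants to digits, collapse repeated digits, and pad to four characters. The function name `soundex` is an assumption, non-ASCII letters are simply ignored, and the site's own function may behave differently in edge cases.

```python
# Consonant groups used by American Soundex; vowels, h, w and y have no digit.
SOUNDEX_CODES = {
    **dict.fromkeys("bfpv", "1"),
    **dict.fromkeys("cgjkqsxz", "2"),
    **dict.fromkeys("dt", "3"),
    "l": "4",
    **dict.fromkeys("mn", "5"),
    "r": "6",
}


def soundex(word: str) -> str:
    """Return a four-character Soundex code, or "" if the word has no ASCII letters."""
    letters = [c for c in word.lower() if c.isascii() and c.isalpha()]
    if not letters:
        return ""
    code = letters[0].upper()
    last_digit = SOUNDEX_CODES.get(letters[0], "")
    for c in letters[1:]:
        if c in "hw":
            continue  # h and w do not separate two letters with the same digit
        digit = SOUNDEX_CODES.get(c, "")
        if digit and digit != last_digit:
            code += digit
        last_digit = digit  # a vowel resets this to "", allowing a repeat
    return (code + "000")[:4]  # pad with zeros and truncate to four characters


if __name__ == "__main__":
    for word in ["rubbish", "refuge", "rebecca"]:
        print(word, soundex(word))
```

All three sample words come out as R120, matching the SoundEx code shown for rubbish in the listing.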
Time to execute MetaPhone function - 0.041409 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000838 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
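A lookup of this kind might look like the following Python sketch. The table contents and the names `LEMMA_TABLE` and `lookup_lemma` are hypothetical and for illustration only; the real lexicon is not reproduced here.

```python
from typing import Optional

# Hypothetical entries: each spelling variant or typo maps to one lemma.
# The real hand-curated table is not shown in the source.
LEMMA_TABLE = {
    "rubbish": "RUBBISH",
    "rubbage": "RUBBISH",
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the curated lemma for a word, or None if the word is not covered."""
    return LEMMA_TABLE.get(word.strip().lower())


if __name__ == "__main__":
    print(lookup_lemma("Rubbish"))
    print(lookup_lemma("sonsie"))  # not in this toy table
```

A dictionary lookup takes roughly constant time regardless of vocabulary size, which fits with this being the fastest of the five methods timed above, at the cost of covering only the words someone has entered by hand.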