A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie


Similar words to rbj in Corpus

Methods: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated
Levenshtein (distance in brackets)
rbj (0) - 1 freq
bj (1) - 15 freq
rbi (1) - 1 freq
rb (1) - 8 freq
rbjq (1) - 1 freq
rba (1) - 1 freq
rbm (1) - 4 freq
raj (1) - 1 freq
rj (1) - 2 freq
rbw (1) - 1 freq
rnj (1) - 1 freq
rbs (1) - 1 freq
r (2) - 446 freq
rwg (2) - 1 freq
res (2) - 2 freq
rpb (2) - 1 freq
rig (2) - 45 freq
gbh (2) - 3 freq
ryn (2) - 2 freq
bkj (2) - 1 freq
raf (2) - 7 freq
urbs (2) - 1 freq
aij (2) - 2 freq
rw (2) - 33 freq
roo (2) - 79 freq
Double Levenshtein (distance in brackets)
rbj (0) - 1 freq
raj (2) - 1 freq
rj (2) - 2 freq
rbs (2) - 1 freq
rbm (2) - 4 freq
rbw (2) - 1 freq
rnj (2) - 1 freq
rba (2) - 1 freq
bj (2) - 15 freq
rbi (2) - 1 freq
rbjq (2) - 1 freq
rb (2) - 8 freq
ribs (3) - 33 freq
robt (3) - 1 freq
robe (3) - 5 freq
rbib (3) - 1 freq
orlj (3) - 1 freq
raja (3) - 1 freq
robo (3) - 1 freq
orb (3) - 2 freq
ruby (3) - 8 freq
rub (3) - 50 freq
rubb (3) - 1 freq
bij (3) - 1 freq
roby (3) - 1 freq
SoundEx code - R120
ruifs - 9 freq
reap's - 1 freq
refuge - 6 freq
'reviews - 2 freq
rubbish - 67 freq
raips - 7 freq
roofs - 11 freq
ribs - 33 freq
robes - 7 freq
rab's - 9 freq
rubs - 9 freq
ropes - 21 freq
rubbage - 4 freq
revise - 6 freq
raves - 5 freq
refuse - 24 freq
revs - 1 freq
rabbie's - 5 freq
robbie's - 5 freq
rivvies - 1 freq
rebuke - 3 freq
rebecca - 16 freq
rubies - 4 freq
raipes - 1 freq
reifs - 1 freq
rbs - 1 freq
rips - 9 freq
reviews - 19 freq
ravish - 1 freq
rfc - 1 freq
rabies - 2 freq
rope's - 1 freq
robbys - 1 freq
robby's - 1 freq
rives - 4 freq
roves - 1 freq
rebekah - 3 freq
refugee - 5 freq
ravsie - 2 freq
ruives - 2 freq
rufus - 5 freq
rufus's - 1 freq
rüfs - 2 freq
reeves - 5 freq
rebigg - 1 freq
reeboks - 1 freq
reives - 2 freq
reefs - 6 freq
rps - 17 freq
'reviews' - 1 freq
rupes - 1 freq
ruffs - 1 freq
refeese - 1 freq
roups - 2 freq
rapes - 1 freq
raps - 1 freq
refaise - 1 freq
ravis - 1 freq
reiffis - 1 freq
rippek - 2 freq
rovies - 1 freq
“rubbish - 1 freq
re-pack - 1 freq
rubik - 1 freq
rufs - 1 freq
refuise - 1 freq
rob's - 1 freq
rvhs - 1 freq
rpcy - 1 freq
reaps - 11 freq
robskeeee - 1 freq
rbjq - 1 freq
rbj - 1 freq
refs - 6 freq
rpjj - 1 freq
rvjo - 1 freq
rvus - 1 freq
reps - 1 freq
repas - 1 freq
rebs - 3 freq
ruffage - 1 freq
rfk - 1 freq
MetaPhone code - RBJ
rubbage - 4 freq
rbj - 1 freq
Manually curated
RBJ
Time to execute Levenshtein function - 0.197460 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
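As an illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming edit distance. It is not the code this page runs; the function name `levenshtein` is chosen here for illustration, and each replace, insert or delete is assumed to cost 1.

```python
def levenshtein(a: str, b: str) -> int:
    """Count the single-character replaces, inserts and deletes needed to turn a into b."""
    # previous[j] = distance between the prefix of a seen so far and the first j chars of b
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning i characters of a into an empty string takes i deletes
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (free if they match)
            ))
        previous = current
    return previous[-1]


print(levenshtein("rbj", "bj"))   # 1
print(levenshtein("rbj", "rig"))  # 2
```

The two printed values agree with the distances shown for bj and rig in the Levenshtein list.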
Time to execute Double Levenshtein function - 0.322316 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
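The description above can be sketched in Python as follows. This is an illustration under assumptions, not the site's own code: the names `strip_vowels` and `double_levenshtein` are hypothetical, and the vowel set is assumed to be a, e, i, o, u, since the page does not say which letters it treats as vowels.

```python
VOWELS = set("aeiou")  # assumption: the page does not state its vowel set


def levenshtein(a: str, b: str) -> int:
    """Plain edit distance: replace, insert and delete each cost 1."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop the assumed vowels, keeping every other character in order."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("rbj", "bj"))    # 2
print(double_levenshtein("rbj", "ribs"))  # 3
```

Under these assumptions the sketch reproduces the values listed for bj (2) and ribs (3): a consonant difference is counted in both passes, while a vowel difference is counted only in the first.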
Time to execute SoundEx function - 0.027643 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
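To show how such a code is built, here is a minimal Python sketch of the common American Soundex rules: keep the first letter, map the remaining consonants to digits, collapse adjacent repeats, and pad to four characters. The function name `soundex` and the handling of non-ASCII letters (they are simply skipped) are assumptions made for this example; the page's own implementation may differ in edge cases.

```python
# Digit classes used by American Soundex; vowels, h, w and y carry no digit.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word: str) -> str:
    """Return a letter plus three digits, or an empty string if there are no letters."""
    # Assumption: only ASCII letters are considered; anything else is skipped.
    letters = [ch for ch in word.lower() if ch.isascii() and ch.isalpha()]
    if not letters:
        return ""
    result = letters[0].upper()
    last = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate consonants with the same digit
        code = CODES.get(ch, "")
        if code and code != last:
            result += code
            if len(result) == 4:
                break
        last = code  # a vowel resets this, so a repeated digit after it counts again
    return result.ljust(4, "0")


print(soundex("rbj"))      # R120
print(soundex("rubbish"))  # R120
```

Both words encode to R120, which is why rubbish appears among the SoundEx matches for rbj despite a large edit distance.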
Time to execute MetaPhone function - 0.036247 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.000802 milliseconds
Manual Curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
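A lookup of this kind can be sketched in Python as a plain dictionary from word form to lemma. The entries below are illustrative placeholders, not taken from the corpus lexicon, and the names `LEXICON` and `lookup_lemma` are hypothetical.

```python
from typing import Optional

# Hypothetical, hand-written entries for illustration only;
# the real lexicon's contents are not shown on this page.
LEXICON = {
    "sonsie": "sonsie",
    "sonsy": "sonsie",
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the curated lemma for word, or None when the word is not covered."""
    return LEXICON.get(word.lower())


print(lookup_lemma("Sonsy"))  # sonsie
print(lookup_lemma("rbj"))    # None
```

Returning `None` for uncovered words mirrors the note that the lexicon is incomplete, so a caller can fall back to one of the computed methods.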