A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow


Similar words to rh-negative in Corpus

Results by method: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated
Levenshtein
rh-negative (0) - 1 freq
‘negative (3) - 1 freq
negative (3) - 41 freq
raelative (4) - 1 freq
relative (4) - 29 freq
recreative (4) - 1 freq
negatives (4) - 3 freq
tentative (5) - 3 freq
rhymetime (5) - 1 freq
relegation (5) - 4 freq
reflective (5) - 3 freq
negatively (5) - 2 freq
rinagate (5) - 2 freq
receptive (5) - 4 freq
native (5) - 94 freq
innovative (5) - 9 freq
imperative (5) - 9 freq
restive (5) - 1 freq
intiative (5) - 1 freq
relegatit (5) - 3 freq
'creative (5) - 4 freq
honestie (5) - 1 freq
negate (5) - 2 freq
sedative (5) - 1 freq
hortative (5) - 1 freq
Double Levenshtein
rh-negative (0) - 1 freq
negative (6) - 41 freq
‘negative (6) - 1 freq
rhinestane (8) - 1 freq
non-native (8) - 2 freq
rinagate (8) - 2 freq
negatives (8) - 3 freq
relative (8) - 29 freq
raelative (8) - 1 freq
recreative (8) - 1 freq
negatin (9) - 2 freq
ringtone (9) - 1 freq
pre-emptive (9) - 1 freq
repetitive (9) - 2 freq
negation (9) - 2 freq
restorative (9) - 3 freq
hiegate (9) - 1 freq
earthnative (9) - 1 freq
argumentative (9) - 1 freq
radio-active (9) - 1 freq
rhanratty (9) - 13 freq
rinagates (9) - 1 freq
thrangitie (9) - 1 freq
renegade (9) - 1 freq
negatit (9) - 1 freq
SoundEx code - R523
reenged - 8 freq
rummaged - 6 freq
rinagates - 1 freq
rnght - 1 freq
ramstam - 10 freq
re-enactments - 1 freq
re-enacts - 1 freq
ramstein - 1 freq
raincoat - 5 freq
rinagate - 2 freq
rainstorm - 1 freq
ram-stam - 8 freq
renegade - 1 freq
rancid - 6 freq
rinsed - 2 freq
reinstate - 1 freq
re-enactin - 1 freq
roncadora - 1 freq
ringed - 7 freq
ronged - 1 freq
reinged - 5 freq
ranged - 1 freq
ringtones - 1 freq
ramstamming - 1 freq
ramscootered - 1 freq
ramstoorie - 2 freq
rem-stem - 1 freq
re-enactors' - 1 freq
'ramstam' - 1 freq
rankit - 1 freq
raincoats - 1 freq
raamished - 2 freq
rain-washed - 1 freq
ramstougerous - 1 freq
ranced - 1 freq
reneged - 1 freq
rune-stones - 1 freq
ramshed - 1 freq
reinstatement - 1 freq
'ramstam - 1 freq
ranked - 1 freq
rhinestane - 1 freq
rh-negative - 1 freq
renshed - 1 freq
ramished - 2 freq
rammstein - 1 freq
ramstamin - 2 freq
re-enactment - 2 freq
rhymester - 1 freq
ringtone - 1 freq
remixed - 1 freq
rnmzdp - 1 freq
renegaderuth - 1 freq
MetaPhone code - RNKTF
rh-negative - 1 freq
Manually curated - RH-NEGATIVE
Time to execute Levenshtein function - 0.237692 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
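As a rough illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming edit distance. The function name `levenshtein` is chosen for this example only, and the code is not necessarily how this site computes its figures, although it agrees with the distances listed for negative, relative and native.

```python
def levenshtein(a: str, b: str) -> int:
    """Count the single-character replacements, insertions and deletions
    needed to turn a into b (illustrative sketch, not the site's code)."""
    # prev[j] holds the distance between the first i-1 characters of a
    # and the first j characters of b.
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        curr = [i]  # turning a[:i] into the empty string takes i deletions
        for j, cb in enumerate(b, 1):
            cost = 0 if ca == cb else 1
            curr.append(min(
                prev[j] + 1,         # delete ca
                curr[j - 1] + 1,     # insert cb
                prev[j - 1] + cost,  # replace ca with cb, or keep it
            ))
        prev = curr
    return prev[-1]


print(levenshtein("rh-negative", "negative"))  # 3
print(levenshtein("rh-negative", "relative"))  # 4
print(levenshtein("rh-negative", "native"))    # 5
```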
Time to execute Double Levenshtein function - 0.402969 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
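A minimal Python sketch of this idea follows. It assumes that "vowels" means the letters a, e, i, o and u, and that hyphens and other characters are kept; the names `strip_vowels` and `double_levenshtein` are invented for the example. Under those assumptions it agrees with the Double Levenshtein figures listed for negative, relative and negatives, but the site's own implementation may differ.

```python
VOWELS = set("aeiou")  # assumption: y is treated as a consonant


def levenshtein(a: str, b: str) -> int:
    """Standard edit distance (replace, insert, delete)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        curr = [i]
        for j, cb in enumerate(b, 1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost))
        prev = curr
    return prev[-1]


def strip_vowels(word: str) -> str:
    """Drop a, e, i, o, u; keep everything else, including hyphens."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the vowel-less words."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("rh-negative", "negative"))   # 6
print(double_levenshtein("rh-negative", "relative"))   # 8
print(double_levenshtein("rh-negative", "negatives"))  # 8
```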
Time to execute SoundEx function - 0.038794 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
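To make the encoding concrete, here is a minimal Python sketch of the common American Soundex variant. It assumes that non-letters such as hyphens and quotes are ignored, and that h and w do not separate two letters with the same code; the function name `soundex` is chosen for the example, and the site's own routine may differ in edge cases. Under these assumptions rh-negative encodes to R523, the code shown in the results.

```python
# Digit assigned to each consonant group; vowels, h, w and y get no digit.
CODES = {}
for group, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                     ("l", "4"), ("mn", "5"), ("r", "6")):
    for letter in group:
        CODES[letter] = digit


def soundex(word: str) -> str:
    """Return a 4-character Soundex code, or "" if word has no letters."""
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    code = letters[0].upper()
    last = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w are skipped without resetting the last code
        digit = CODES.get(ch, "")
        if digit and digit != last:
            code += digit
            if len(code) == 4:
                break
        last = digit  # a vowel resets this, so a repeated code is kept
    return code.ljust(4, "0")


print(soundex("rh-negative"))  # R523
print(soundex("rinagate"))     # R523
print(soundex("ramstam"))      # R523
```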
Time to execute MetaPhone function - 0.043873 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000862 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
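A lookup of this kind can be sketched in Python as a plain dictionary. The structure, the names `LEXICON` and `lookup_lemma`, and the lower-casing of the query are all assumptions made for illustration; the single entry reflects what the results above appear to show for rh-negative, and the real lexicon is of course far larger.

```python
from typing import Optional

# Hypothetical stand-in for the hand-built lexicon: word form -> lemma.
# Only this one entry is inferred from the results shown above.
LEXICON = {
    "rh-negative": "RH-NEGATIVE",
}


def lookup_lemma(word: str, lexicon: dict = LEXICON) -> Optional[str]:
    """Return the curated lemma for word, or None if it is not covered."""
    return lexicon.get(word.lower())


print(lookup_lemma("rh-negative"))  # RH-NEGATIVE
print(lookup_lemma("ablow"))        # None (not in this toy lexicon)
```

Returning None for a missing word mirrors the note that not all words are covered, so callers can fall back to one of the other methods.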