A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to romans in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
romans (0) - 44 freq
rowans (1) - 12 freq
womans (1) - 5 freq
roumans (1) - 2 freq
roman (1) - 76 freq
ermans (2) - 1 freq
romanies (2) - 1 freq
royals (2) - 9 freq
somane (2) - 1 freq
womens (2) - 3 freq
groans (2) - 6 freq
roland (2) - 2 freq
nomads (2) - 1 freq
woman (2) - 100 freq
logans (2) - 1 freq
commans (2) - 3 freq
oman (2) - 2 freq
brogans (2) - 1 freq
solans (2) - 3 freq
roaks (2) - 8 freq
roads (2) - 105 freq
romagna (2) - 4 freq
mans (2) - 21 freq
gowans (2) - 22 freq
roals (2) - 1 freq
romans (0) - 44 freq
roumans (1) - 2 freq
remains (2) - 61 freq
ermans (2) - 1 freq
romanies (2) - 1 freq
womans (2) - 5 freq
rowans (2) - 12 freq
roman (2) - 76 freq
remant (3) - 1 freq
moans (3) - 7 freq
romances (3) - 2 freq
roons (3) - 7 freq
humans (3) - 55 freq
ronas (3) - 3 freq
romeos (3) - 1 freq
romance (3) - 39 freq
rouman (3) - 2 freq
demans (3) - 1 freq
lemans (3) - 1 freq
rouns (3) - 4 freq
comins (3) - 4 freq
reminis (3) - 1 freq
remeens (3) - 1 freq
rome's (3) - 1 freq
aromas (3) - 1 freq
SoundEx code - R552
remains - 61 freq
reminiscences - 3 freq
reminiscin - 4 freq
romance - 39 freq
remeens - 1 freq
remonstrate - 1 freq
rhyming - 10 freq
running - 47 freq
romans - 44 freq
ruining - 3 freq
reminisce - 6 freq
romunce - 5 freq
reminiscent - 2 freq
romansch - 4 freq
romance-based - 1 freq
renounced - 1 freq
romances - 2 freq
roumans - 2 freq
romanies - 1 freq
romauncin - 1 freq
romancin - 1 freq
remonstrance - 1 freq
romaunces - 1 freq
ranunculus - 1 freq
reminiscencies - 1 freq
€˜romance - 1 freq
remonstratin - 1 freq
rinning - 7 freq
€œreminiscences - 1 freq
remington - 1 freq
renunciation - 2 freq
reminiscence - 2 freq
raining - 5 freq
rainiangel - 1 freq
rhymingweavers - 1 freq
ronanmcsherryuh - 1 freq
raminski - 1 freq
romanscotland - 2 freq
reminiscing - 1 freq
reminis - 1 freq
romancticness - 1 freq
MetaPhone code - RMNS
remains - 61 freq
romance - 39 freq
remeens - 1 freq
romans - 44 freq
reminisce - 6 freq
romunce - 5 freq
roumans - 2 freq
romanies - 1 freq
€˜romance - 1 freq
reminis - 1 freq
ROMANS
Time to execute Levenshtein function - 0.379206 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.793138 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027609 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.085748 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000936 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.