A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to leprosie in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
leprosie (0) - 8 freq
sprosie (2) - 1 freq
leprosy (2) - 1 freq
leerie (3) - 22 freq
depose (3) - 1 freq
melrose (3) - 6 freq
sprose (3) - 2 freq
deposit (3) - 5 freq
brosie (3) - 11 freq
liprosy (3) - 2 freq
prose (3) - 60 freq
lerone (3) - 1 freq
'prosze (3) - 1 freq
prosi (3) - 1 freq
prossie (3) - 1 freq
erosive (3) - 1 freq
therosie (3) - 2 freq
posie (3) - 1 freq
reprise (3) - 1 freq
poosie (3) - 4 freq
provie (3) - 2 freq
leadsie (3) - 1 freq
rosie (3) - 73 freq
versie (3) - 1 freq
erie (4) - 1 freq
leprosie (0) - 8 freq
leprosy (2) - 1 freq
sprosie (3) - 1 freq
liprosy (3) - 2 freq
reprise (4) - 1 freq
sprose (4) - 2 freq
prosi (4) - 1 freq
prose (4) - 60 freq
lepers (4) - 4 freq
leeries (5) - 6 freq
lapraik (5) - 1 freq
apprise (5) - 1 freq
leirs (5) - 1 freq
lapse (5) - 2 freq
prise (5) - 3 freq
leps (5) - 1 freq
lears (5) - 2 freq
lepps (5) - 1 freq
caprois (5) - 1 freq
pursie (5) - 2 freq
leipers (5) - 2 freq
pros (5) - 4 freq
lipprosy (5) - 1 freq
rosie (5) - 73 freq
'prosze (5) - 1 freq
SoundEx code - L162
luvers - 12 freq
lippers - 9 freq
lipperous - 1 freq
laboryus - 1 freq
lea-perk - 2 freq
labours - 5 freq
laverocks - 5 freq
lappers - 1 freq
liver's - 2 freq
lovers - 20 freq
leverock - 1 freq
levers - 6 freq
leavers - 1 freq
leprosie - 8 freq
lepers - 4 freq
laverock's - 2 freq
lipprosy - 1 freq
liprosy - 2 freq
lavroos - 1 freq
luver's - 2 freq
leprosy - 1 freq
'leprechaun - 2 freq
leprechauns - 1 freq
leprechaun - 3 freq
laverock - 14 freq
laveroks - 1 freq
liver-spotted - 1 freq
laebrack - 1 freq
lapraik - 1 freq
lawburrows - 14 freq
laevrick - 2 freq
laborious - 2 freq
lawborrowis - 6 freq
lawborrouis - 1 freq
law-borowis - 1 freq
law-borrowis - 1 freq
lawbawrous - 1 freq
lawbarrowis - 1 freq
lauborris - 1 freq
lawborch - 1 freq
leverage - 3 freq
laebrak - 1 freq
laverick - 1 freq
lawbors - 1 freq
laabours - 1 freq
laphroaig - 2 freq
low-prestige - 2 freq
luve-arras - 1 freq
luvvers - 1 freq
louvres - 2 freq
livers - 1 freq
€œleverage - 1 freq
leipers - 2 freq
liveries - 1 freq
liberace's - 1 freq
loversaberdeen - 1 freq
law-breaking - 1 freq
labourrichard - 2 freq
lavrock's - 1 freq
MetaPhone code - LPRS
lippers - 9 freq
lipperous - 1 freq
lappers - 1 freq
leprosie - 8 freq
lepers - 4 freq
lipprosy - 1 freq
liprosy - 2 freq
leprosy - 1 freq
leipers - 2 freq
LEPROSIE
Time to execute Levenshtein function - 0.223441 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.399429 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028940 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.039355 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000935 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.