A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow


Similar words to perineum in Corpus

Methods: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated

Levenshtein (distance in brackets)
perineum (0) - 2 freq
pernee (3) - 1 freq
pertners (3) - 1 freq
erinm (3) - 2 freq
perilous (3) - 1 freq
pernyim (3) - 1 freq
pertinent (3) - 3 freq
petroleum (3) - 1 freq
geranium (3) - 1 freq
pertner (3) - 2 freq
pertness (3) - 1 freq
eminem (3) - 2 freq
meringue (3) - 3 freq
erne's (4) - 1 freq
refined (4) - 8 freq
mariner (4) - 4 freq
meiner (4) - 2 freq
definet (4) - 1 freq
reinen (4) - 1 freq
privet (4) - 9 freq
princip (4) - 1 freq
perlins (4) - 3 freq
tripeuk (4) - 22 freq
erne (4) - 4 freq
aeriel (4) - 1 freq

Double Levenshtein (combined distance in brackets)
perineum (0) - 2 freq
pernyim (3) - 1 freq
geranium (4) - 1 freq
erinm (4) - 2 freq
pernee (4) - 1 freq
perfume (5) - 33 freq
bernam (5) - 1 freq
prism (5) - 2 freq
uranium (5) - 2 freq
peering (5) - 1 freq
capernaum (5) - 15 freq
prunes (5) - 3 freq
pronoun (5) - 23 freq
peerin (5) - 34 freq
princie (5) - 4 freq
prince (5) - 346 freq
prune (5) - 5 freq
parin (5) - 4 freq
preined (5) - 1 freq
prenup (5) - 2 freq
prim (5) - 2 freq
premium (5) - 2 freq
pyrenees (5) - 1 freq
parins (5) - 3 freq
purned (5) - 1 freq
SoundEx code - P655
pronunciation - 31 freq
pronunciations - 10 freq
permanent - 30 freq
pronounce - 24 freq
preenin - 8 freq
pronoonceable - 1 freq
perimenopause - 1 freq
promenade - 8 freq
premonition - 7 freq
promenader - 1 freq
pronooncit - 2 freq
permanently - 6 freq
pronoonce - 12 freq
prunin - 2 freq
premium - 2 freq
pronooncements - 1 freq
preanin - 1 freq
pronounce''t - 1 freq
prominent - 9 freq
pronounced - 29 freq
preenin' - 1 freq
pronunced - 3 freq
pronouns - 42 freq
pronoun - 23 freq
paranems - 1 freq
pronuncin - 2 freq
premonitions - 2 freq
permanant - 1 freq
pornounciation - 1 freq
pernyim - 1 freq
pre-eminent - 1 freq
pronominal - 1 freq
pronouncit - 2 freq
pronunciatioun - 1 freq
pronunciâtion - 1 freq
permament - 1 freq
pronuncement - 1 freq
pronoonciation - 15 freq
pronoon - 6 freq
pronoonced - 4 freq
promontory - 2 freq
pronouncin - 1 freq
pronunciaetion - 1 freq
pronunciaetions - 1 freq
primming - 1 freq
prominence - 1 freq
permency - 1 freq
permanence - 1 freq
pronooncin - 4 freq
perineum - 2 freq
pronoons - 3 freq
pronunseeaeshins - 1 freq
preeemium - 1 freq
paramountabdn - 1 freq
pronounciation - 1 freq
MetaPhone code - PRNM
perineum - 2 freq
Manually curated - PERINEUM
Time to execute Levenshtein function - 0.215470 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
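As a rough illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming calculation. It is not the site's own code, and the function name `levenshtein` is just a label chosen for this example.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits needed to turn a into b."""
    # previous[j] = distance between the prefix of a handled so far
    # and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning i characters of a into "" takes i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (free if equal)
            ))
        previous = current
    return previous[-1]


print(levenshtein("perineum", "perineum"))  # 0
print(levenshtein("perineum", "geranium"))  # 3
print(levenshtein("perineum", "pernee"))    # 3
```

The three values agree with the distances shown in brackets in the Levenshtein listing for the same words.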
Time to execute Double Levenshtein function - 0.389457 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
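The two-pass idea can be sketched in Python as follows. This is a guess at the approach, not the site's implementation. In particular, the vowel set `aeiouy` is an assumption: treating y as a vowel is what makes the sketch reproduce the listed score of 3 for pernyim. The names `strip_vowels` and `double_levenshtein` are made up for this example.

```python
VOWELS = set("aeiouy")  # assumption: y is counted as a vowel


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance (row-by-row dynamic programming)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop every vowel, leaving the consonant skeleton of the word."""
    return "".join(c for c in word if c.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("perineum", "pernyim"))   # 3 (3 + 0)
print(double_levenshtein("perineum", "geranium"))  # 4 (3 + 1)
```

Because a consonant change shows up in both passes while a vowel change only shows up in the first, pernyim stays at 3 but geranium moves from 3 to 4, which matches the ordering in the Double Levenshtein listing.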
Time to execute SoundEx function - 0.027607 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
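For readers who want to see how a code such as P655 comes about, below is a small Python sketch of the classic American Soundex rules: keep the first letter, map the remaining consonants to digits, skip vowels, collapse repeated digits, and pad or cut to four characters. It is an illustration only. The site may use a library routine whose edge cases differ, and the names `CODES` and `soundex` are chosen for this example.

```python
# Letter groups that share a Soundex digit.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word: str) -> str:
    """Return the four-character Soundex code, or "" if there are no letters."""
    # Keep plain a-z only; apostrophes, hyphens and accented letters are dropped.
    letters = [c for c in word.lower() if "a" <= c <= "z"]
    if not letters:
        return ""
    result = letters[0].upper()
    last = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate two letters with the same code
        digit = CODES.get(ch, "")
        if digit and digit != last:
            result += digit
            if len(result) == 4:
                break
        last = digit  # a vowel resets this, so a repeated code counts again
    return result.ljust(4, "0")


print(soundex("perineum"))  # P655
print(soundex("pronoun"))   # P655
print(soundex("Robert"))    # R163
```

Only the first letter and the next three coded consonants matter, which is why long words such as permanent end up under the same code as perineum.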
Time to execute MetaPhone function - 0.039936 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.000894 milliseconds
Manual Curation uses a hand-built lookup table (lexicon) that links words to their lemmas, including obvious typos and spelling variations. Not all words are covered.
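A lookup of this kind could be sketched in Python as a plain dictionary from word form to lemma. The entries below are invented for illustration and are not taken from the site's lexicon. The names `LEXICON`, `lemma_for` and `variants_of` are likewise hypothetical.

```python
from typing import List, Optional

# Hypothetical entries, invented for illustration only.
LEXICON = {
    "perineum": "PERINEUM",
    "pernyim": "PERINEUM",   # spelling variation
    "perinuem": "PERINEUM",  # obvious typo
}


def lemma_for(word: str) -> Optional[str]:
    """Return the lemma for a word, or None if the word is not covered."""
    return LEXICON.get(word.lower())


def variants_of(lemma: str) -> List[str]:
    """Return every word form in the table that is linked to the lemma."""
    return sorted(w for w, l in LEXICON.items() if l == lemma)


print(lemma_for("Pernyim"))     # PERINEUM
print(lemma_for("ablow"))       # None
print(variants_of("PERINEUM"))  # ['perineum', 'perinuem', 'pernyim']
```

A dictionary lookup does no string comparison against the rest of the corpus, which fits with this method having the shortest execution time reported above.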