A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to possum in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
possum (0) - 1 freq
bissum (2) - 2 freq
poss (2) - 5 freq
podium (2) - 6 freq
posse (2) - 4 freq
poyum (2) - 9 freq
lousum (2) - 2 freq
postrum (2) - 1 freq
fousum (2) - 1 freq
losses (3) - 15 freq
ross's (3) - 3 freq
psaums (3) - 2 freq
osnu (3) - 1 freq
poosie (3) - 4 freq
postie (3) - 24 freq
poshie (3) - 1 freq
poash (3) - 1 freq
passan (3) - 7 freq
sodium (3) - 1 freq
passed (3) - 304 freq
posher (3) - 1 freq
postal (3) - 3 freq
passé (3) - 1 freq
poyim (3) - 3 freq
boasom (3) - 1 freq
possum (0) - 1 freq
poss (3) - 5 freq
posse (3) - 4 freq
bissum (3) - 2 freq
passin (4) - 152 freq
poussed (4) - 4 freq
pisses (4) - 4 freq
piss (4) - 72 freq
poussin (4) - 1 freq
pusst (4) - 1 freq
lissom (4) - 2 freq
pussed (4) - 4 freq
pussy (4) - 18 freq
psalm (4) - 39 freq
assume (4) - 24 freq
pissd (4) - 1 freq
prism (4) - 2 freq
pissit (4) - 2 freq
pussie (4) - 6 freq
passes (4) - 68 freq
passen (4) - 2 freq
posies (4) - 6 freq
bessom (4) - 1 freq
pass (4) - 284 freq
ssm (4) - 1 freq
SoundEx code - P250
pushin - 60 freq
passin - 152 freq
pickin - 108 freq
poison - 28 freq
passion - 75 freq
pechin - 85 freq
pishin - 29 freq
peckin - 16 freq
posin - 9 freq
pyson - 4 freq
pushion - 6 freq
pokin - 18 freq
'poison - 1 freq
pizzen - 2 freq
possum - 1 freq
'pushion' - 1 freq
piece-an - 1 freq
pusshin - 1 freq
pigeon - 37 freq
packin - 20 freq
pisen - 1 freq
pacin - 11 freq
pookin - 1 freq
powkin - 24 freq
pickan - 4 freq
passan - 7 freq
pyjama - 1 freq
pagan - 17 freq
pissin - 17 freq
pikkin - 1 freq
passen - 2 freq
picsien - 1 freq
pikken - 6 freq
pouken - 1 freq
peikken - 1 freq
paaken - 1 freq
pakken - 1 freq
pooshen - 1 freq
passin' - 6 freq
pickin' - 1 freq
pussin - 1 freq
pigskin - 1 freq
pechan - 2 freq
pausin - 4 freq
puzhin - 3 freq
poosan - 1 freq
'poison' - 2 freq
puzziin - 1 freq
pig-weeyin - 1 freq
passioun - 1 freq
poachin - 5 freq
püshin - 3 freq
poochin' - 1 freq
'pokin' - 1 freq
pussion - 1 freq
peggan - 1 freq
'pooshin' - 1 freq
pooshin - 2 freq
pickeen - 1 freq
pikan - 2 freq
pushan - 3 freq
'pigeon - 1 freq
passeen - 3 freq
posiuon - 1 freq
puskan - 1 freq
pizen - 3 freq
puckin - 10 freq
pacean - 1 freq
pooshan - 1 freq
paikin - 3 freq
puzzian - 1 freq
poukin - 2 freq
pykin - 2 freq
poushion - 1 freq
puzhion - 1 freq
peekin - 3 freq
pooshion - 1 freq
poussin - 1 freq
pushioun - 1 freq
€˜pygmy - 1 freq
packham - 1 freq
peejin - 1 freq
piecin - 1 freq
pogoin - 1 freq
peghin - 1 freq
€œpiggin - 1 freq
piggin - 10 freq
pysin - 1 freq
pxm - 1 freq
peckin' - 3 freq
pishinÂ’ - 1 freq
pjsnoo - 1 freq
pvjzn - 1 freq
pzno - 1 freq
pqmoy - 1 freq
pishin' - 1 freq
packam - 1 freq
pxim - 1 freq
pasna - 1 freq
pzywn - 1 freq
pcm - 1 freq
pooskin - 1 freq
phkan - 1 freq
MetaPhone code - PSM
possum - 1 freq
POSSUM
Time to execute Levenshtein function - 0.188233 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.366896 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028580 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037609 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000814 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.