A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to pxim in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
pxim (0) - 1 freq
pxm (1) - 1 freq
prim (1) - 2 freq
leim (2) - 2 freq
poyim (2) - 3 freq
haim (2) - 39 freq
xix (2) - 1 freq
hxi (2) - 1 freq
pail (2) - 50 freq
tim (2) - 47 freq
xxm (2) - 1 freq
peir (2) - 3 freq
pfi (2) - 2 freq
xi (2) - 13 freq
exit (2) - 28 freq
pir (2) - 31 freq
naim (2) - 5 freq
thim (2) - 193 freq
ahim (2) - 3 freq
puil (2) - 37 freq
peig (2) - 1 freq
puir (2) - 548 freq
paum (2) - 2 freq
grim (2) - 53 freq
brim (2) - 17 freq
pxim (0) - 1 freq
pxm (1) - 1 freq
prim (2) - 2 freq
purim (3) - 1 freq
prom (3) - 8 freq
paix (3) - 2 freq
poem (3) - 369 freq
pima (3) - 1 freq
paum (3) - 2 freq
pixie (3) - 4 freq
perm (3) - 3 freq
penim (3) - 1 freq
nxm (3) - 1 freq
pirm (3) - 1 freq
prima (3) - 7 freq
exxm (3) - 1 freq
plum (3) - 15 freq
xm (3) - 1 freq
palm (3) - 33 freq
prime (3) - 36 freq
pcm (3) - 1 freq
pxxy (3) - 1 freq
pom (3) - 20 freq
px (3) - 1 freq
axiom (3) - 1 freq
SoundEx code - P250
pushin - 64 freq
passin - 156 freq
pickin - 111 freq
poison - 28 freq
passion - 76 freq
pechin - 86 freq
pishin - 32 freq
peckin - 16 freq
posin - 9 freq
pyson - 4 freq
pushion - 6 freq
pokin - 19 freq
'poison - 1 freq
pizzen - 2 freq
possum - 1 freq
'pushion' - 1 freq
piece-an - 1 freq
pusshin - 1 freq
pigeon - 37 freq
packin - 20 freq
pisen - 1 freq
pacin - 12 freq
pookin - 1 freq
powkin - 24 freq
pickan - 4 freq
passan - 7 freq
pyjama - 1 freq
pagan - 17 freq
pissin - 17 freq
pikkin - 1 freq
passen - 2 freq
picsien - 1 freq
pikken - 6 freq
pouken - 1 freq
peikken - 1 freq
paaken - 1 freq
pakken - 1 freq
pooshen - 1 freq
passin' - 7 freq
pickin' - 1 freq
peakin - 1 freq
pausin - 5 freq
pussin - 1 freq
pigskin - 1 freq
pechan - 2 freq
puzhin - 3 freq
poosan - 1 freq
'poison' - 2 freq
puzziin - 1 freq
pig-weeyin - 1 freq
passioun - 1 freq
poachin - 5 freq
püshin - 3 freq
poochin' - 1 freq
'pokin' - 1 freq
pussion - 1 freq
peggan - 1 freq
'pooshin' - 1 freq
pooshin - 2 freq
pickeen - 1 freq
pikan - 2 freq
pushan - 3 freq
'pigeon - 1 freq
passeen - 3 freq
posiuon - 1 freq
puskan - 1 freq
pizen - 3 freq
puckin - 10 freq
pacean - 1 freq
pooshan - 1 freq
paikin - 3 freq
puzzian - 1 freq
poukin - 2 freq
pykin - 2 freq
poushion - 1 freq
puzhion - 1 freq
peekin - 3 freq
pooshion - 1 freq
poussin - 1 freq
pushioun - 1 freq
€˜pygmy - 1 freq
packham - 1 freq
peejin - 1 freq
piecin - 1 freq
pogoin - 1 freq
peghin - 1 freq
€œpiggin - 1 freq
piggin - 10 freq
pysin - 1 freq
pxm - 1 freq
peckin' - 3 freq
pishinÂ’ - 1 freq
pjsnoo - 1 freq
pvjzn - 1 freq
pzno - 1 freq
pqmoy - 1 freq
pishin' - 1 freq
packam - 1 freq
pxim - 1 freq
pasna - 1 freq
pzywn - 1 freq
pcm - 1 freq
pooskin - 1 freq
phkan - 1 freq
MetaPhone code - PKSM
pxm - 1 freq
pxim - 1 freq
PXIM
Time to execute Levenshtein function - 0.202755 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.373620 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.039130 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037840 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000880 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.