A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to petrie in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
petrie (0) - 4 freq
pettie (1) - 1 freq
peirie (1) - 5 freq
peerie (1) - 626 freq
perrie (1) - 1 freq
poetrie (1) - 17 freq
peenie (2) - 25 freq
ferrie (2) - 7 freq
deerie (2) - 1 freq
peasie (2) - 1 freq
poetre (2) - 1 freq
pirie (2) - 5 freq
tearie (2) - 1 freq
fearie (2) - 2 freq
pete (2) - 22 freq
ettie (2) - 4 freq
peeric (2) - 1 freq
€œpetrie (2) - 3 freq
pensie (2) - 3 freq
pitie (2) - 1 freq
peérie (2) - 1 freq
wearie (2) - 10 freq
geerie (2) - 1 freq
petrify (2) - 1 freq
peeries (2) - 2 freq
petrie (0) - 4 freq
poetrie (1) - 17 freq
petir (2) - 1 freq
pettie (2) - 1 freq
peirie (2) - 5 freq
poetre (2) - 1 freq
peerie (2) - 626 freq
perrie (2) - 1 freq
pattie (3) - 2 freq
pitril (3) - 7 freq
pithie (3) - 1 freq
peerier (3) - 17 freq
petit (3) - 1 freq
peeree (3) - 2 freq
peetie (3) - 24 freq
petrol (3) - 36 freq
peri (3) - 2 freq
petrel (3) - 3 freq
peattie (3) - 5 freq
metre (3) - 13 freq
petite (3) - 1 freq
pathie (3) - 9 freq
paetie (3) - 2 freq
entrie (3) - 5 freq
pirrie (3) - 1 freq
SoundEx code - P360
potter - 29 freq
peter - 148 freq
poetrie - 17 freq
poetry - 260 freq
powder - 10 freq
pooder - 49 freq
patter - 56 freq
poother - 5 freq
pooderhaa - 1 freq
pootherie - 1 freq
pouther - 7 freq
petrie - 4 freq
poothery - 1 freq
pottery - 5 freq
poodher - 1 freq
'peter - 3 freq
petèr - 31 freq
petér - 10 freq
petér' - 1 freq
'peter' - 1 freq
poodery - 2 freq
pdr - 2 freq
'poetry - 1 freq
poyetry - 5 freq
poetry' - 2 freq
pu-thru - 1 freq
paitter - 1 freq
peedier - 7 freq
putter - 1 freq
pedro - 4 freq
poatry - 1 freq
pewther - 1 freq
pewter - 3 freq
pouetry - 1 freq
'phaedra' - 1 freq
poetre - 1 freq
poietry - 3 freq
petir - 1 freq
€˜poetry - 1 freq
phaedra - 14 freq
€˜potter - 1 freq
pitter - 2 freq
pouder - 1 freq
€œpetrie - 3 freq
peuther - 1 freq
poyitry - 1 freq
pityer - 1 freq
poiïtree - 6 freq
powdery - 1 freq
pooter - 1 freq
padaro - 1 freq
padair - 8 freq
padre - 1 freq
pattir - 2 freq
powdir - 1 freq
MetaPhone code - PTR
potter - 29 freq
peter - 148 freq
poetrie - 17 freq
poetry - 260 freq
powder - 10 freq
pooder - 49 freq
patter - 56 freq
petrie - 4 freq
pottery - 5 freq
'peter - 3 freq
petèr - 31 freq
petér - 10 freq
petér' - 1 freq
'peter' - 1 freq
poodery - 2 freq
pdr - 2 freq
'poetry - 1 freq
poetry' - 2 freq
paitter - 1 freq
peedier - 7 freq
putter - 1 freq
pedro - 4 freq
poatry - 1 freq
pewter - 3 freq
pouetry - 1 freq
poetre - 1 freq
poietry - 3 freq
petir - 1 freq
€˜poetry - 1 freq
€˜potter - 1 freq
pitter - 2 freq
pouder - 1 freq
€œpetrie - 3 freq
poiïtree - 6 freq
powdery - 1 freq
pooter - 1 freq
padaro - 1 freq
padair - 8 freq
padre - 1 freq
pattir - 2 freq
powdir - 1 freq
PETRIE
Time to execute Levenshtein function - 0.281835 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.602858 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.038850 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.087578 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001428 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.