A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow


Similar words to perth in Corpus

Results by method: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated

Levenshtein - word (distance) - frequency
perth (0) - 41 freq
erth (1) - 3 freq
werth (1) - 2 freq
perths (1) - 1 freq
herth (1) - 3 freq
pert (1) - 60 freq
perts (1) - 6 freq
perch (1) - 13 freq
perty (1) - 33 freq
peth (1) - 37 freq
derth (1) - 2 freq
berth (1) - 21 freq
dearth (2) - 9 freq
terts (2) - 2 freq
pech (2) - 35 freq
eith (2) - 26 freq
pertik (2) - 1 freq
hearth (2) - 30 freq
pete (2) - 22 freq
berty (2) - 3 freq
pets (2) - 12 freq
yirth (2) - 43 freq
pest (2) - 25 freq
pero (2) - 1 freq
warth (2) - 64 freq
Double Levenshtein - word (summed distance) - frequency
perth (0) - 41 freq
peth (2) - 37 freq
perty (2) - 33 freq
berth (2) - 21 freq
pairth (2) - 1 freq
erth (2) - 3 freq
perch (2) - 13 freq
derth (2) - 2 freq
perts (2) - 6 freq
werth (2) - 2 freq
herth (2) - 3 freq
perths (2) - 1 freq
pert (2) - 60 freq
part (3) - 196 freq
earth (3) - 253 freq
porty (3) - 1 freq
garth (3) - 3 freq
apert (3) - 12 freq
pirt (3) - 1 freq
furth (3) - 105 freq
praith (3) - 1 freq
darth (3) - 3 freq
paerts (3) - 1 freq
porto (3) - 7 freq
north (3) - 402 freq
SoundEx code - P630
poored - 56 freq
prood - 197 freq
poured - 28 freq
pairt - 933 freq
preed - 4 freq
pride - 168 freq
perth - 41 freq
period - 121 freq
pert - 60 freq
pairty - 293 freq
pourt - 2 freq
parade - 15 freq
part - 196 freq
proud - 153 freq
pree'd - 4 freq
pooered - 3 freq
pretty - 132 freq
party - 157 freq
prayed - 42 freq
peered - 26 freq
perty - 33 freq
proddy - 2 freq
prayit - 1 freq
purity - 5 freq
port - 70 freq
prod - 3 freq
prayt - 2 freq
'prod' - 1 freq
pairtie - 44 freq
pryde - 3 freq
purrit - 1 freq
proddie - 2 freq
pooerd - 1 freq
prettea - 1 freq
preetea - 1 freq
poor't - 1 freq
partey - 3 freq
pirate - 6 freq
porto - 7 freq
pairt' - 3 freq
poirot - 2 freq
preyed - 1 freq
parrot - 21 freq
pritta - 1 freq
proota - 8 freq
pourit - 4 freq
peard - 1 freq
paired - 1 freq
purdie - 16 freq
pratie - 1 freq
pouered - 4 freq
pouertae - 1 freq
parody - 5 freq
pour'd - 1 freq
pried - 2 freq
peratt - 1 freq
pierheid - 1 freq
pret - 1 freq
preat - 1 freq
purred - 8 freq
pierhead - 1 freq
parity - 2 freq
porrt - 1 freq
puirt - 2 freq
príed - 1 freq
prettie - 1 freq
purrt - 1 freq
pree't - 1 freq
pairte - 1 freq
port' - 1 freq
pritty - 2 freq
pertie - 6 freq
‘pride - 1 freq
parawd - 1 freq
pared - 2 freq
portie - 3 freq
pairth - 1 freq
puritie - 1 freq
pirt - 1 freq
pourd - 1 freq
portia - 2 freq
pouert - 2 freq
powert - 2 freq
poort - 5 freq
peart - 1 freq
powered - 2 freq
‘pairt - 5 freq
prood' - 1 freq
praid - 1 freq
praith - 1 freq
prity - 2 freq
pratta - 1 freq
'pretty - 1 freq
porty - 1 freq
paaarteeey - 1 freq
priti - 2 freq
pratt - 1 freq
'poor-oot' - 1 freq
party… - 1 freq
“pirate - 1 freq
MetaPhone code - PR0
perth - 41 freq
pairth - 1 freq
praith - 1 freq
Manually curated lemma - PERTH
Time to execute Levenshtein function - 0.222575 milliseconds
The Levenshtein distance is the minimum number of single-character replacements, insertions or deletions needed to transform one word into another. It's useful for detecting typos and alternative spellings.
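As a rough illustration of the definition above, the distance can be computed with the usual dynamic-programming recurrence. This is a minimal Python sketch and not the code behind this page. The function name `levenshtein` is chosen for illustration, and the small word list reuses frequencies from the results listed for perth. Breaking ties by descending frequency is an assumption, since the page does not say how it orders words at the same distance.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits that turn a into b."""
    # previous[j] is the distance between the prefix of a seen so far and b[:j]
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning a[:i] into the empty string takes i deletions
        for j, cb in enumerate(b, start=1):
            current.append(min(
                previous[j] + 1,               # delete ca
                current[j - 1] + 1,            # insert cb
                previous[j - 1] + (ca != cb),  # replace ca with cb (free if equal)
            ))
        previous = current
    return previous[-1]


# Frequencies copied from the results listed for "perth"
corpus = {"perth": 41, "pert": 60, "berth": 21, "dearth": 9, "hearth": 30}

# Assumption: nearest first, ties broken by higher frequency
neighbours = sorted(corpus, key=lambda w: (levenshtein("perth", w), -corpus[w]))

for word in neighbours:
    print(f"{word} ({levenshtein('perth', word)}) - {corpus[word]} freq")
```

The loop keeps only two rows of the distance table at a time, which is enough because each cell depends only on the current and previous rows.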
Time to execute Double Levenshtein function - 0.360837 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
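The idea can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the page's own implementation: the names `strip_vowels` and `double_levenshtein` are hypothetical, and treating only a, e, i, o, u as vowels (so that y counts as a consonant) is a guess that happens to agree with the scores listed for perth.

```python
VOWELS = set("aeiou")  # assumption: y is treated as a consonant


def levenshtein(a: str, b: str) -> int:
    """Plain edit distance (replace, insert, delete), computed row by row."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            current.append(min(
                previous[j] + 1,               # delete
                current[j - 1] + 1,            # insert
                previous[j - 1] + (ca != cb),  # replace
            ))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Keep only the characters that are not in VOWELS."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on their consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


for word in ("perth", "peth", "pert", "part", "north"):
    print(word, double_levenshtein("perth", word))
```

Because a consonant change is counted in both passes while a vowel change is counted only in the first, part and north score 3 against perth even though their plain distance is 2.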
Time to execute SoundEx function - 0.028181 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
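To make the encoding concrete, here is a minimal Python sketch of the classic American Soundex rules: keep the first letter, map the remaining consonants to digits, collapse adjacent repeats, and pad to four characters. It is offered as an illustration only. The page's own SoundEx function may differ in edge cases, and the handling of non-ASCII letters here (they are simply dropped) is an assumption.

```python
# Digit classes of the classic Soundex scheme
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word: str) -> str:
    """Return a four-character Soundex code, or '' if word has no ASCII letters."""
    # Assumption: anything that is not an ASCII letter is ignored
    letters = [c for c in word.lower() if c.isascii() and c.isalpha()]
    if not letters:
        return ""
    result = letters[0].upper()
    prev = CODES.get(letters[0], "")
    for ch in letters[1:]:
        code = CODES.get(ch, "")
        if code:
            if code != prev:      # adjacent letters with the same digit count once
                result += code
            prev = code
        elif ch not in "hw":      # vowels and y break a run; h and w do not
            prev = ""
        if len(result) == 4:
            break
    return result.ljust(4, "0")   # pad short codes with zeros


for word in ("perth", "pairt", "pride", "pierheid"):
    print(word, soundex(word))
```

All four sample words come from the P630 list above, which shows how coarse the code is: only the first letter and the consonant classes r and t/d survive.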
Time to execute MetaPhone function - 0.038426 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000947 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
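A lookup of this kind amounts to a dictionary access, which would explain why it is the fastest of the five methods. The Python sketch below is hypothetical: the names `LEXICON` and `lookup_lemma` are invented for illustration, and the single entry mirrors the PERTH result shown for perth rather than the real lexicon.

```python
from typing import Optional

# Hypothetical lexicon: maps a spelling (lowercased) to its lemma.
# Only the perth -> PERTH pair is taken from the results on this page.
LEXICON = {
    "perth": "PERTH",
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the curated lemma for word, or None if the word is not covered."""
    return LEXICON.get(word.lower())


print(lookup_lemma("perth"))
print(lookup_lemma("ablow"))  # not in this toy lexicon, so None
```

Returning `None` rather than raising an error reflects the note that not all words are covered.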