A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

Words similar to "paths" in the corpus

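Results are listed by method, in this order: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated. Numbers in parentheses are distances from the search word.

Levenshtein
paths (0) - 32 freq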
peths (1) - 3 freq
pats (1) - 15 freq
oaths (1) - 3 freq
paiths (1) - 1 freq
pathy (1) - 1 freq
path's (1) - 5 freq
pathos (1) - 5 freq
path (1) - 178 freq
maths (1) - 35 freq
baths (1) - 35 freq
'path' (2) - 1 freq
cathy (2) - 146 freq
pa's (2) - 1 freq
paces (2) - 14 freq
puts (2) - 41 freq
fat's (2) - 6 freq
palls (2) - 5 freq
pit's (2) - 2 freq
paws (2) - 47 freq
patti (2) - 1 freq
lanths (2) - 1 freq
peth (2) - 37 freq
hats (2) - 46 freq
pastes (2) - 1 freq
Double Levenshtein
paths (0) - 32 freq
paiths (1) - 1 freq
pathos (1) - 5 freq
peths (1) - 3 freq
maths (2) - 35 freq
baths (2) - 35 freq
pathies (2) - 1 freq
path (2) - 178 freq
oaths (2) - 3 freq
pats (2) - 15 freq
pathy (2) - 1 freq
path's (2) - 5 freq
pathie (3) - 9 freq
pts (3) - 3 freq
pots (3) - 46 freq
moths (3) - 7 freq
paets (3) - 28 freq
ruths (3) - 2 freq
patsy (3) - 8 freq
daiths (3) - 10 freq
patois (3) - 18 freq
patios (3) - 2 freq
pet's (3) - 2 freq
pehs (3) - 2 freq
pets (3) - 12 freq
SoundEx code - P320
puddocks - 25 freq
puddocks' - 1 freq
pots - 46 freq
pot's - 2 freq
patch - 79 freq
pits - 201 freq
paths - 32 freq
piteous - 3 freq
photies - 76 freq
pottage - 1 freq
pootch - 22 freq
puts - 41 freq
'photos - 1 freq
photos - 40 freq
paiths - 1 freq
poetic - 35 freq
pads - 41 freq
podgy - 2 freq
pitch - 89 freq
puddock - 34 freq
poet's - 18 freq
poets - 71 freq
path's - 5 freq
pats - 15 freq
pods - 7 freq
pudgy - 1 freq
pete's - 3 freq
peats - 49 freq
paitish - 1 freq
peat-hags - 4 freq
pathways - 4 freq
photes - 1 freq
pod's - 1 freq
pet's - 2 freq
photo's - 4 freq
patchy - 5 freq
peats' - 1 freq
pudsey - 2 freq
pets - 12 freq
pyot's - 1 freq
puddock's - 2 freq
photaes - 9 freq
potties - 1 freq
pitt's - 1 freq
paddy's - 2 freq
paddock's - 1 freq
poats - 2 freq
peety's - 3 freq
pit's - 2 freq
pieties - 1 freq
paets - 28 freq
poyets - 5 freq
patsy - 8 freq
patties - 2 freq
pouties - 1 freq
ptas - 1 freq
peths - 3 freq
pyots - 2 freq
petties - 1 freq
pathweys - 3 freq
pudgie - 3 freq
pytheas - 1 freq
pathos - 5 freq
peattie's - 1 freq
potch - 3 freq
poutch - 1 freq
photos' - 1 freq
peewits - 4 freq
pitts - 1 freq
patty's - 1 freq
puds - 1 freq
puddok - 1 freq
puttock - 1 freq
pate-hag - 2 freq
patios - 2 freq
patois - 18 freq
“patsy - 1 freq
photoies - 1 freq
‘puts - 1 freq
photas - 2 freq
'photies' - 1 freq
potash - 1 freq
puddoks - 1 freq
potts - 1 freq
pie-dish - 3 freq
poots - 3 freq
paddick - 1 freq
poyits - 1 freq
poiïts - 1 freq
pts - 3 freq
pathies - 1 freq
peteskii - 1 freq
potus - 1 freq
ptz - 1 freq
putca - 1 freq
photis - 1 freq
pootsy - 1 freq
paedos - 1 freq
MetaPhone code - P0S
paths - 32 freq
paiths - 1 freq
path's - 5 freq
peths - 3 freq
pytheas - 1 freq
pathos - 5 freq
pathies - 1 freq
Manually curated
PATHS
Time to execute Levenshtein function - 0.301133 milliseconds
The Levenshtein distance is the minimum number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
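As an illustration, the distance can be computed with the standard dynamic-programming approach. This is a minimal Python sketch, not the code behind this page; the function name `levenshtein` is chosen here for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance counting single-character inserts, deletes and replacements."""
    # prev[j] holds the distance between the processed prefix of a and b[:j].
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]  # distance from a[:i] to the empty string
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(
                prev[j] + 1,         # delete ca
                curr[j - 1] + 1,     # insert cb
                prev[j - 1] + cost,  # replace ca with cb (or keep it)
            ))
        prev = curr
    return prev[-1]


# Two pairs taken from the result list above.
print(levenshtein("paths", "peths"))
print(levenshtein("paths", "puts"))
```

For these two pairs the sketch gives 1 and 2, matching the distances shown in the Levenshtein list.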
Time to execute Double Levenshtein function - 0.597814 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
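A rough Python sketch of that idea follows. It assumes the vowels removed are a, e, i, o and u (the page does not say, though this choice reproduces the scores in the list above), and the names `strip_vowels` and `double_levenshtein` are invented for the example.

```python
VOWELS = set("aeiou")  # assumption: the page does not state which letters count as vowels


def levenshtein(a: str, b: str) -> int:
    """Edit distance counting single-character inserts, deletes and replacements."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost))
        prev = curr
    return prev[-1]


def strip_vowels(word: str) -> str:
    """Return the word with the assumed vowel letters removed."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


for other in ("paiths", "maths", "pathie"):
    print(other, double_levenshtein("paths", other))
```

A vowel-only change such as paths/paiths costs 1, because the consonant skeletons are identical, while a consonant change such as paths/maths is counted in both passes and costs 2.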
Time to execute SoundEx function - 0.027787 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
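The encoding can be sketched in Python as below. This follows the commonly described American Soundex rules (keep the first letter, map consonants to digits, collapse repeats, pad to four characters); whether this page uses exactly these rules is an assumption, and the function name `soundex` is chosen for the example.

```python
# Standard Soundex digit groups; vowels, h, w and y have no digit.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word: str) -> str:
    """Return a four-character Soundex code, or '' if the word has no ASCII letters."""
    # Ignore apostrophes, hyphens and any other non-ASCII-letter characters.
    letters = [ch for ch in word.lower() if ch.isascii() and ch.isalpha()]
    if not letters:
        return ""
    result = letters[0].upper()
    last = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate consonants with the same digit
        code = CODES.get(ch, "")
        if code and code != last:
            result += code
            if len(result) == 4:
                break
        last = code  # a vowel resets this, so a repeated digit after it is kept
    return result.ljust(4, "0")


for word in ("paths", "puddocks", "photies", "pie-dish"):
    print(word, soundex(word))
```

All four words encode to P320 under these rules, which is why such different-looking words share one SoundEx list.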
Time to execute MetaPhone function - 0.086228 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.000909 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
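In outline, such a lexicon is just a dictionary from word form to lemma. The Python sketch below is illustrative only: the table entries and the names `LEXICON`, `lookup_lemma` and `variants_of` are hypothetical and are not taken from the corpus's actual lexicon.

```python
from typing import List, Optional

# Hypothetical hand-built table: word form -> lemma (not the real lexicon).
LEXICON = {
    "paths": "PATHS",
    "paiths": "PATHS",
    "peths": "PATHS",
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the curated lemma, or None when the word is not covered."""
    return LEXICON.get(word.lower())


def variants_of(word: str) -> List[str]:
    """List every curated form that shares a lemma with the given word."""
    lemma = lookup_lemma(word)
    if lemma is None:
        return []
    return sorted(form for form, target in LEXICON.items() if target == lemma)


print(lookup_lemma("paiths"))
print(variants_of("paths"))
print(lookup_lemma("ahint"))
```

A single dictionary lookup involves no string comparison against the rest of the vocabulary, which is consistent with this method having the shortest execution time reported above.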