A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to pie-dish in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
pie-dish (0) - 3 freq
fiendish (2) - 2 freq
plenish (3) - 6 freq
pictish (3) - 36 freq
yiddish (3) - 4 freq
reddish (3) - 1 freq
paerish (3) - 1 freq
peevish (3) - 2 freq
perish (3) - 12 freq
endish (3) - 1 freq
swedish (3) - 37 freq
sie-kist (3) - 2 freq
pinkish (3) - 2 freq
bieldit (4) - 6 freq
ploukish (4) - 1 freq
pleading (4) - 1 freq
engish (4) - 2 freq
parish (4) - 30 freq
lenish (4) - 1 freq
irish (4) - 157 freq
penrith (4) - 1 freq
ticklish (4) - 2 freq
perisht (4) - 2 freq
slemish (4) - 1 freq
bieldie (4) - 1 freq
pie-dish (0) - 3 freq
fiendish (4) - 2 freq
perish (5) - 12 freq
endish (5) - 1 freq
swedish (5) - 37 freq
peevish (5) - 2 freq
pinkish (5) - 2 freq
plenish (5) - 6 freq
paerish (5) - 1 freq
pictish (5) - 36 freq
yiddish (5) - 4 freq
reddish (5) - 1 freq
a-tish (6) - 3 freq
paradise (6) - 44 freq
'dish (6) - 2 freq
pleads (6) - 5 freq
dish (6) - 73 freq
tae-bash (6) - 1 freq
popish (6) - 1 freq
laddish (6) - 1 freq
roondish (6) - 1 freq
pereesh (6) - 1 freq
plish (6) - 2 freq
auldish (6) - 1 freq
pundis (6) - 2 freq
SoundEx code - P320
puddocks - 25 freq
puddocks' - 1 freq
pots - 46 freq
pot's - 2 freq
patch - 79 freq
pits - 201 freq
paths - 32 freq
piteous - 3 freq
photies - 76 freq
pottage - 1 freq
pootch - 22 freq
puts - 41 freq
'photos - 1 freq
photos - 40 freq
paiths - 1 freq
poetic - 35 freq
pads - 41 freq
podgy - 2 freq
pitch - 89 freq
puddock - 34 freq
poet's - 18 freq
poets - 71 freq
path's - 5 freq
pats - 15 freq
pods - 7 freq
pudgy - 1 freq
pete's - 3 freq
peats - 49 freq
paitish - 1 freq
peat-hags - 4 freq
pathways - 4 freq
photes - 1 freq
pod's - 1 freq
pet's - 2 freq
photo's - 4 freq
patchy - 5 freq
peats' - 1 freq
pudsey - 2 freq
pets - 12 freq
pyot's - 1 freq
puddock's - 2 freq
photaes - 9 freq
potties - 1 freq
pitt's - 1 freq
paddy's - 2 freq
paddock's - 1 freq
poats - 2 freq
peety's - 3 freq
pit's - 2 freq
pieties - 1 freq
paets - 28 freq
poyets - 5 freq
patsy - 8 freq
patties - 2 freq
pouties - 1 freq
ptas - 1 freq
peths - 3 freq
pyots - 2 freq
petties - 1 freq
pathweys - 3 freq
pudgie - 3 freq
pytheas - 1 freq
pathos - 5 freq
peattie's - 1 freq
potch - 3 freq
poutch - 1 freq
photos' - 1 freq
peewits - 4 freq
pitts - 1 freq
patty's - 1 freq
puds - 1 freq
puddok - 1 freq
puttock - 1 freq
pate-hag - 2 freq
patios - 2 freq
patois - 18 freq
€œpatsy - 1 freq
photoies - 1 freq
€˜puts - 1 freq
photas - 2 freq
'photies' - 1 freq
potash - 1 freq
puddoks - 1 freq
potts - 1 freq
pie-dish - 3 freq
poots - 3 freq
paddick - 1 freq
poyits - 1 freq
poiïts - 1 freq
pts - 3 freq
pathies - 1 freq
peteskii - 1 freq
potus - 1 freq
ptz - 1 freq
putca - 1 freq
photis - 1 freq
pootsy - 1 freq
paedos - 1 freq
MetaPhone code - PTX
paitish - 1 freq
potash - 1 freq
pie-dish - 3 freq
PIE-DISH
Time to execute Levenshtein function - 0.228370 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.379664 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028350 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.039590 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001061 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.