A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to paint in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
paint (0) - 60 freq
daint (1) - 4 freq
pant (1) - 2 freq
print (1) - 24 freq
gaint (1) - 1 freq
plaint (1) - 1 freq
haint (1) - 3 freq
taint (1) - 2 freq
kaint (1) - 4 freq
saint (1) - 42 freq
paints (1) - 2 freq
faint (1) - 35 freq
point (1) - 393 freq
aint (1) - 13 freq
pain (1) - 240 freq
pairt (1) - 928 freq
caint (1) - 3 freq
paine (1) - 5 freq
baint (1) - 1 freq
pint (1) - 186 freq
pains (1) - 26 freq
peint (1) - 5 freq
paist (1) - 1 freq
int (2) - 31 freq
laant (2) - 1 freq
paint (0) - 60 freq
point (1) - 393 freq
peint (1) - 5 freq
pant (1) - 2 freq
pint (1) - 186 freq
peent (2) - 1 freq
paist (2) - 1 freq
pains (2) - 26 freq
poynt (2) - 1 freq
pynt (2) - 276 freq
pinot (2) - 4 freq
opint (2) - 1 freq
panto (2) - 5 freq
peynt (2) - 1 freq
pent (2) - 39 freq
panet (2) - 1 freq
pointy (2) - 6 freq
panty (2) - 1 freq
punt (2) - 9 freq
piynt (2) - 2 freq
paints (2) - 2 freq
haint (2) - 3 freq
faint (2) - 35 freq
baint (2) - 1 freq
taint (2) - 2 freq
SoundEx code - P530
pynt - 276 freq
paint - 60 freq
point - 393 freq
pint - 186 freq
peened - 10 freq
pent - 39 freq
pund - 37 freq
pend - 3 freq
pyntie - 1 freq
pound - 127 freq
punt - 9 freq
phont - 3 freq
pyned - 1 freq
panned - 6 freq
penned - 14 freq
phoned - 73 freq
peend - 4 freq
pond - 50 freq
pownte - 1 freq
pinned - 22 freq
pynit - 3 freq
poond - 13 freq
peyn't - 6 freq
peynt - 1 freq
peint - 5 freq
pawned - 2 freq
peanut - 11 freq
punto - 4 freq
pointy - 6 freq
pmt - 3 freq
puntee - 1 freq
panda - 1 freq
pennit - 1 freq
pin't - 1 freq
phone't - 1 freq
pained - 1 freq
pant - 2 freq
pinot - 4 freq
pine-widd - 2 freq
pinnet - 1 freq
poynt - 1 freq
panto - 5 freq
pened - 1 freq
piynt - 2 freq
pinnied - 1 freq
pin-heid - 1 freq
pennied - 1 freq
punt' - 1 freq
powneed - 1 freq
pynty - 1 freq
pawnd - 1 freq
panet - 1 freq
-pund - 1 freq
peent - 1 freq
pand - 3 freq
pinewood - 1 freq
pandyyyy - 2 freq
panty - 1 freq
pnd - 1 freq
penud - 1 freq
punnet - 1 freq
pmd - 1 freq
MetaPhone code - PNT
pynt - 276 freq
paint - 60 freq
point - 393 freq
pint - 186 freq
peened - 10 freq
pent - 39 freq
pund - 37 freq
pend - 3 freq
pyntie - 1 freq
pound - 127 freq
punt - 9 freq
pyned - 1 freq
panned - 6 freq
penned - 14 freq
peend - 4 freq
pond - 50 freq
pownte - 1 freq
pinned - 22 freq
pynit - 3 freq
poond - 13 freq
peyn't - 6 freq
peynt - 1 freq
peint - 5 freq
pawned - 2 freq
peanut - 11 freq
punto - 4 freq
pointy - 6 freq
puntee - 1 freq
panda - 1 freq
pennit - 1 freq
pin't - 1 freq
pained - 1 freq
pant - 2 freq
pinot - 4 freq
pinnet - 1 freq
poynt - 1 freq
panto - 5 freq
pened - 1 freq
piynt - 2 freq
pinnied - 1 freq
pennied - 1 freq
punt' - 1 freq
powneed - 1 freq
pynty - 1 freq
pawnd - 1 freq
panet - 1 freq
-pund - 1 freq
peent - 1 freq
pand - 3 freq
pandyyyy - 2 freq
panty - 1 freq
penud - 1 freq
punnet - 1 freq
PAINT
Time to execute Levenshtein function - 0.184841 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.339346 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027339 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037025 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000952 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.