A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow


Similar words to certificate in Corpus

Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated

Levenshtein
certificate (0) - 12 freq
certficate (1) - 1 freq
certificates (1) - 3 freq
certeeficate (2) - 1 freq
certification (3) - 1 freq
pontificate (3) - 1 freq
certifies (4) - 1 freq
extricate (4) - 2 freq
testifycates (4) - 9 freq
participate (4) - 2 freq
artifice (4) - 3 freq
eradicate (4) - 3 freq
predicate (5) - 11 freq
cruciate (5) - 1 freq
certainlie (5) - 2 freq
communicate (5) - 23 freq
estimate (5) - 4 freq
creiticall (5) - 1 freq
confiscate (5) - 1 freq
airtificial (5) - 1 freq
vertical (5) - 21 freq
beatification (5) - 1 freq
cultivate (5) - 2 freq
certainty (5) - 10 freq
anticipate (5) - 1 freq
Double Levenshtein
certificate (0) - 12 freq
certficate (1) - 1 freq
certeeficate (2) - 1 freq
certificates (2) - 3 freq
certification (4) - 1 freq
pontificate (5) - 1 freq
certifies (6) - 1 freq
airtifact (6) - 1 freq
artifice (6) - 3 freq
artifacts (7) - 2 freq
artificial (7) - 9 freq
certainty (7) - 10 freq
criticise (7) - 11 freq
mortification (7) - 2 freq
airtificial (7) - 1 freq
certaint (7) - 6 freq
cheetie-cat (7) - 3 freq
critical (7) - 19 freq
artefact (7) - 3 freq
participate (7) - 2 freq
eradicate (7) - 3 freq
creitical (7) - 5 freq
testifycates (7) - 9 freq
extricate (7) - 2 freq
cruciate (7) - 1 freq
SoundEx code - C631
creative - 99 freq
cardboard - 31 freq
carrot-topped - 1 freq
cairt-fu's - 1 freq
creativity - 9 freq
creative' - 1 freq
cairdboard - 12 freq
caird-board - 1 freq
certificate - 12 freq
certificates - 3 freq
'creative - 4 freq
creatively - 4 freq
cairdboord - 1 freq
creatives - 6 freq
certification - 1 freq
chairitable - 2 freq
certeeficate - 1 freq
credibility - 3 freq
certficate - 1 freq
credible - 3 freq
creativelie - 1 freq
charitable - 2 freq
‘cairdboard - 1 freq
cardiff - 1 freq
certifies - 1 freq
creativly - 1 freq
credibeelity - 1 freq
cardboardy - 1 freq
creativescots - 16 freq
creativeblock - 2 freq
cardifan - 1 freq
cerdboard - 1 freq
creativeageintl - 1 freq
creativeedin - 1 freq
MetaPhone code - SRTFKT
certificate - 12 freq
certeeficate - 1 freq
certficate - 1 freq
Manually curated lemma - CERTIFICATE
Time to execute Levenshtein function - 0.197882 milliseconds
The Levenshtein distance is the minimum number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
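As a rough illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming approach. It is not the site's own code; the function name `levenshtein` is chosen only for this example.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits needed to turn a into b."""
    # previous[j] holds the distance between the prefix of a seen so far
    # and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning i characters into "" takes i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb, or keep a match
            ))
        previous = current
    return previous[-1]


for word in ("certficate", "certificates", "certeeficate", "pontificate"):
    print(word, levenshtein("certificate", word))
```

For the four words in the loop this gives 1, 1, 2 and 3, matching the bracketed distances in the Levenshtein list.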
Time to execute Double Levenshtein function - 0.390417 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
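The idea can be sketched in Python as follows. This is an illustrative reconstruction, not the site's implementation: the helper names are made up for the example, and the vowel set `aeiouy` is an assumption (the page does not say which letters are removed, but counting y as a vowel reproduces the distances listed for certainty and testifycates).

```python
VOWELS = set("aeiouy")  # assumption: y is treated as a vowel


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance (insert, delete, replace)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop every vowel, keeping consonants and punctuation in order."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


for word in ("certficate", "certificates", "certification", "pontificate"):
    print(word, double_levenshtein("certificate", word))
```

A missing vowel such as certficate costs only 1, because the consonant skeletons are identical, while an extra consonant such as certificates is counted in both passes and costs 2.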
Time to execute SoundEx function - 0.027259 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
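To show how such a code is built, here is a minimal Python sketch of the classic American Soundex rules. It is an illustration only and may differ in edge cases from whatever function the site actually calls; the name `soundex` and the handling of non-letters (they are simply skipped) are assumptions of this example.

```python
# Consonant groups that share a Soundex digit.
SOUNDEX_CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        SOUNDEX_CODES[ch] = digit


def soundex(word: str) -> str:
    """First letter plus three digits, e.g. 'certificate' -> 'C631'."""
    letters = [ch for ch in word.lower() if ch.isalpha()]  # skip quotes, hyphens
    if not letters:
        return ""
    code = letters[0].upper()
    last_digit = SOUNDEX_CODES.get(letters[0], "")
    for ch in letters[1:]:
        digit = SOUNDEX_CODES.get(ch, "")
        if digit:
            if digit != last_digit:  # adjacent letters with the same digit count once
                code += digit
            last_digit = digit
        elif ch not in "hw":
            last_digit = ""          # a vowel separates repeated digits; h and w do not
        if len(code) == 4:
            break
    return code.ljust(4, "0")        # pad short codes with zeros


for word in ("certificate", "creative", "cardboard", "carrot-topped"):
    print(word, soundex(word))
```

All four words print C631, which is why such unrelated words share the SoundEx list: only the first letter and the next three consonant groups are kept.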
Time to execute MetaPhone function - 0.037199 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.000837 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
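A lookup of this kind can be sketched in Python as a plain dictionary. Everything here is hypothetical: the table name `LEXICON`, the function `lookup_lemma`, and the entries themselves are invented for illustration and are not taken from the real lexicon, apart from the certificate to CERTIFICATE pairing shown on this page.

```python
from typing import Optional

# Hypothetical hand-made lexicon: spelling variant -> lemma.
LEXICON = {
    "certificate": "CERTIFICATE",
    "certficate": "CERTIFICATE",    # illustrative typo entry
    "certeeficate": "CERTIFICATE",  # illustrative spelling variant
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the curated lemma, or None when the word is not covered."""
    return LEXICON.get(word.lower())


print(lookup_lemma("certficate"))
print(lookup_lemma("ablow"))
```

A single dictionary lookup involves no distance calculation at all, which fits the very short execution time reported for this method; the cost is that any word missing from the table returns nothing.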