A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to equity in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
equity (0) - 7 freq
equine (2) - 3 freq
squinty (2) - 6 freq
requit (2) - 1 freq
inequity (2) - 1 freq
fruity (2) - 3 freq
entity (2) - 17 freq
'quit (2) - 1 freq
equip (2) - 1 freq
quite (2) - 475 freq
'quite (2) - 3 freq
quit (2) - 28 freq
quits (2) - 3 freq
enmity (2) - 4 freq
equate (2) - 5 freq
enquiry (2) - 7 freq
equality (2) - 34 freq
quinty (2) - 1 freq
vaunty (3) - 2 freq
'quate (3) - 1 freq
squint (3) - 13 freq
security (3) - 54 freq
cavity (3) - 2 freq
fourty (3) - 1 freq
quiat (3) - 2 freq
equity (0) - 7 freq
quit (2) - 28 freq
quite (2) - 475 freq
equate (2) - 5 freq
quiet (3) - 239 freq
quet (3) - 3 freq
quinty (3) - 1 freq
inequity (3) - 1 freq
quate (3) - 161 freq
quot (3) - 15 freq
quitie (3) - 2 freq
quyt (3) - 3 freq
quait (3) - 131 freq
quiat (3) - 2 freq
quota (3) - 1 freq
equality (3) - 34 freq
equip (3) - 1 freq
'quit (3) - 1 freq
quote (3) - 50 freq
'quite (3) - 3 freq
requit (3) - 1 freq
equine (3) - 3 freq
quits (3) - 3 freq
quat (3) - 31 freq
buty (4) - 3 freq
SoundEx code - E230
eichty - 7 freq
eicht - 61 freq
eesed - 65 freq
eest - 24 freq
eight - 66 freq
est - 22 freq
eaught - 1 freq
eejit - 69 freq
eiked - 3 freq
eighth - 4 freq
east - 304 freq
eikit - 65 freq
echtie - 4 freq
eschewed - 2 freq
echoed - 15 freq
echt - 114 freq
exit - 27 freq
eeejit - 1 freq
eegit - 4 freq
ecuid - 3 freq
echaed - 2 freq
eked - 3 freq
echae'd - 1 freq
egged - 5 freq
eeight - 1 freq
eichtie - 2 freq
echty - 22 freq
eestae - 3 freq
eastae - 1 freq
ejit - 1 freq
eesta - 1 freq
exude - 4 freq
eighty - 12 freq
eggheid - 1 freq
excite - 2 freq
'eicht - 2 freq
ect - 25 freq
echth - 1 freq
eaucht - 1 freq
eeyjit - 1 freq
eyght - 4 freq
¬‚eggit - 1 freq
egt - 1 freq
equate - 5 freq
exceed - 1 freq
eekit - 6 freq
esto - 2 freq
equity - 7 freq
eskside - 1 freq
ees't - 8 freq
eeside - 1 freq
eastawa - 3 freq
ekit - 1 freq
eased - 3 freq
eigged - 1 freq
eiged - 1 freq
€˜east - 1 freq
'exit' - 1 freq
€œeight - 1 freq
€™est - 4 freq
€œexit - 1 freq
eused - 1 freq
€˜eighty - 1 freq
ekd - 1 freq
echyty - 1 freq
eoggudh - 1 freq
exdt - 1 freq
ezd - 1 freq
eijit - 1 freq
eaxxd - 1 freq
ejjit - 1 freq
egid - 1 freq
esd - 1 freq
ezsgwt - 1 freq
eyesite - 1 freq
exsdu - 1 freq
MetaPhone code - EKT
eiked - 3 freq
eikit - 65 freq
ecuid - 3 freq
eked - 3 freq
egged - 5 freq
eggheid - 1 freq
ect - 25 freq
¬‚eggit - 1 freq
egt - 1 freq
equate - 5 freq
eekit - 6 freq
equity - 7 freq
ekit - 1 freq
eigged - 1 freq
ekd - 1 freq
eoggudh - 1 freq
EQUITY
Time to execute Levenshtein function - 0.717019 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 1.188721 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.098641 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.109858 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000819 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.