A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie


Similar words to yerself in Corpus

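Results are listed in five groups, in this order: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated. Numbers in brackets are distances from the search word.

Levenshtein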
yerself (0) - 17 freq
yersel' (1) - 96 freq
yeself (1) - 1 freq
yersell (1) - 26 freq
yerrself (1) - 1 freq
yurself (1) - 3 freq
yersels (1) - 97 freq
yersel (1) - 920 freq
herself (1) - 41 freq
yirself (1) - 5 freq
yirsels (2) - 6 freq
yesel (2) - 1 freq
yirsel (2) - 89 freq
yerselll (2) - 1 freq
hersef (2) - 1 freq
hersel (2) - 1059 freq
yursel' (2) - 1 freq
yursels (2) - 1 freq
hersell (2) - 12 freq
yeirsell (2) - 13 freq
wersels (2) - 1 freq
yer'sel (2) - 2 freq
yersels' (2) - 1 freq
yourself (2) - 42 freq
yerseu' (2) - 1 freq

Double Levenshtein
yerself (0) - 17 freq
yurself (1) - 3 freq
yirself (1) - 5 freq
yourself (2) - 42 freq
yersel' (2) - 96 freq
yersel (2) - 920 freq
herself (2) - 41 freq
yersels (2) - 97 freq
yersell (2) - 26 freq
yeself (2) - 1 freq
yerrself (2) - 1 freq
uersel (3) - 1 freq
ersel (3) - 36 freq
eersell (3) - 1 freq
eersels (3) - 1 freq
yeirsels (3) - 1 freq
yirsell (3) - 1 freq
yeirsel (3) - 2 freq
yaersels (3) - 1 freq
yursel (3) - 10 freq
hurself (3) - 1 freq
eersel (3) - 28 freq
meself (3) - 2 freq
yursel' (3) - 1 freq
yirsel (3) - 89 freq

SoundEx code - Y624
yersel - 920 freq
yerself - 17 freq
yourself - 42 freq
yirsel - 89 freq
yoursel - 45 freq
yoursels - 14 freq
yirself - 5 freq
yourselves - 7 freq
yersels-but - 1 freq
yersels - 97 freq
yeirsel - 2 freq
yersell - 26 freq
yirsell - 1 freq
yersel' - 96 freq
yersail - 1 freq
yerscell - 1 freq
yer'sel - 2 freq
yirsels - 6 freq
yurself - 3 freq
yursel - 10 freq
yursel' - 1 freq
yursels - 1 freq
yersels' - 1 freq
yaersels - 1 freq
yoursel' - 1 freq
'yersel - 2 freq
yeirsell - 13 freq
yourselfl' - 1 freq
yirsael - 2 freq
yersel's - 10 freq
yerrself - 1 freq
yourself' - 1 freq
yeirsels - 1 freq
yeirsells - 1 freq
yowersel - 1 freq
yerselll - 1 freq
yersel’ - 1 freq
yircel - 2 freq

MetaPhone code - YRSLF
yerself - 17 freq
yourself - 42 freq
yirself - 5 freq
yurself - 3 freq
yerrself - 1 freq
yourself' - 1 freq

Manually curated - YERSELF
ye - 20880 freq
you - 6601 freq
ya - 481 freq
du - 727 freq
yer - 8074 freq
your - 1733 freq
ye'r - 256 freq
yi - 1223 freq
yir - 1409 freq
yae - 1059 freq
yeh - 7 freq
yersel - 920 freq
yerself - 17 freq
yourself - 42 freq
ye'll - 962 freq
you'll - 163 freq
ye're - 1060 freq
u - 523 freq
ye've - 614 freq
ye'd - 445 freq
Time to execute Levenshtein function - 0.202843 milliseconds
The Levenshtein distance is the minimum number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
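As a rough illustration of that definition, here is a minimal Python sketch of the standard dynamic-programming Levenshtein distance. It is not the code behind this page; the function name `levenshtein` is chosen for the example only.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits that turn a into b."""
    # previous[j] = distance between the prefix of a processed so far
    # and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning i characters of a into "" takes i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (or keep it)
            ))
        previous = current
    return previous[-1]


if __name__ == "__main__":
    for word in ["yerself", "yersel", "yourself", "hersel"]:
        print(word, levenshtein("yerself", word))
```

Run against the search word, this gives 0 for yerself, 1 for yersel, and 2 for both yourself and hersel, matching the bracketed figures in the Levenshtein list.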
Time to execute Double Levenshtein function - 0.377600 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
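The idea can be sketched in Python as below. This is an illustration under stated assumptions, not the site's code: the names `strip_vowels` and `double_levenshtein` are made up for the example, and the vowel set is assumed to be a, e, i, o, u plus y. The page does not say which letters count as vowels; including y happens to reproduce the bracketed scores in the Double Levenshtein list (for example 3 for ersel).

```python
def levenshtein(a: str, b: str) -> int:
    """Standard edit distance: insertions, deletions and replacements."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


# Assumption: y is treated as a vowel (not stated on the page).
VOWELS = set("aeiouy")


def strip_vowels(word: str) -> str:
    """Keep only the characters that are not in the assumed vowel set."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the vowel-free skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


if __name__ == "__main__":
    for word in ["yurself", "yourself", "yersel", "ersel"]:
        print(word, double_levenshtein("yerself", word))
```

Because the consonant skeleton is compared a second time, a vowel swap such as yurself costs 1 in total, while dropping the final f (yersel) costs 2.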
Time to execute SoundEx function - 0.028243 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
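For readers who want to see how such a code is built, here is a minimal Python sketch of the classic American Soundex rules: keep the first letter, map the remaining consonants to digits, collapse adjacent repeats, and pad or cut to four characters. It is an illustration only; the implementation used by this page may differ in edge cases, and skipping non-letter characters such as apostrophes and hyphens is an assumption of the sketch.

```python
# Digit assigned to each consonant group; vowels, y, h and w have no digit.
SOUNDEX_CODES = {}
for digit, group in {"1": "bfpv", "2": "cgjkqsxz", "3": "dt",
                     "4": "l", "5": "mn", "6": "r"}.items():
    for letter in group:
        SOUNDEX_CODES[letter] = digit


def soundex(word: str) -> str:
    """Return the four-character Soundex code, or "" if word has no a-z letters."""
    # Assumption: anything that is not a-z is ignored.
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    encoded = letters[0].upper()
    last_code = SOUNDEX_CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate consonants with the same digit
        code = SOUNDEX_CODES.get(ch, "")
        if code and code != last_code:
            encoded += code
        last_code = code  # a vowel resets this, so a repeated digit can follow
    return (encoded + "000")[:4]


if __name__ == "__main__":
    for word in ["yerself", "yersel", "yourselves", "yircel"]:
        print(word, soundex(word))
```

All four words in the usage example come out as Y624, the code shown for the SoundEx group above.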
Time to execute MetaPhone function - 0.040349 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.001043 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
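A lookup of this kind can be sketched in Python as follows. Everything here is hypothetical: the three-entry `LEXICON`, the lemma label and the function name `curated_neighbours` are invented for illustration and are not taken from the corpus's actual table.

```python
from collections import defaultdict

# Hypothetical hand-built lexicon: spelling variant -> lemma label.
LEXICON = {
    "yersel": "YERSELF",
    "yerself": "YERSELF",
    "yourself": "YERSELF",
}

# Reverse index built once: lemma label -> all variants filed under it.
VARIANTS_BY_LEMMA = defaultdict(list)
for variant, lemma in LEXICON.items():
    VARIANTS_BY_LEMMA[lemma].append(variant)


def curated_neighbours(word: str) -> list[str]:
    """Other spellings sharing the word's lemma; empty if the word is not covered."""
    key = word.lower()
    lemma = LEXICON.get(key)
    if lemma is None:
        return []  # not every word is in a hand-built table
    return [v for v in VARIANTS_BY_LEMMA[lemma] if v != key]


if __name__ == "__main__":
    print(curated_neighbours("yerself"))
    print(curated_neighbours("sonsie"))
```

Since the work is two dictionary lookups rather than a comparison against every word in the corpus, it is plausible that this is why the curated method has the shortest timing above.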