A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to thysel in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
thysel (0) - 1 freq
thyself (1) - 1 freq
thysell (1) - 1 freq
twyse (2) - 1 freq
tassel (2) - 2 freq
thirsel (2) - 5 freq
tinsel (2) - 12 freq
theesel (2) - 6 freq
tossel (2) - 1 freq
thimel (2) - 4 freq
thersel (2) - 1 freq
chisel (2) - 10 freq
thase (2) - 13 freq
thyme (2) - 4 freq
thumsel (2) - 1 freq
hisel (2) - 18 freq
these' (2) - 1 freq
thys (2) - 1 freq
those (2) - 296 freq
thyne (2) - 1 freq
tyse (2) - 1 freq
these (2) - 1129 freq
theses (2) - 4 freq
mysel (2) - 131 freq
chysen (2) - 1 freq
thysel (0) - 1 freq
theesel (2) - 6 freq
thysell (2) - 1 freq
thyself (2) - 1 freq
thys (3) - 1 freq
hisel (3) - 18 freq
these (3) - 1129 freq
these' (3) - 1 freq
themsel (3) - 8 freq
theesael (3) - 1 freq
thumsel (3) - 1 freq
theses (3) - 4 freq
those (3) - 296 freq
tossel (3) - 1 freq
tinsel (3) - 12 freq
tassel (3) - 2 freq
thimel (3) - 4 freq
thirsel (3) - 5 freq
chisel (3) - 10 freq
thersel (3) - 1 freq
thase (3) - 13 freq
thees (4) - 4 freq
thule (4) - 5 freq
thes (4) - 12 freq
thistle (4) - 48 freq
SoundEx code - T240
this'll - 10 freq
teckle - 5 freq
'this'll - 1 freq
taigle - 10 freq
tickle - 13 freq
tackle - 30 freq
tassel - 2 freq
theesel - 6 freq
ïtsel - 4 freq
thïs'll - 2 freq
thickly - 1 freq
'these'll - 1 freq
tkull - 1 freq
theesael - 1 freq
tequila - 1 freq
tekkil - 3 freq
t'sæl - 1 freq
th'eeswal - 2 freq
tossel - 1 freq
tiscali - 1 freq
thysell - 1 freq
taigil - 2 freq
tuckwell - 1 freq
touzly - 1 freq
touzle - 1 freq
tissil - 1 freq
thysel - 1 freq
tesla - 2 freq
tooslie - 1 freq
tsl - 2 freq
tslaw - 1 freq
taikle - 1 freq
thicklie - 1 freq
tql - 1 freq
toozle - 1 freq
toggle - 1 freq
tchell - 1 freq
MetaPhone code - 0SL
this'll - 10 freq
'this'll - 1 freq
theesel - 6 freq
thïs'll - 2 freq
'these'll - 1 freq
theesael - 1 freq
thysell - 1 freq
thysel - 1 freq
THYSEL
Time to execute Levenshtein function - 0.209762 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.353714 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027706 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.036910 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000925 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.