A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to ythan in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
ythan (0) - 6 freq
than (1) - 2715 freq
'than (1) - 1 freq
ethan (1) - 1 freq
thans (2) - 3 freq
that (2) - 26604 freq
yan (2) - 1 freq
lhan (2) - 1 freq
thae (2) - 1219 freq
itan (2) - 1 freq
then (2) - 4451 freq
whan (2) - 2751 freq
thai (2) - 445 freq
dehan (2) - 1 freq
tha' (2) - 11 freq
wuhan (2) - 2 freq
yican (2) - 1 freq
yusan (2) - 2 freq
yeman (2) - 1 freq
thaen (2) - 2 freq
stran (2) - 2 freq
thay (2) - 703 freq
tman (2) - 1 freq
thoan (2) - 1 freq
-that (2) - 1 freq
ythan (0) - 6 freq
ethan (1) - 1 freq
than (1) - 2715 freq
thain (2) - 6 freq
thaen (2) - 2 freq
thaun (2) - 2 freq
thoan (2) - 1 freq
thon (2) - 2518 freq
thyn (2) - 1 freq
ithin (2) - 124 freq
thn (2) - 1 freq
thane (2) - 2 freq
thin (2) - 317 freq
athin (2) - 226 freq
'than (2) - 1 freq
then (2) - 4451 freq
thun (2) - 2 freq
bethan (3) - 1 freq
kythin (3) - 33 freq
ethno (3) - 1 freq
thine (3) - 15 freq
athene (3) - 1 freq
yhn (3) - 2 freq
i'ythan (3) - 1 freq
tha (3) - 6292 freq
SoundEx code - Y350
yet-an - 2 freq
yit-an - 2 freq
yotun - 9 freq
yowtin - 2 freq
yae-time - 1 freq
ythan - 6 freq
ydm - 1 freq
ytno - 1 freq
MetaPhone code - 0N
thon - 2518 freq
then - 4451 freq
thin - 317 freq
than - 2715 freq
then-ah - 3 freq
then- - 2 freq
then--- - 1 freq
'then - 38 freq
'thon - 11 freq
th'ane - 1 freq
thine - 15 freq
theen - 2 freq
then' - 3 freq
the-nou - 4 freq
thane - 2 freq
thin' - 1 freq
thein - 23 freq
thain - 6 freq
thn - 1 freq
thone - 13 freq
'than - 1 freq
thun - 2 freq
thoan - 1 freq
thenow - 3 freq
thenoo - 14 freq
thon' - 4 freq
thin-a - 1 freq
thaun - 2 freq
thaen - 2 freq
'thone - 1 freq
thyne - 1 freq
thinn - 2 freq
then - 6 freq
than - 1 freq
then - 1 freq
wthin - 1 freq
thon - 15 freq
thon - 1 freq
thon - 1 freq
thine - 1 freq
thon - 2 freq
then - 1 freq
than - 3 freq
thyn - 1 freq
then - 2 freq
th’n - 4 freq
thi’n - 2 freq
thi'n - 1 freq
th’in - 1 freq
‘thon - 1 freq
thin’ - 1 freq
ythan - 6 freq
YTHAN
Time to execute Levenshtein function - 0.207890 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.530593 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027555 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.071325 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000824 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.