A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to yset in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
yset (0) - 2 freq
'set (1) - 2 freq
yet (1) - 1020 freq
set (1) - 1525 freq
yeet (1) - 1 freq
yised (2) - 4 freq
aet (2) - 102 freq
set' (2) - 2 freq
hyst (2) - 5 freq
sey (2) - 62 freq
uses (2) - 47 freq
net (2) - 87 freq
yeat (2) - 2 freq
yeti (2) - 1 freq
sert (2) - 2 freq
yye (2) - 2 freq
'sex (2) - 1 freq
mysel (2) - 131 freq
sse (2) - 27 freq
onset (2) - 7 freq
sez (2) - 30 freq
yeee (2) - 1 freq
quet (2) - 3 freq
setn (2) - 1 freq
seth (2) - 4 freq
yset (0) - 2 freq
set (1) - 1525 freq
isit (2) - 1 freq
seet (2) - 5 freq
sat (2) - 749 freq
yest (2) - 2 freq
seyt (2) - 2 freq
ist (2) - 11 freq
suet (2) - 9 freq
est (2) - 22 freq
st (2) - 384 freq
saet (2) - 39 freq
seti (2) - 9 freq
usit (2) - 4 freq
seit (2) - 1 freq
syt (2) - 1 freq
sit (2) - 674 freq
seto (2) - 1 freq
sot (2) - 6 freq
yist (2) - 14 freq
sut (2) - 161 freq
seat (2) - 277 freq
yet (2) - 1020 freq
yeet (2) - 1 freq
'set (2) - 2 freq
SoundEx code - Y230
yokit - 73 freq
yist - 14 freq
yased - 18 freq
yaised - 238 freq
yaized - 5 freq
yacht - 11 freq
yuisst - 5 freq
yuist - 2 freq
yoost - 1 freq
yoosed - 1 freq
yaist - 21 freq
yest - 2 freq
yuised - 4 freq
yazed - 1 freq
yiggid - 1 freq
yaisit - 4 freq
yokit' - 1 freq
yeuched - 2 freq
yeast - 11 freq
yocked - 1 freq
yakkit - 2 freq
yockit - 5 freq
yokkt - 3 freq
yokkit - 4 freq
yoostae - 4 freq
yowkit - 2 freq
yoked - 1 freq
yeesed - 1 freq
yeusd - 2 freq
yjd - 1 freq
yised - 4 freq
ysthie - 1 freq
ygt - 1 freq
yessaday - 1 freq
yset - 2 freq
yxd - 1 freq
MetaPhone code - ST
said - 11590 freq
sat - 749 freq
set - 1525 freq
city - 288 freq
stey - 170 freq
side - 1229 freq
seed - 117 freq
sit - 674 freq
suit - 160 freq
sooty - 10 freq
saut - 122 freq
st - 384 freq
seat - 277 freq
stooy - 1 freq
'said - 5 freq
sotto - 2 freq
sutt - 1 freq
sate - 64 freq
sad - 205 freq
soud - 36 freq
sade - 469 freq
seid - 11 freq
suet - 9 freq
said- - 10 freq
-said - 2 freq
ceety - 28 freq
suid - 47 freq
ceity - 22 freq
stew - 51 freq
saddo - 5 freq
stay - 159 freq
wyced - 2 freq
saet - 39 freq
sud - 92 freq
hyst - 5 freq
stee - 1 freq
sautie - 4 freq
'sit - 7 freq
'stay - 2 freq
sado - 1 freq
seyd - 86 freq
site - 100 freq
sod - 22 freq
sut - 161 freq
sadie - 25 freq
'st - 1 freq
cid - 25 freq
sta - 4 freq
suite - 16 freq
sowt - 10 freq
sidey - 2 freq
staw - 10 freq
settee - 30 freq
satt - 2 freq
city' - 2 freq
saat - 29 freq
saed - 183 freq
sot - 6 freq
sauty - 6 freq
saa'ed - 1 freq
seet - 5 freq
see't - 20 freq
sait - 8 freq
soda - 20 freq
see'd - 1 freq
sute - 1 freq
suiht - 1 freq
sid - 19 freq
ceetie - 3 freq
saat'y - 1 freq
syde - 37 freq
cit - 4 freq
saad - 6 freq
soat - 1 freq
satty - 2 freq
sayd - 4 freq
ste - 41 freq
say'd - 2 freq
sad' - 1 freq
say't - 3 freq
swyte - 12 freq
zzat - 1 freq
stie - 6 freq
'set' - 1 freq
stewy - 4 freq
side' - 7 freq
saudi - 6 freq
saud - 2 freq
stei - 6 freq
steh - 1 freq
s-t - 1 freq
zat - 11 freq
situ - 2 freq
suede - 2 freq
citie - 10 freq
sït - 10 freq
'sït - 1 freq
'stey - 2 freq
sodae - 2 freq
stae - 1 freq
suite' - 1 freq
set' - 2 freq
soity - 1 freq
s't - 1 freq
seit - 1 freq
wyste - 2 freq
sautty - 1 freq
staa - 6 freq
sidda - 1 freq
ösed - 7 freq
sood - 37 freq
'set - 2 freq
cite - 2 freq
'city - 1 freq
site' - 2 freq
stow - 7 freq
hcid - 1 freq
saut' - 1 freq
hyste - 9 freq
ceti - 1 freq
soot - 10 freq
sattie - 1 freq
sæ'at - 1 freq
hæst - 1 freq
staey - 1 freq
zoot - 2 freq
seyt - 2 freq
sett - 4 freq
æst - 1 freq
sed - 175 freq
swæt - 1 freq
zed - 1 freq
sade- - 3 freq
øsed - 40 freq
”xt - 1 freq
sätta - 2 freq
sty - 3 freq
ceitie - 19 freq
saide - 5 freq
wycit - 1 freq
ssd - 12 freq
saa't - 1 freq
stw - 43 freq
syd - 1 freq
€œsit - 5 freq
€˜set - 1 freq
'soot' - 1 freq
hhsssst - 1 freq
sidie - 2 freq
stue - 11 freq
swyty - 3 freq
seti - 9 freq
€˜city - 1 freq
€˜sit - 1 freq
stiy - 1 freq
'stay' - 1 freq
zit - 2 freq
staiy - 13 freq
€™side - 2 freq
soed - 1 freq
€˜stew - 1 freq
soad - 1 freq
soote - 1 freq
saaty - 2 freq
sywte - 1 freq
høst - 3 freq
sd - 7 freq
stu - 1 freq
cede - 1 freq
ceud - 1 freq
xdaw - 1 freq
zt - 2 freq
ceedee - 1 freq
½st - 1 freq
sti - 1 freq
sati - 1 freq
cittie - 2 freq
syt - 1 freq
hst - 1 freq
setee - 1 freq
soit - 1 freq
xd - 4 freq
s'at - 1 freq
xut - 1 freq
cet - 1 freq
“zat” - 1 freq
stwy - 1 freq
'side - 1 freq
seedy - 1 freq
zowt - 1 freq
zto - 1 freq
zzhd - 1 freq
zeta - 1 freq
xt - 1 freq
zd - 1 freq
seto - 1 freq
yset - 2 freq
zdd - 1 freq
'side' - 1 freq
YSET
Time to execute Levenshtein function - 0.194257 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.341587 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.033154 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037712 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000911 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.