A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to sime in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
sime (0) - 1 freq
seme (1) - 1 freq
sire (1) - 7 freq
size (1) - 216 freq
skime (1) - 2 freq
mime (1) - 1 freq
saime (1) - 14 freq
site (1) - 94 freq
rime (1) - 5 freq
some (1) - 3192 freq
side (1) - 895 freq
stime (1) - 2 freq
time (1) - 4514 freq
hime (1) - 2 freq
syme (1) - 15 freq
sume (1) - 2 freq
slime (1) - 5 freq
lime (1) - 23 freq
sim (1) - 36 freq
same (1) - 1320 freq
sile (1) - 4 freq
ime (1) - 1 freq
sie (1) - 33 freq
sine (1) - 16 freq
rimie (2) - 2 freq
sime (0) - 1 freq
some (1) - 3192 freq
sume (1) - 2 freq
saime (1) - 14 freq
syme (1) - 15 freq
sim (1) - 36 freq
same (1) - 1320 freq
seme (1) - 1 freq
sine (2) - 16 freq
smeu (2) - 2 freq
smo (2) - 1 freq
som (2) - 2 freq
smue (2) - 1 freq
saim (2) - 1 freq
sma (2) - 189 freq
soume (2) - 1 freq
sam (2) - 335 freq
sem (2) - 1 freq
semi (2) - 16 freq
sie (2) - 33 freq
sum (2) - 288 freq
sm (2) - 4 freq
sire (2) - 7 freq
size (2) - 216 freq
stime (2) - 2 freq
SoundEx code - S500
syne - 1397 freq
sayin - 773 freq
seen - 1477 freq
shone - 32 freq
some - 3192 freq
sin - 602 freq
same - 1320 freq
seem - 299 freq
shame - 126 freq
sun - 521 freq
somehow - 50 freq
snaw - 129 freq
sune - 47 freq
saun - 7 freq
shiny - 55 freq
soun - 122 freq
suin - 221 freq
skim - 4 freq
shaein - 1 freq
shaen - 3 freq
skin - 186 freq
soon - 547 freq
seein - 313 freq
sheen - 94 freq
shawin - 46 freq
sma - 189 freq
'syne - 8 freq
sea-maw - 2 freq
snow - 54 freq
scene - 84 freq
shane - 57 freq
son - 380 freq
'son - 2 freq
shuin - 57 freq
shine - 61 freq
sweeney - 1 freq
'some - 7 freq
süne - 1 freq
skinny - 44 freq
somehou - 16 freq
siine - 1 freq
'siine - 1 freq
sum - 288 freq
somme - 10 freq
scan - 11 freq
sam - 335 freq
scum - 8 freq
sane - 5 freq
shan - 9 freq
swim - 39 freq
sayn - 2 freq
showin - 57 freq
sayan - 39 freq
san - 37 freq
scheme - 53 freq
'seein - 1 freq
sinewy - 1 freq
shoon - 13 freq
sweein - 4 freq
snaa - 66 freq
shin - 23 freq
somewey - 35 freq
sawney - 3 freq
swum - 6 freq
shune - 25 freq
sannie - 6 freq
sen - 104 freq
sawin - 6 freq
'scone - 1 freq
some' - 7 freq
somewye - 59 freq
shewn - 13 freq
sweem - 22 freq
scone - 31 freq
shown - 43 freq
seyin - 15 freq
showein - 6 freq
skewin - 1 freq
sown - 3 freq
somehoo - 20 freq
sein - 60 freq
soom - 5 freq
sammy - 31 freq
'ssssssaaaaamy - 1 freq
suhin - 10 freq
swine - 23 freq
swam - 19 freq
snah - 1 freq
seun - 10 freq
saam - 2 freq
swayan - 6 freq
sweem-' - 1 freq
sma' - 11 freq
skene - 12 freq
sheena - 36 freq
shawn - 34 freq
saim - 1 freq
swain - 2 freq
sunny - 157 freq
sweyin - 4 freq
swayin - 3 freq
shimmy - 2 freq
sewin - 11 freq
sume - 2 freq
sanny - 7 freq
semi - 16 freq
sewn - 3 freq
s'no - 1 freq
swawin - 1 freq
sim - 36 freq
sinn - 206 freq
senn - 11 freq
saen - 71 freq
smmaa - 1 freq
shem - 3 freq
ïsnae - 2 freq
smaa - 96 freq
sheeny - 18 freq
sonny - 5 freq
sen' - 8 freq
sayin' - 7 freq
som' - 2 freq
sum' - 1 freq
shun' - 1 freq
sauna - 7 freq
s'naewye - 1 freq
s'nae - 1 freq
shyin - 1 freq
saain - 5 freq
sane' - 1 freq
'same - 3 freq
swne - 1 freq
sham - 7 freq
sheen' - 1 freq
shewin - 14 freq
sin' - 2 freq
soun' - 2 freq
shounn - 1 freq
showan - 5 freq
seween - 1 freq
sewan - 1 freq
smaw - 36 freq
seen-nae - 1 freq
swanee - 1 freq
shön - 40 freq
sken - 2 freq
shon - 7 freq
ösin - 1 freq
sae-an - 1 freq
sanna - 3 freq
sain - 10 freq
shaain - 13 freq
syne' - 3 freq
'sunny' - 1 freq
'seen' - 2 freq
seen' - 2 freq
'sun - 1 freq
sna - 162 freq
sayeen - 2 freq
swona - 1 freq
so'm - 1 freq
sonia - 2 freq
snawy - 6 freq
some- - 1 freq
schawin - 3 freq
saunie - 1 freq
sei'm - 2 freq
sein'm - 1 freq
skame - 4 freq
sunnie - 1 freq
sunn - 8 freq
sei-in - 1 freq
schawn - 1 freq
sueno - 1 freq
seean - 14 freq
skewan - 1 freq
smoo - 1 freq
shaan - 24 freq
shaem - 2 freq
som - 2 freq
'soon - 1 freq
'sun' - 2 freq
sawn - 6 freq
scoom - 2 freq
sine - 16 freq
shüne - 1 freq
syn - 6 freq
sm - 4 freq
snowy - 4 freq
seam - 6 freq
sweean - 1 freq
sheem - 1 freq
sam' - 1 freq
swoon - 2 freq
sumwie - 1 freq
simwie - 1 freq
somewie - 1 freq
shona - 16 freq
saime - 14 freq
sine' - 1 freq
schoon - 1 freq
shoom - 1 freq
shinn - 3 freq
soum - 5 freq
swan - 19 freq
skime - 2 freq
shien - 2 freq
skjin - 1 freq
swiem - 1 freq
syme - 15 freq
'sen - 1 freq
sean - 15 freq
skein - 4 freq
so-an - 1 freq
shawnee - 1 freq
seine - 1 freq
suomi - 2 freq
sumwye - 6 freq
shün - 5 freq
øsin - 5 freq
somewoy - 1 freq
schein - 1 freq
schon - 1 freq
schone - 1 freq
sie-maw - 1 freq
smue - 1 freq
'syme - 1 freq
'son' - 1 freq
sinai - 2 freq
sum-wey - 1 freq
sumwey - 1 freq
'seen - 1 freq
scam - 10 freq
skeen - 1 freq
sweyn - 1 freq
schemie - 2 freq
sìne - 1 freq
son - 1 freq
sonne - 1 freq
soume - 1 freq
sinny - 21 freq
shun - 7 freq
'sgian - 1 freq
sgian' - 1 freq
scummy - 2 freq
sein-na - 1 freq
skoun - 2 freq
suin - 2 freq
schane - 1 freq
seme - 1 freq
sawnie - 5 freq
schame - 2 freq
some - 1 freq
summa - 1 freq
some - 3 freq
same - 1 freq
sin - 1 freq
saem - 4 freq
sweem - 1 freq
'sheeom - 1 freq
sheeom - 2 freq
swyin - 1 freq
siena - 2 freq
ski-in' - 1 freq
sna' - 2 freq
skiin - 4 freq
skiin - 1 freq
sommie - 1 freq
some - 10 freq
skaum - 1 freq
seiin - 2 freq
sæm - 4 freq
sin - 2 freq
sno' - 1 freq
swem - 3 freq
smou - 3 freq
smeu - 2 freq
syne - 1 freq
seen - 2 freq
syne - 4 freq
shinny - 3 freq
shinny - 1 freq
sgian - 1 freq
scma - 1 freq
sime - 1 freq
showin' - 1 freq
sewin' - 1 freq
seeän - 1 freq
sonya - 4 freq
simmy - 1 freq
swinney - 2 freq
smo - 1 freq
shanny - 9 freq
seoiníní - 1 freq
sxm - 1 freq
shinnie - 1 freq
shim - 1 freq
ssm - 1 freq
shuna - 1 freq
sewin’ - 1 freq
somewhy - 1 freq
snaay - 1 freq
seein' - 2 freq
sony - 1 freq
smh - 3 freq
snowey - 1 freq
schemey - 1 freq
sno - 1 freq
shaun - 1 freq
'shin' - 2 freq
'soon' - 1 freq
shum - 1 freq
snow” - 1 freq
smuha - 2 freq
'snaa - 1 freq
“shoon - 1 freq
seanie - 4 freq
sn - 1 freq
'snaw' - 1 freq
saein - 1 freq
scene' - 1 freq
sammi - 1 freq
swanny - 4 freq
sem - 1 freq
snn - 1 freq
shoe-in - 1 freq
sayin’ - 1 freq
MetaPhone code - SM
some - 3192 freq
same - 1320 freq
seem - 299 freq
sma - 189 freq
sea-maw - 2 freq
'some - 7 freq
zombie - 6 freq
sum - 288 freq
somme - 10 freq
sam - 335 freq
some' - 7 freq
soom - 5 freq
sammy - 31 freq
'ssssssaaaaamy - 1 freq
saam - 2 freq
sma' - 11 freq
saim - 1 freq
sume - 2 freq
semi - 16 freq
symbo - 1 freq
zoom - 15 freq
simba - 1 freq
sim - 36 freq
smmaa - 1 freq
smaa - 96 freq
som' - 2 freq
sum' - 1 freq
'same - 3 freq
smaw - 36 freq
so'm - 1 freq
some- - 1 freq
sei'm - 2 freq
smoo - 1 freq
som - 2 freq
samba - 1 freq
sm - 4 freq
seam - 6 freq
sam' - 1 freq
zumba - 2 freq
saime - 14 freq
soum - 5 freq
syme - 15 freq
suomi - 2 freq
sie-maw - 1 freq
smue - 1 freq
'syme - 1 freq
xoom - 1 freq
cim - 5 freq
soume - 1 freq
seme - 1 freq
some - 1 freq
summa - 1 freq
some - 3 freq
same - 1 freq
saem - 4 freq
sommie - 1 freq
some - 10 freq
sæm - 4 freq
smou - 3 freq
smeu - 2 freq
cem - 1 freq
sime - 1 freq
xm - 1 freq
simmy - 1 freq
xoem - 1 freq
smo - 1 freq
hhsm - 1 freq
ssm - 1 freq
somewhy - 1 freq
smh - 3 freq
zma - 1 freq
zm - 1 freq
xxm - 1 freq
cym - 6 freq
sammi - 1 freq
sem - 1 freq
SIME
Time to execute Levenshtein function - 0.233644 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.383350 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028508 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.032153 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001132 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.