A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie


Similar words to stem in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
Levenshtein (distance in brackets)
stem (0) - 18 freq
seem (1) - 404 freq
swem (1) - 3 freq
steg (1) - 1 freq
stee (1) - 1 freq
stey (1) - 170 freq
steh (1) - 1 freq
stam (1) - 2 freq
staem (1) - 1 freq
steem (1) - 13 freq
ste (1) - 41 freq
sem (1) - 1 freq
stei (1) - 6 freq
ster (1) - 3 freq
stew (1) - 51 freq
step (1) - 258 freq
stcem (1) - 1 freq
shem (1) - 3 freq
steam (1) - 77 freq
item (1) - 17 freq
saem (1) - 4 freq
stems (1) - 10 freq
steik (2) - 10 freq
kteu (2) - 2 freq
stj (2) - 1 freq
Double Levenshtein (distance in brackets)
stem (0) - 18 freq
steem (1) - 13 freq
steam (1) - 77 freq
staem (1) - 1 freq
stam (1) - 2 freq
stems (2) - 10 freq
saem (2) - 4 freq
stame (2) - 8 freq
shem (2) - 3 freq
esteim (2) - 1 freq
stoma (2) - 5 freq
styme (2) - 2 freq
stcem (2) - 1 freq
esteem (2) - 15 freq
stime (2) - 4 freq
steamy (2) - 5 freq
item (2) - 17 freq
stey (2) - 170 freq
steh (2) - 1 freq
step (2) - 258 freq
stee (2) - 1 freq
steg (2) - 1 freq
seem (2) - 404 freq
swem (2) - 3 freq
sem (2) - 1 freq
SoundEx code - S350
stane - 421 freq
sittin - 737 freq
shoutin - 137 freq
skytin - 15 freq
staun - 224 freq
stany - 5 freq
sudden - 213 freq
steen - 114 freq
staan - 13 freq
steam - 77 freq
settin - 144 freq
shuttin - 39 freq
steyin - 40 freq
shootin - 43 freq
scaudin - 4 freq
stymie - 2 freq
saitin - 3 freq
settan - 14 freq
sweatin - 16 freq
sydney - 7 freq
stem - 18 freq
stone - 85 freq
stowin - 6 freq
suttin - 81 freq
schtum - 2 freq
skitin - 23 freq
showtime - 1 freq
squattin - 1 freq
stewin - 4 freq
seatoun - 1 freq
sit-doun - 1 freq
stame - 8 freq
swaden - 1 freq
sweitin - 2 freq
shoudna - 6 freq
setten - 22 freq
sitten - 12 freq
shitin - 4 freq
skuddin - 1 freq
stayin - 22 freq
shoutin' - 2 freq
sittin' - 21 freq
sidden - 16 freq
scootin - 4 freq
shuidnae - 11 freq
sweetin - 3 freq
seethin - 3 freq
stan - 150 freq
steinway - 4 freq
staem - 1 freq
sodden - 11 freq
stoon - 9 freq
stein - 22 freq
showdin - 2 freq
staney - 6 freq
stam - 2 freq
scuddin - 5 freq
sidn - 1 freq
sheetin - 6 freq
steenie - 25 freq
stan¢ - 1 freq
satan - 24 freq
sodom - 9 freq
scythin - 2 freq
shoutan - 5 freq
seethan - 1 freq
sodium - 1 freq
shudnae - 30 freq
stowen - 3 freq
seton - 18 freq
sweden - 20 freq
soothin - 5 freq
sioatin' - 1 freq
settin' - 13 freq
steem - 13 freq
shotten - 4 freq
shouten - 2 freq
sheeten - 1 freq
seitten - 1 freq
skiddin - 4 freq
suddin - 1 freq
skidden - 1 freq
syden - 1 freq
staine - 2 freq
shidnae - 1 freq
sa'tin' - 1 freq
shutt'n - 1 freq
sit'n - 2 freq
sitt'n - 1 freq
stony - 4 freq
stan' - 8 freq
sedn - 1 freq
stine - 1 freq
stain - 26 freq
seatin - 4 freq
showdoon - 1 freq
shittin - 4 freq
satin - 12 freq
southan - 1 freq
steamy - 5 freq
stane-waa - 1 freq
shidna - 9 freq
sutten - 8 freq
steeny - 7 freq
stiyin - 1 freq
stawn - 4 freq
stehin - 1 freq
stowan - 1 freq
styin - 4 freq
sheddin - 5 freq
sithean - 1 freq
sidney - 23 freq
sidon - 11 freq
sïttin - 8 freq
stane' - 2 freq
sautin - 1 freq
skatin - 3 freq
'stan - 2 freq
soothin' - 1 freq
side-on - 1 freq
sweeten' - 1 freq
sit-in - 2 freq
saddam - 1 freq
steen' - 2 freq
shutten - 2 freq
steamie - 9 freq
sea-aeten - 1 freq
shuttan - 3 freq
sittan - 32 freq
skoitin - 1 freq
soodna - 7 freq
stown - 7 freq
shoodna - 3 freq
ston - 9 freq
stun - 2 freq
shuidna - 16 freq
skeetin - 1 freq
sweatan - 1 freq
shouteen - 1 freq
soddan - 1 freq
shaidin - 28 freq
sïxtaen - 1 freq
shuitin - 4 freq
staen - 2 freq
shuitten - 2 freq
shoudno - 2 freq
situn - 1 freq
stoma - 5 freq
sthaain - 1 freq
stöd'im - 1 freq
steyn - 8 freq
shut-doon - 1 freq
shadam - 1 freq
suden - 1 freq
staeyan - 1 freq
setteen - 2 freq
stayan - 1 freq
schotten - 2 freq
shiten - 1 freq
stoun - 2 freq
sheddan - 1 freq
soothan - 1 freq
seteen - 1 freq
sudna - 13 freq
stonn - 1 freq
stime - 4 freq
sateen - 1 freq
steyan - 2 freq
said-na - 1 freq
sittm - 1 freq
swaiden - 2 freq
styme - 2 freq
soudna - 2 freq
sweeten - 3 freq
schatten - 1 freq
seedin - 1 freq
sautan - 1 freq
saidna - 1 freq
stimna - 1 freq
suitin - 1 freq
swattin - 2 freq
swytin - 1 freq
'staun - 1 freq
swithin - 1 freq
‘staun - 1 freq
suidna - 3 freq
sawtan - 1 freq
stanie - 2 freq
stawen - 3 freq
stowein - 1 freq
swiytin - 1 freq
shootan - 5 freq
shudna - 4 freq
sýstem - 2 freq
shitein - 2 freq
staun' - 2 freq
shadno - 1 freq
stane- - 1 freq
stayin' - 1 freq
soddin' - 1 freq
suiden - 1 freq
squaattin - 1 freq
scoutin - 1 freq
shadin - 1 freq
“staan - 1 freq
sudn - 1 freq
styin' - 1 freq
staiyin - 2 freq
siden - 2 freq
‘siden - 1 freq
stenn - 1 freq
suidnae - 1 freq
seithin - 1 freq
shoudnae - 3 freq
sutton - 6 freq
skaitan - 1 freq
sittn - 1 freq
sitin - 1 freq
set-in - 1 freq
shitin' - 1 freq
stoney - 3 freq
scottm - 1 freq
seaton - 3 freq
shutdoon - 2 freq
setn - 1 freq
stane” - 1 freq
sjtnw - 1 freq
shoud'nae - 1 freq
settn - 1 freq
shoodnae - 1 freq
shutdown - 2 freq
scottewen - 1 freq
sudan - 1 freq
stayhome - 1 freq
MetaPhone code - STM
steam - 77 freq
stymie - 2 freq
stem - 18 freq
stame - 8 freq
wyssdom - 1 freq
staem - 1 freq
stam - 2 freq
sodom - 9 freq
sodium - 1 freq
steem - 13 freq
steamy - 5 freq
saddam - 1 freq
steamie - 9 freq
stoma - 5 freq
wísdom - 1 freq
stime - 4 freq
sittm - 1 freq
styme - 2 freq
xtmy - 1 freq
Manually curated - STEM
Time to execute Levenshtein function - 0.201698 milliseconds
The Levenshtein distance is the minimum number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
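The definition above can be sketched in a few lines of Python. This is a minimal illustration using the standard dynamic-programming approach, not the code that produced the timings on this page; the function name `levenshtein` is chosen here for illustration.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits turning a into b."""
    # previous[j] = distance between the prefix of a processed so far
    # and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning i characters of a into "" takes i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (or keep it)
            ))
        previous = current
    return previous[-1]


if __name__ == "__main__":
    for word in ("stem", "seem", "steam", "steik"):
        print(word, levenshtein("stem", word))
```

Run against the search word stem, this gives the same bracketed distances as the Levenshtein list: 0 for stem, 1 for seem and steam, 2 for steik.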
Time to execute Double Levenshtein function - 0.315445 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the whole word and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
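A rough sketch of the doubled measure, under stated assumptions: the vowel set (a, e, i, o, u and y) is a guess, chosen because it reproduces the distances listed for stem, and the names `strip_vowels` and `double_levenshtein` are illustrative rather than taken from the site's code.

```python
VOWELS = set("aeiouy")  # assumption: y counts as a vowel; accented vowels are not handled


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance (insert, delete, replace each cost 1)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Keep only the characters that are not in VOWELS."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on their consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


if __name__ == "__main__":
    for word in ("steem", "seem", "steamy", "item"):
        print(word, double_levenshtein("stem", word))
```

Words that differ from stem only in their vowels (steem, steam, stam) keep a low score, while a changed consonant (seem, shem, item) is counted twice.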
Time to execute SoundEx function - 0.031381 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
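As an illustration, here is a minimal sketch of the classic American Soundex rules: keep the first letter, map the remaining consonants to digits, collapse neighbouring letters with the same digit, and pad to four characters. It assumes plain a-z input and silently drops any other character (apostrophes, hyphens, accented letters), so it may not agree with the site's function on such words.

```python
# Consonant groups and their Soundex digits.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word: str) -> str:
    """Return the 4-character Soundex code, or '' if the word has no a-z letters."""
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    code = letters[0].upper()
    last = CODES.get(letters[0], "")  # digit of the previous coded letter
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters that share a digit
        digit = CODES.get(ch, "")  # vowels and y have no digit
        if digit and digit != last:
            code += digit
        last = digit  # a vowel resets this, so a repeated digit after it counts
    return (code + "000")[:4]


if __name__ == "__main__":
    for word in ("stem", "stane", "sittin", "sodom"):
        print(word, soundex(word))
```

All four example words come out as S350, matching the SoundEx code shown for stem, which is why such different-looking words land in the same list.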
Time to execute MetaPhone function - 0.049627 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.001154 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
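In code, a lookup of this kind amounts to a dictionary from word forms to lemmas. The sketch below is purely illustrative: the table name `LEMMA_TABLE`, the function `lookup_lemma` and the entry for stems are made up here; only the stem to STEM pairing appears on this page.

```python
from typing import Optional

# Hypothetical hand-built lexicon: word form -> lemma.
# Only "stem" -> "STEM" is taken from this page; "stems" is an invented example.
LEMMA_TABLE = {
    "stem": "STEM",
    "stems": "STEM",
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the curated lemma for a word, or None if the word is not covered."""
    return LEMMA_TABLE.get(word.strip().lower())


if __name__ == "__main__":
    print(lookup_lemma("stem"))    # covered word
    print(lookup_lemma("sonsie"))  # not in this toy table, so None
```

A dictionary lookup does no per-word computation, which fits with this method having the shortest execution time of the five.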