A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to seton in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
seton (0) - 18 freq
eton (1) - 7 freq
settn (1) - 1 freq
ston (1) - 9 freq
seaton (1) - 3 freq
seto (1) - 1 freq
setn (1) - 1 freq
aetan (2) - 3 freq
afton (2) - 3 freq
soon (2) - 668 freq
teuton (2) - 2 freq
weston (2) - 7 freq
schon (2) - 1 freq
sitin (2) - 1 freq
stron (2) - 1 freq
senten (2) - 1 freq
getn (2) - 1 freq
secin (2) - 1 freq
sextin (2) - 1 freq
renton (2) - 4 freq
luton (2) - 2 freq
season (2) - 118 freq
sewin (2) - 14 freq
seiin (2) - 2 freq
saxon (2) - 6 freq
seton (0) - 18 freq
setn (1) - 1 freq
seaton (1) - 3 freq
ston (1) - 9 freq
aston (2) - 4 freq
sitin (2) - 1 freq
stony (2) - 3 freq
stoon (2) - 9 freq
stan (2) - 149 freq
stun (2) - 2 freq
seatin (2) - 4 freq
seatoun (2) - 1 freq
satin (2) - 12 freq
seto (2) - 1 freq
settn (2) - 1 freq
eton (2) - 7 freq
situn (2) - 1 freq
seteen (2) - 1 freq
stoun (2) - 2 freq
satan (2) - 24 freq
stone (2) - 83 freq
steyan (3) - 2 freq
aetin (3) - 27 freq
serten (3) - 4 freq
spon (3) - 1 freq
SoundEx code - S350
stane - 414 freq
sittin - 721 freq
shoutin - 134 freq
skytin - 15 freq
staun - 216 freq
stany - 5 freq
sudden - 210 freq
steen - 114 freq
staan - 13 freq
steam - 77 freq
settin - 142 freq
shuttin - 39 freq
steyin - 40 freq
shootin - 43 freq
scaudin - 4 freq
stymie - 2 freq
saitin - 3 freq
settan - 14 freq
sweatin - 15 freq
sydney - 7 freq
stem - 18 freq
stone - 83 freq
stowin - 6 freq
suttin - 81 freq
schtum - 2 freq
skitin - 22 freq
showtime - 1 freq
squattin - 1 freq
stewin - 4 freq
seatoun - 1 freq
sit-doun - 1 freq
stame - 8 freq
swaden - 1 freq
sweitin - 2 freq
shoudna - 6 freq
setten - 22 freq
sitten - 12 freq
shitin - 4 freq
skuddin - 1 freq
stayin - 20 freq
shoutin' - 2 freq
sittin' - 19 freq
sidden - 16 freq
scootin - 4 freq
shuidnae - 11 freq
sweetin - 3 freq
seethin - 3 freq
stan - 149 freq
steinway - 4 freq
staem - 1 freq
sodden - 11 freq
stoon - 9 freq
stein - 22 freq
showdin - 2 freq
staney - 6 freq
stam - 2 freq
scuddin - 4 freq
sidn - 1 freq
sheetin - 6 freq
steenie - 25 freq
stan¢ - 1 freq
satan - 24 freq
sodom - 9 freq
scythin - 2 freq
shoutan - 5 freq
seethan - 1 freq
sodium - 1 freq
shudnae - 30 freq
stowen - 3 freq
seton - 18 freq
sweden - 19 freq
soothin - 3 freq
sioatin' - 1 freq
settin' - 12 freq
steem - 13 freq
shotten - 4 freq
shouten - 2 freq
sheeten - 1 freq
seitten - 1 freq
skiddin - 4 freq
suddin - 1 freq
skidden - 1 freq
syden - 1 freq
staine - 2 freq
shidnae - 1 freq
sa'tin' - 1 freq
sedn - 1 freq
stine - 1 freq
stain - 26 freq
seatin - 4 freq
showdoon - 1 freq
shittin - 4 freq
satin - 12 freq
southan - 1 freq
steamy - 5 freq
stane-waa - 1 freq
shidna - 9 freq
sutten - 8 freq
steeny - 7 freq
stiyin - 1 freq
stawn - 4 freq
stehin - 1 freq
stowan - 1 freq
styin - 4 freq
sheddin - 5 freq
sithean - 1 freq
sidney - 23 freq
sidon - 11 freq
sïttin - 8 freq
stane' - 2 freq
sautin - 1 freq
skatin - 3 freq
stan' - 6 freq
'stan - 2 freq
soothin' - 1 freq
side-on - 1 freq
sweeten' - 1 freq
sit-in - 2 freq
saddam - 1 freq
steen' - 2 freq
shutten - 2 freq
steamie - 9 freq
sea-aeten - 1 freq
shuttan - 3 freq
sittan - 32 freq
skoitin - 1 freq
soodna - 7 freq
stown - 7 freq
shoodna - 3 freq
ston - 9 freq
stun - 2 freq
shuidna - 16 freq
skeetin - 1 freq
sweatan - 1 freq
shouteen - 1 freq
soddan - 1 freq
shaidin - 28 freq
sïxtaen - 1 freq
shuitin - 4 freq
staen - 2 freq
shuitten - 2 freq
shoudno - 2 freq
situn - 1 freq
stoma - 5 freq
sthaain - 1 freq
stöd'im - 1 freq
steyn - 8 freq
stony - 3 freq
shut-doon - 1 freq
shadam - 1 freq
suden - 1 freq
staeyan - 1 freq
setteen - 2 freq
stayan - 1 freq
schotten - 2 freq
shiten - 1 freq
stoun - 2 freq
sheddan - 1 freq
soothan - 1 freq
seteen - 1 freq
sudna - 13 freq
stonn - 1 freq
stime - 4 freq
sateen - 1 freq
steyan - 2 freq
said-na - 1 freq
sittm - 1 freq
swaiden - 2 freq
styme - 2 freq
soudna - 2 freq
sweeten - 3 freq
schatten - 1 freq
seedin - 1 freq
sautan - 1 freq
saidna - 1 freq
stimna - 1 freq
suitin - 1 freq
swattin - 2 freq
swytin - 1 freq
'staun - 1 freq
swithin - 1 freq
€˜staun - 1 freq
suidna - 3 freq
sawtan - 1 freq
stanie - 2 freq
stawen - 3 freq
stowein - 1 freq
swiytin - 1 freq
shootan - 5 freq
shudna - 4 freq
sýstem - 2 freq
shitein - 2 freq
staun' - 2 freq
shadno - 1 freq
stane- - 1 freq
stayin' - 1 freq
soddin' - 1 freq
suiden - 1 freq
squaattin - 1 freq
scoutin - 1 freq
shadin - 1 freq
€œstaan - 1 freq
sudn - 1 freq
styin' - 1 freq
staiyin - 2 freq
siden - 2 freq
€˜siden - 1 freq
stenn - 1 freq
suidnae - 1 freq
seithin - 1 freq
shoudnae - 3 freq
sutton - 6 freq
skaitan - 1 freq
sittn - 1 freq
sitin - 1 freq
set-in - 1 freq
shitin' - 1 freq
stoney - 3 freq
scottm - 1 freq
seaton - 3 freq
shutdoon - 2 freq
setn - 1 freq
stane” - 1 freq
sjtnw - 1 freq
shoud'nae - 1 freq
settn - 1 freq
shoodnae - 1 freq
shutdown - 2 freq
scottewen - 1 freq
sudan - 1 freq
stayhome - 1 freq
MetaPhone code - STN
stane - 414 freq
sittin - 721 freq
staun - 216 freq
stany - 5 freq
sudden - 210 freq
steen - 114 freq
staan - 13 freq
settin - 142 freq
hsten - 1 freq
saitin - 3 freq
settan - 14 freq
sydney - 7 freq
stone - 83 freq
suttin - 81 freq
seatoun - 1 freq
setten - 22 freq
sitten - 12 freq
sittin' - 19 freq
sidden - 16 freq
cidna - 2 freq
wystin - 2 freq
stan - 149 freq
cidnae - 2 freq
sodden - 11 freq
stoon - 9 freq
stein - 22 freq
staney - 6 freq
sidn - 1 freq
steenie - 25 freq
stan¢ - 1 freq
satan - 24 freq
seton - 18 freq
settin' - 12 freq
seitten - 1 freq
suddin - 1 freq
syden - 1 freq
staine - 2 freq
sa'tin' - 1 freq
sedn - 1 freq
stine - 1 freq
stain - 26 freq
seatin - 4 freq
satin - 12 freq
sutten - 8 freq
steeny - 7 freq
stawn - 4 freq
sidney - 23 freq
sidon - 11 freq
sïttin - 8 freq
stane' - 2 freq
sautin - 1 freq
stan' - 6 freq
'stan - 2 freq
side-on - 1 freq
sit-in - 2 freq
steen' - 2 freq
sea-aeten - 1 freq
sittan - 32 freq
soodna - 7 freq
stown - 7 freq
ston - 9 freq
stun - 2 freq
soddan - 1 freq
staen - 2 freq
situn - 1 freq
steyn - 8 freq
stony - 3 freq
suden - 1 freq
setteen - 2 freq
stoun - 2 freq
seteen - 1 freq
sudna - 13 freq
stonn - 1 freq
sateen - 1 freq
said-na - 1 freq
soudna - 2 freq
seedin - 1 freq
sautan - 1 freq
saidna - 1 freq
cydonia - 2 freq
cidni - 1 freq
suitin - 1 freq
swytin - 1 freq
'staun - 1 freq
€˜staun - 1 freq
suidna - 3 freq
sawtan - 1 freq
stanie - 2 freq
staun' - 2 freq
stane- - 1 freq
soddin' - 1 freq
suiden - 1 freq
€œstaan - 1 freq
sudn - 1 freq
siden - 2 freq
€˜siden - 1 freq
citin - 1 freq
stenn - 1 freq
suidnae - 1 freq
sutton - 6 freq
sittn - 1 freq
sitin - 1 freq
set-in - 1 freq
stoney - 3 freq
seaton - 3 freq
setn - 1 freq
stane” - 1 freq
settn - 1 freq
sudan - 1 freq
SETON
Time to execute Levenshtein function - 0.183219 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.346543 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027280 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.036568 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000830 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.