A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to stolen in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
stolen (0) - 31 freq
stole (1) - 52 freq
stowen (1) - 3 freq
storin (2) - 1 freq
stoker (2) - 4 freq
stawen (2) - 3 freq
stopeen (2) - 1 freq
stiven (2) - 4 freq
stoney (2) - 3 freq
storey (2) - 4 freq
soley (2) - 2 freq
stowe (2) - 3 freq
stower (2) - 1 freq
store (2) - 79 freq
stoppen (2) - 1 freq
stones (2) - 29 freq
sailen (2) - 2 freq
token (2) - 12 freq
strlin (2) - 1 freq
toley (2) - 1 freq
styles (2) - 14 freq
stope (2) - 4 freq
stowes (2) - 1 freq
stoke (2) - 6 freq
swollen (2) - 12 freq
stolen (0) - 31 freq
stowen (2) - 3 freq
stalin (2) - 10 freq
stole (2) - 52 freq
stollin (3) - 1 freq
stele (3) - 1 freq
stile (3) - 19 freq
steven (3) - 36 freq
stowin (3) - 6 freq
soolin (3) - 2 freq
styled (3) - 1 freq
staelin (3) - 1 freq
stone (3) - 85 freq
stowein (3) - 1 freq
style' (3) - 2 freq
staen (3) - 2 freq
solan (3) - 7 freq
stirlen (3) - 2 freq
style (3) - 158 freq
ston (3) - 9 freq
stealin (3) - 34 freq
setlin (3) - 1 freq
stonn (3) - 1 freq
streen (3) - 11 freq
stoun (3) - 2 freq
SoundEx code - S345
settlin - 38 freq
stolen - 31 freq
scotland - 2210 freq
scotland's - 165 freq
scotland' - 13 freq
scotlan - 63 freq
'stealin - 1 freq
stealin - 34 freq
stealing - 3 freq
stowlins - 1 freq
shetland - 288 freq
'scotland - 3 freq
sidelins - 4 freq
stellin - 14 freq
stallion - 11 freq
sidelines - 2 freq
scuttlin - 6 freq
soothlauns - 1 freq
scotlaun - 1 freq
stalin - 10 freq
stalemated - 1 freq
skiddlin - 2 freq
scuddln - 1 freq
scotlann - 1 freq
sattlin - 4 freq
scotlandas - 1 freq
'scotland's - 3 freq
shetlan - 12 freq
stillin - 3 freq
stalin's - 1 freq
sidlin - 1 freq
setlin - 1 freq
shaidaelaund - 1 freq
shetlandic - 19 freq
shaetlan - 284 freq
swittlin - 5 freq
shetlander - 4 freq
scoatlan - 12 freq
suthlins - 1 freq
soitlin - 1 freq
settlement - 17 freq
scuttlan - 1 freq
scotlan' - 9 freq
settlements - 4 freq
stalemate - 2 freq
scotlan'an' - 1 freq
shetland's - 2 freq
sjetlin's - 1 freq
sjetlin - 2 freq
'sjetlandsk' - 1 freq
'sjetlandsøyene - 1 freq
staelin - 1 freq
shotlandskaya - 1 freq
shetlanders - 10 freq
'shetlandic - 1 freq
stillness - 9 freq
saddling - 1 freq
settlan - 1 freq
shaetlan-grammar-dictionary - 2 freq
shaetland - 2 freq
shetlands - 6 freq
scotlandonline - 1 freq
'scotland' - 1 freq
sattlement - 2 freq
seedlin - 2 freq
stealin't - 1 freq
sjaetlin - 1 freq
soothland - 2 freq
scotlandunhinged - 1 freq
€œscotland - 4 freq
scotlan's - 1 freq
scotlang - 5 freq
seedling - 3 freq
stollin - 1 freq
€˜scotland - 1 freq
scuddlin - 1 freq
sidelined - 1 freq
scuttling - 1 freq
setttlin - 1 freq
stiling - 1 freq
sooth-laund - 1 freq
scoatlann - 1 freq
€˜shaetlan - 3 freq
€˜shetlandic - 1 freq
shaetlandisms - 1 freq
€˜shetland - 3 freq
scootie-allan - 1 freq
€˜scotlan - 1 freq
scotlands - 2 freq
shætlan - 8 freq
shetlan' - 1 freq
shatland - 1 freq
scotlandvotes - 1 freq
squattlin - 1 freq
sidling - 1 freq
scotland-specific - 3 freq
€˜scotland-specific - 1 freq
shetlandified - 1 freq
settlin' - 3 freq
stallions - 1 freq
scotlandÂ’s - 4 freq
scotlandteam - 3 freq
scotlandnt - 26 freq
scotlandsky - 4 freq
settling - 2 freq
scotlanddddd - 1 freq
stelment - 1 freq
scotlandshiregb - 1 freq
shetlandcat - 2 freq
shetlandwoolwk - 1 freq
shetlandhjarta - 3 freq
shetlandlibrary - 8 freq
shetlandbirds - 2 freq
shetlandpupsÂ… - 1 freq
shetlandsummer - 1 freq
shetlandÂ’s - 1 freq
shetlanders- - 1 freq
shetlandbirdclub - 1 freq
shetlandwild - 1 freq
shetlnad - 1 freq
shetlanddialect - 1 freq
shetlandweather - 1 freq
'shetland - 1 freq
shetlandarts - 1 freq
scotland'll - 1 freq
scotlandloveslanguages - 3 freq
scotlandspeople - 1 freq
scotlandsnumber - 7 freq
scotlandinkidsbooks - 1 freq
shetlandnature - 10 freq
shetlandstuart - 1 freq
shetland' - 1 freq
shetlanderforlife - 1 freq
scotlandislife - 1 freq
scotlandsunico - 1 freq
MetaPhone code - STLN
settlin - 38 freq
stolen - 31 freq
'stealin - 1 freq
stealin - 34 freq
stellin - 14 freq
stallion - 11 freq
stalin - 10 freq
sattlin - 4 freq
stillin - 3 freq
sidlin - 1 freq
setlin - 1 freq
soitlin - 1 freq
staelin - 1 freq
settlan - 1 freq
seedlin - 2 freq
stollin - 1 freq
setttlin - 1 freq
settlin' - 3 freq
STOLEN
Time to execute Levenshtein function - 0.188816 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.351326 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028411 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037739 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000910 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.