A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

Similar words to sunshine in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
sunshine (0) - 78 freq
sinshine (1) - 10 freq
sinhine (2) - 1 freq
outshine (2) - 1 freq
sunshein (2) - 1 freq
sinshene (2) - 1 freq
squashin (3) - 2 freq
susanne (3) - 1 freq
wushin (3) - 4 freq
cuisine (3) - 6 freq
sumhing (3) - 4 freq
suntie (3) - 6 freq
dushin (3) - 1 freq
sunshinekid (3) - 1 freq
hunsin (3) - 3 freq
sungkin (3) - 1 freq
smashin (3) - 58 freq
sensin (3) - 9 freq
sumhin (3) - 37 freq
sumthin (3) - 97 freq
munching (3) - 2 freq
susie (3) - 38 freq
punchin (3) - 13 freq
sanstane (3) - 1 freq
punshen (3) - 1 freq
Double Levenshtein
sunshine (0) - 78 freq
sinshine (1) - 10 freq
sunshein (2) - 1 freq
sinshene (2) - 1 freq
snashin (3) - 1 freq
sunsheeny (3) - 1 freq
sinhine (3) - 1 freq
sunsheen (3) - 6 freq
punshen (4) - 1 freq
sneeshin (4) - 8 freq
sleshin (4) - 1 freq
sanstane (4) - 1 freq
sinsyne (4) - 30 freq
squishin (4) - 1 freq
saunstane (4) - 1 freq
swishin (4) - 4 freq
sunsh- (4) - 1 freq
smushin (4) - 4 freq
sloshin (4) - 3 freq
smashin (4) - 58 freq
swushin (4) - 1 freq
snushan (4) - 2 freq
moonshine (4) - 1 freq
sinsheeny (4) - 1 freq
outshine (4) - 1 freq
SoundEx code - S525
sunshine - 78 freq
sinkin - 25 freq
singin - 278 freq
swingin - 26 freq
snowkin - 7 freq
sinsyne - 30 freq
smoking - 13 freq
smokin - 92 freq
sneezin - 27 freq
sneezing - 4 freq
singing - 29 freq
sensins - 1 freq
snoozin - 9 freq
sneckin - 7 freq
somecunt's - 1 freq
smashin - 58 freq
'somecunt - 1 freq
smugness - 2 freq
sensyne - 7 freq
scancin - 6 freq
shangshan - 2 freq
somecunt - 6 freq
smashing - 6 freq
some-cunt - 1 freq
sneakin - 6 freq
samson - 8 freq
sneeshin - 8 freq
smeekin - 2 freq
sing-sang - 4 freq
synsyne's - 1 freq
smokkin - 3 freq
sunsheen - 6 freq
smashinest - 1 freq
sensin - 9 freq
sunken - 4 freq
smookan - 2 freq
singan - 12 freq
singin' - 2 freq
sinkin' - 2 freq
smokin' - 4 freq
simson's - 1 freq
smakken - 1 freq
simson - 1 freq
simsen's - 1 freq
sumkeyn - 1 freq
smeakin - 1 freq
scansouns - 1 freq
sinking - 5 freq
smookin - 3 freq
smackin - 4 freq
snashin - 1 freq
swinging - 3 freq
sneexin - 1 freq
sangsmiths - 1 freq
snoggin - 2 freq
somchin - 1 freq
snogyin - 1 freq
skinkin - 1 freq
sing-sangie - 1 freq
snoozan - 1 freq
smokeen - 2 freq
sneakan - 2 freq
snokin - 2 freq
scancein - 1 freq
sing-sangy - 3 freq
singsangy - 1 freq
scansin - 1 freq
sunnschein - 2 freq
sweengan - 2 freq
swung'im - 1 freq
snushan - 2 freq
sinkan - 2 freq
swingan - 1 freq
smokan - 2 freq
sanson - 1 freq
sensan - 1 freq
sangmaakers - 1 freq
sungkin - 1 freq
singkin - 1 freq
swinkin - 1 freq
sin-sheen - 1 freq
samson's - 3 freq
sumkin - 1 freq
smockan - 1 freq
sneezan - 6 freq
sinseen - 1 freq
sinshene - 1 freq
singin't - 1 freq
snoukin-saats - 1 freq
sangsmith - 4 freq
smushin - 4 freq
snowcem- - 1 freq
smeikin - 1 freq
sinshine - 10 freq
sunsheeny - 1 freq
sinsheeny - 1 freq
'singin - 1 freq
sunshein - 1 freq
snakin - 2 freq
singsong - 1 freq
smeegin - 1 freq
swankin' - 1 freq
sneakin' - 1 freq
“smashin - 1 freq
snoozing - 1 freq
semi-sympathetic - 1 freq
sensing - 1 freq
sense-o-humour - 1 freq
sangam - 1 freq
synesen - 1 freq
sneckins - 1 freq
sweengin - 2 freq
‘sinsyne - 1 freq
‘smashing - 1 freq
‘smashin - 1 freq
semi-conscious - 1 freq
singjn' - 1 freq
shaunkennedy - 1 freq
smashin' - 1 freq
schmazing - 1 freq
semi-juniors - 1 freq
samheughan - 1 freq
smnzm - 1 freq
singin’ - 1 freq
sinkn - 1 freq
snewsma - 1 freq
samsung - 2 freq
samhowson - 1 freq
snsmw - 1 freq
sjnjmzhrfc - 1 freq
swingingthelead - 1 freq
swanswan - 1 freq
sneaking - 1 freq
snecking - 1 freq
sunshinekid - 1 freq
seanmcm - 1 freq
smoochin - 1 freq
seancampbell - 5 freq
sonnyswanson - 2 freq
shamshoum - 1 freq
snacking - 1 freq
syngenta - 1 freq
snowyscene - 1 freq
MetaPhone code - SNXN
sunshine - 78 freq
sneeshin - 8 freq
sunsheen - 6 freq
snashin - 1 freq
snushan - 2 freq
snaitchin - 1 freq
sin-sheen - 1 freq
sinshene - 1 freq
sinshine - 10 freq
sunsheeny - 1 freq
sinsheeny - 1 freq
sunshein - 1 freq
snatchan - 1 freq
snatchin - 1 freq
Manually curated
SUNSHINE
Time to execute Levenshtein function - 0.224109 milliseconds
The Levenshtein distance is the minimum number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
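As an illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming Levenshtein calculation. It is not the code this site runs, and the function name `levenshtein` is only an illustrative choice.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits turning a into b."""
    # prev[j] holds the distance between the processed prefix of a and b[:j].
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        curr = [i]  # distance from a[:i] to the empty string
        for j, cb in enumerate(b, 1):
            cost = 0 if ca == cb else 1
            curr.append(min(
                prev[j] + 1,         # delete ca
                curr[j - 1] + 1,     # insert cb
                prev[j - 1] + cost,  # replace ca with cb (or keep it)
            ))
        prev = curr
    return prev[-1]


print(levenshtein("sunshine", "sinshine"))
print(levenshtein("sunshine", "outshine"))
```

Run against the listing, this sketch agrees with the scores shown for sinshine (1), outshine (2) and smashin (3).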
Time to execute Double Levenshtein function - 0.368052 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
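The double-weighting idea can be sketched in Python as follows. This is a guess at the approach, not the site's own code: the names `double_levenshtein` and `strip_vowels` are hypothetical, and the vowel set `aeiouy` is an assumption (treating y as a vowel happens to match the score listed for sunsheeny).

```python
VOWELS = set("aeiouy")  # assumption: the site's actual vowel set is not stated


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance, computed row by row."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        curr = [i]
        for j, cb in enumerate(b, 1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost))
        prev = curr
    return prev[-1]


def strip_vowels(word: str) -> str:
    """Drop every vowel so only the consonant skeleton remains."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Full-word distance plus distance between the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("sunshine", "sinhine"))
print(double_levenshtein("sunshine", "outshine"))
```

Under these assumptions, a vowel-only change such as sinshine costs 1, while a consonant change is counted in both passes, which is why outshine moves from 2 in the plain list to 4 in the double list.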
Time to execute SoundEx function - 0.027272 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
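To show how words with different spellings end up under one code, here is a small Python sketch of the classic American Soundex rules (first letter kept, then up to three digits for consonant groups). It is an illustration only. The site's own implementation may differ in edge cases, and the function name `soundex` is just a label chosen here.

```python
def soundex(word: str) -> str:
    """Classic four-character Soundex code, e.g. a letter plus three digits."""
    groups = (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
              ("l", "4"), ("mn", "5"), ("r", "6"))
    codes = {ch: digit for letters, digit in groups for ch in letters}

    # Ignore apostrophes, hyphens and other non-letters.
    letters = [ch for ch in word.lower() if ch.isalpha()]
    if not letters:
        return ""

    result = letters[0].upper()
    prev = codes.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters with the same code
        code = codes.get(ch, "")
        if code and code != prev:
            result += code
        prev = code  # vowels reset prev, so a repeated code after a vowel counts
        if len(result) == 4:
            break
    return result.ljust(4, "0")  # pad short codes with zeros


print(soundex("sunshine"))
print(soundex("singin"))
```

With these rules sunshine, singin and smoking all reduce to S525, which is why the SoundEx list is so much longer and looser than the Levenshtein ones.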
Time to execute MetaPhone function - 0.036928 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000839 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
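A lookup of this kind can be pictured as a plain dictionary from spelling to lemma. The Python sketch below is purely illustrative: apart from sunshine mapping to SUNSHINE, which is the result shown above, the entries in `LEXICON` are hypothetical examples, and `lemma_for` is a made-up name.

```python
from typing import Optional

# Hypothetical hand-built lexicon: spelling variant -> lemma.
# Only the "sunshine" entry reflects the result shown on this page.
LEXICON = {
    "sunshine": "SUNSHINE",
    "sinshine": "SUNSHINE",  # illustrative entry
    "sunsheen": "SUNSHINE",  # illustrative entry
}


def lemma_for(word: str) -> Optional[str]:
    """Return the curated lemma, or None when the word is not covered."""
    return LEXICON.get(word.lower())


print(lemma_for("Sinshine"))
print(lemma_for("moonshine"))
```

A single dictionary lookup is consistent with this method reporting the shortest execution time of the five, at the cost of only covering words someone has entered by hand.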