A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

Words similar to "sunset" in the corpus

Methods: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated
Levenshtein - distance in brackets
sunset (0) - 28 freq
susset (1) - 1 freq
'sunset (1) - 1 freq
sunket (1) - 3 freq
sunsets (1) - 4 freq
kinset (2) - 1 freq
sunlit (2) - 2 freq
sunken (2) - 4 freq
sussex (2) - 1 freq
sinse (2) - 20 freq
sunkets (2) - 2 freq
hunsed (2) - 1 freq
suner (2) - 3 freq
suet (2) - 9 freq
unselt (2) - 1 freq
sune' (2) - 1 freq
sonse (2) - 5 freq
unser (2) - 3 freq
dunse (2) - 1 freq
suist (2) - 1 freq
sensit (2) - 1 freq
sune (2) - 82 freq
sense (2) - 527 freq
sunder (2) - 1 freq
‘sunset (2) - 2 freq
Double Levenshtein - distance in brackets
sunset (0) - 28 freq
sensit (2) - 1 freq
sunsoot (2) - 1 freq
sunsets (2) - 4 freq
sunket (2) - 3 freq
susset (2) - 1 freq
'sunset (2) - 1 freq
sensed (3) - 18 freq
sonnet (3) - 12 freq
suist (3) - 1 freq
senses (3) - 56 freq
soundet (3) - 12 freq
suns (3) - 15 freq
sonse (3) - 5 freq
sinses (3) - 2 freq
sense (3) - 527 freq
unst (3) - 12 freq
sinse (3) - 20 freq
sinsed (3) - 1 freq
sunlit (3) - 2 freq
kinset (3) - 1 freq
onset (3) - 7 freq
inset (3) - 2 freq
sneest (3) - 1 freq
suntee (3) - 1 freq
SoundEx code - S523
scansed - 8 freq
sunset - 28 freq
sensed - 18 freq
sneckit - 28 freq
sneezed - 10 freq
sneysters - 5 freq
sanctity - 2 freq
sinsed - 1 freq
smashed - 35 freq
sensation - 22 freq
saunstane - 1 freq
sanct - 45 freq
sconced - 2 freq
scanced - 10 freq
snashed - 11 freq
sanctioned - 7 freq
smoked - 24 freq
smaaest - 11 freq
smuist - 1 freq
somegate - 1 freq
seamstress - 1 freq
sancts - 4 freq
sneesht - 1 freq
smuchter - 1 freq
snokit - 2 freq
snecked - 6 freq
sneaked - 12 freq
sunsets - 4 freq
sangster's - 6 freq
sneistery - 1 freq
smeekit - 9 freq
singit - 1 freq
semi-steamin - 1 freq
smeakit - 1 freq
sanctuary - 11 freq
snugged - 1 freq
sensitive - 16 freq
sneists - 1 freq
sanctis - 4 freq
sanct's - 1 freq
sanctuarie - 2 freq
snaggit - 1 freq
snichtert - 1 freq
saimstress - 1 freq
snouked - 1 freq
sanct-aundraes - 1 freq
sinister-lookin - 1 freq
sinister - 10 freq
sanction - 4 freq
sanctuary' - 1 freq
snochtered - 4 freq
smuiked - 1 freq
skin-scowder - 1 freq
snaiked - 1 freq
sneck-drawer - 1 freq
smeukit - 1 freq
'sneck-drawin' - 1 freq
snakit - 3 freq
sun-stuffie - 1 freq
smokit - 6 freq
sanctum - 1 freq
sneeked - 1 freq
sunnyside - 1 freq
sneukit - 1 freq
snushed - 3 freq
sangsters - 13 freq
sneakit - 2 freq
sneakt - 1 freq
sanctimonious - 6 freq
skinniest - 1 freq
snogged - 2 freq
sneckt - 1 freq
swan-necked - 1 freq
smasht - 1 freq
snochters - 5 freq
snawstorm - 1 freq
smackit - 4 freq
seenister - 1 freq
smawest - 3 freq
sneestered - 1 freq
smokkit - 1 freq
snuggit - 2 freq
'sunset - 1 freq
singed - 3 freq
smacked - 7 freq
sunset's - 2 freq
sanstane - 1 freq
sennicht - 10 freq
smaa'est - 1 freq
snystie - 3 freq
snowked - 2 freq
snochterdichter - 1 freq
sneyster - 5 freq
sunket - 3 freq
sneist - 5 freq
skansed - 1 freq
sanctuar - 5 freq
sensautioun - 1 freq
sneckid - 1 freq
sensethe - 1 freq
snowkit - 1 freq
smooked - 1 freq
sneistie - 6 freq
smeegit - 3 freq
song-threeds - 1 freq
sneak'd - 1 freq
skenstid - 1 freq
skenstoft - 1 freq
sensitivity - 4 freq
snochter - 1 freq
sneisty - 1 freq
snasht - 1 freq
sneester - 3 freq
smeigit - 1 freq
snaikit - 2 freq
sunkets - 2 freq
snake-heidit - 1 freq
smash't - 1 freq
sneest - 1 freq
somegaits - 3 freq
sneistie-like - 2 freq
somegaits- - 1 freq
shihuangdi - 1 freq
sang-gaitherers - 1 freq
sang-gaitherer - 1 freq
sansgter - 1 freq
suinest - 2 freq
seanachaidh - 2 freq
seinister - 3 freq
scene-settin - 1 freq
smaest - 2 freq
snagged - 1 freq
smashet - 1 freq
sensationalist - 1 freq
sensit - 1 freq
sumgates - 1 freq
sangster - 24 freq
smuchtered - 1 freq
sangstress - 1 freq
skonsed - 1 freq
sinkit - 1 freq
snekkit - 4 freq
smouchterin - 1 freq
sneggit - 1 freq
skonced - 1 freq
snashit - 1 freq
sancte - 4 freq
sainctis - 1 freq
snochter-dichters - 1 freq
snowsuit - 1 freq
‘sunset - 2 freq
snaked - 2 freq
semester - 2 freq
smeeged - 3 freq
semi-hostile - 1 freq
‘sangsteris - 1 freq
smackheids - 1 freq
sensations - 3 freq
smackheid - 4 freq
sennichts - 2 freq
sun-god - 1 freq
snochtert - 1 freq
seamstresses - 1 freq
sunset--- - 1 freq
smuikit - 1 freq
swingset - 2 freq
sanctification - 1 freq
sneeshed - 1 freq
sensiteevity - 1 freq
sinister-luckin - 1 freq
somegait - 1 freq
scanct - 1 freq
sneesters - 1 freq
snecht - 1 freq
shamsatoo - 1 freq
sneysterin - 1 freq
snaw-coatit - 1 freq
sangster-sangscrievers - 1 freq
sensational - 7 freq
sunsoot - 1 freq
sowensauthor - 1 freq
smicht - 5 freq
samgoodman - 1 freq
soonest - 1 freq
symngtnbadyin - 1 freq
sxmsdpni - 1 freq
seemescotland - 5 freq
sneistin - 1 freq
sunniest - 1 freq
sungaets - 1 freq
sheenster - 150 freq
sheensterpriv - 1 freq
smxt - 1 freq
MetaPhone code - SNST
sunset - 28 freq
sensed - 18 freq
sneezed - 10 freq
sinsed - 1 freq
sunnyside - 1 freq
'sunset - 1 freq
snystie - 3 freq
sneist - 5 freq
sneistie - 6 freq
sneisty - 1 freq
sneest - 1 freq
suinest - 2 freq
sensit - 1 freq
snowsuit - 1 freq
‘sunset - 2 freq
sunset--- - 1 freq
sunsoot - 1 freq
soonest - 1 freq
sunniest - 1 freq
Manually curated - SUNSET
Time to execute Levenshtein function - 0.198622 milliseconds
The Levenshtein distance is the minimum number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
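As a rough illustration of the definition above, here is a minimal Python sketch of the classic dynamic-programming calculation. It is not the code this page runs; the function name `levenshtein` is chosen for the example, and it counts Unicode characters, which may differ from how the site counts.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits needed to turn a into b."""
    # previous[j] = distance between the processed prefix of a and b[:j]
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning a[:i] into the empty string takes i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (or keep it)
            ))
        previous = current
    return previous[-1]


for word in ("sunset", "susset", "sunsets", "sunlit", "suet"):
    print(word, levenshtein("sunset", word))
```

For these five words the sketch gives 0, 1, 1, 2 and 2, matching the distances shown in the Levenshtein list.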
Time to execute Double Levenshtein function - 0.336998 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
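The doubled calculation can be sketched in Python as follows. This is a guess at the approach, not the site's own code: the names `strip_vowels` and `double_levenshtein` are invented for the example, and treating only a, e, i, o, u as vowels is an assumption (it does reproduce the distances listed for sunset).

```python
VOWELS = set("aeiou")  # assumption: y and accented letters count as consonants


def levenshtein(a: str, b: str) -> int:
    """Plain edit distance, computed row by row."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop vowels, keeping consonants and punctuation in order."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on their consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


for word in ("sensit", "sunsets", "sonnet", "onset"):
    print(word, double_levenshtein("sunset", word))
```

Here sensit scores 2 because its consonant skeleton (snst) is identical to that of sunset, while sonnet scores 3: two edits on the full word plus one on the skeleton.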
Time to execute SoundEx function - 0.027427 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
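To show how words end up sharing a code such as S523, here is a minimal Python sketch of the standard American Soundex rules, followed by a small index keyed on the code. It is an illustration only; the site's own implementation may differ in detail, and the function name `soundex` and the sample word list are chosen for the example.

```python
from collections import defaultdict

# Letter groups and their Soundex digits; vowels, h, w and y carry no digit.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word: str) -> str:
    """Return the four-character Soundex code, or '' if there are no letters."""
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    result = letters[0].upper()
    last = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters with the same digit
        code = CODES.get(ch, "")
        if code and code != last:
            result += code
            if len(result) == 4:
                break
        last = code  # a vowel resets this, so a repeated digit counts again
    return result.ljust(4, "0")


# Group a few words from the list above by their code.
index = defaultdict(list)
for w in ("sunset", "sensed", "sneckit", "smashed", "sunlit"):
    index[soundex(w)].append(w)
print(dict(index))
```

The first four sample words share the code S523, while sunlit falls under S543 because of its l.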
Time to execute MetaPhone function - 0.037486 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000916 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
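A lookup of this kind can be sketched in Python as a plain dictionary. The entries below are hypothetical, made up for the example and not taken from the site's lexicon, as is the function name `lemma_for`.

```python
from typing import Optional

# Hypothetical lexicon entries: spelling variant -> lemma.
LEXICON = {
    "sunset": "SUNSET",
    "sunsets": "SUNSET",
    "'sunset": "SUNSET",
}


def lemma_for(word: str) -> Optional[str]:
    """Return the curated lemma, or None when the word is not covered."""
    return LEXICON.get(word.lower())


print(lemma_for("Sunsets"))  # a covered variant
print(lemma_for("sunlit"))   # not in this sample table, so None
```

A dictionary lookup takes constant time on average, which fits with this method having the shortest execution time listed above.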