A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find its nearest neighbouring words, for example "ablow".


Words similar to "sensation" in the corpus

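Results are listed by method, in this order: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated. Each entry gives the word, its distance in parentheses (for the two Levenshtein methods) and its frequency in the corpus.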
Levenshtein
sensation (0) - 22 freq
sensations (1) - 3 freq
sensautioun (2) - 1 freq
sensational (2) - 7 freq
stervation (3) - 5 freq
seatin (3) - 4 freq
elation (3) - 3 freq
equation (3) - 5 freq
secretion (3) - 1 freq
rendition (3) - 5 freq
seituation (3) - 6 freq
donation (3) - 12 freq
seaton (3) - 3 freq
session (3) - 81 freq
sensitive (3) - 16 freq
seduction (3) - 2 freq
generation (3) - 99 freq
sanction (3) - 4 freq
sesssion (3) - 1 freq
section (3) - 76 freq
sanitation (3) - 4 freq
summation (3) - 1 freq
deflation (3) - 1 freq
senario (3) - 1 freq
pension (3) - 54 freq
Double Levenshtein
sensation (0) - 22 freq
sensautioun (2) - 1 freq
sensations (2) - 3 freq
sensational (3) - 7 freq
sneistin (4) - 1 freq
sensitive (4) - 16 freq
sensan (4) - 1 freq
sensin (4) - 9 freq
sanitation (4) - 4 freq
sanction (4) - 4 freq
survation (5) - 4 freq
santin (5) - 1 freq
emendation (5) - 2 freq
separation (5) - 5 freq
sunshein (5) - 1 freq
mention (5) - 144 freq
station (5) - 139 freq
senator (5) - 2 freq
sanson (5) - 1 freq
sanstane (5) - 1 freq
fantation (5) - 2 freq
tension (5) - 32 freq
selection (5) - 28 freq
sebastian (5) - 10 freq
sensethe (5) - 1 freq
SoundEx code - S523
scansed - 8 freq
sunset - 28 freq
sensed - 18 freq
sneckit - 28 freq
sneezed - 10 freq
sneysters - 5 freq
sanctity - 2 freq
sinsed - 1 freq
smashed - 35 freq
sensation - 22 freq
saunstane - 1 freq
sanct - 45 freq
sconced - 2 freq
scanced - 10 freq
snashed - 11 freq
sanctioned - 7 freq
smoked - 24 freq
smaaest - 11 freq
smuist - 1 freq
somegate - 1 freq
seamstress - 1 freq
sancts - 4 freq
sneesht - 1 freq
smuchter - 1 freq
snokit - 2 freq
snecked - 6 freq
sneaked - 12 freq
sunsets - 4 freq
sangster's - 6 freq
sneistery - 1 freq
smeekit - 9 freq
singit - 1 freq
semi-steamin - 1 freq
smeakit - 1 freq
sanctuary - 11 freq
snugged - 1 freq
sensitive - 16 freq
sneists - 1 freq
sanctis - 4 freq
sanct's - 1 freq
sanctuarie - 2 freq
snaggit - 1 freq
snichtert - 1 freq
sanct-aundraes - 1 freq
sinister-lookin - 1 freq
sinister - 10 freq
sanction - 4 freq
sanctuary' - 1 freq
snochtered - 4 freq
smuiked - 1 freq
skin-scowder - 1 freq
snaiked - 1 freq
sneck-drawer - 1 freq
smeukit - 1 freq
'sneck-drawin' - 1 freq
snakit - 3 freq
sun-stuffie - 1 freq
smokit - 6 freq
sanctum - 1 freq
sneeked - 1 freq
sunnyside - 1 freq
sneukit - 1 freq
snushed - 3 freq
sangsters - 13 freq
sneakit - 2 freq
sneakt - 1 freq
sanctimonious - 6 freq
skinniest - 1 freq
snogged - 2 freq
sneckt - 1 freq
swan-necked - 1 freq
smasht - 1 freq
snochters - 5 freq
snawstorm - 1 freq
smackit - 4 freq
seenister - 1 freq
smawest - 3 freq
sneestered - 1 freq
smokkit - 1 freq
snuggit - 2 freq
'sunset - 1 freq
singed - 3 freq
smacked - 7 freq
sunset's - 2 freq
sanstane - 1 freq
sennicht - 10 freq
smaa'est - 1 freq
snystie - 3 freq
snowked - 2 freq
snochterdichter - 1 freq
sneyster - 5 freq
sunket - 3 freq
sneist - 5 freq
skansed - 1 freq
sanctuar - 5 freq
sensautioun - 1 freq
sneckid - 1 freq
sensethe - 1 freq
snowkit - 1 freq
smooked - 1 freq
sneistie - 6 freq
smeegit - 3 freq
song-threeds - 1 freq
sneak'd - 1 freq
skenstid - 1 freq
skenstoft - 1 freq
sensitivity - 4 freq
snochter - 1 freq
sneisty - 1 freq
snasht - 1 freq
sneester - 3 freq
smeigit - 1 freq
snaikit - 2 freq
sunkets - 2 freq
snake-heidit - 1 freq
smash't - 1 freq
sneest - 1 freq
somegaits - 3 freq
sneistie-like - 2 freq
somegaits- - 1 freq
shihuangdi - 1 freq
sang-gaitherers - 1 freq
sang-gaitherer - 1 freq
sansgter - 1 freq
suinest - 2 freq
seanachaidh - 2 freq
seinister - 3 freq
scene-settin - 1 freq
smaest - 2 freq
snagged - 1 freq
smashet - 1 freq
sensationalist - 1 freq
sensit - 1 freq
sumgates - 1 freq
sangster - 24 freq
smuchtered - 1 freq
sangstress - 1 freq
skonsed - 1 freq
sinkit - 1 freq
snekkit - 4 freq
smouchterin - 1 freq
sneggit - 1 freq
skonced - 1 freq
snashit - 1 freq
sancte - 4 freq
sainctis - 1 freq
snochter-dichters - 1 freq
snowsuit - 1 freq
‘sunset - 2 freq
snaked - 2 freq
semester - 2 freq
smeeged - 3 freq
semi-hostile - 1 freq
‘sangsteris - 1 freq
smackheids - 1 freq
sensations - 3 freq
smackheid - 4 freq
sennichts - 2 freq
sun-god - 1 freq
snochtert - 1 freq
seamstresses - 1 freq
sunset--- - 1 freq
smuikit - 1 freq
swingset - 2 freq
sanctification - 1 freq
sneeshed - 1 freq
sensiteevity - 1 freq
sinister-luckin - 1 freq
somegait - 1 freq
scanct - 1 freq
sneesters - 1 freq
snecht - 1 freq
shamsatoo - 1 freq
sneysterin - 1 freq
snaw-coatit - 1 freq
sangster-sangscrievers - 1 freq
sensational - 7 freq
sunsoot - 1 freq
sowensauthor - 1 freq
smicht - 5 freq
samgoodman - 1 freq
soonest - 1 freq
symngtnbadyin - 1 freq
sxmsdpni - 1 freq
seemescotland - 5 freq
sneistin - 1 freq
sunniest - 1 freq
sungaets - 1 freq
sheenster - 150 freq
sheensterpriv - 1 freq
smxt - 1 freq
MetaPhone code - SNSXN
sensation - 22 freq
sensautioun - 1 freq
sunnschein - 2 freq
Manually curated - SENSATION
Time to execute Levenshtein function - 0.326703 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
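To make the definition concrete, here is a minimal Python sketch of the standard dynamic-programming calculation. It is an illustration only, not the site's own code; the function name `levenshtein` is chosen here for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Count the single-character edits needed to turn a into b."""
    # previous[j] is the distance between the prefix of a handled so far
    # (initially empty) and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning i characters of a into "" takes i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb, or keep it
            ))
        previous = current
    return previous[-1]


print(levenshtein("sensation", "sensations"))   # 1
print(levenshtein("sensation", "sensautioun"))  # 2
print(levenshtein("sensation", "session"))      # 3
```

These values agree with the distances shown in the Levenshtein results for "sensation".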
Time to execute Double Levenshtein function - 0.661610 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
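The idea can be sketched in a few lines of Python. This is a guess at the approach rather than the site's implementation: the names `double_levenshtein` and `strip_vowels` are made up for the example, and treating only a, e, i, o and u as vowels is an assumption (the page does not say how y is handled).

```python
def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance (insert, delete, replace)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


VOWELS = set("aeiou")  # assumption: y counts as a consonant


def strip_vowels(word: str) -> str:
    """Keep only the consonant skeleton of a word."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the vowel-less words."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("sensation", "sensautioun"))  # 2
print(double_levenshtein("sensation", "sensitive"))    # 4
```

Under these assumptions "sensautioun" scores 2 because its consonant skeleton is identical to that of "sensation", so only the two inserted vowels count. Both values match the Double Levenshtein results listed for "sensation".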
Time to execute SoundEx function - 0.027482 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
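As an illustration, the following Python sketch implements the common American Soundex rules: keep the first letter, map the remaining consonants to digits, collapse adjacent letters with the same digit, and pad to four characters. It is not taken from the site, and the name `soundex` and the handling of hyphens and apostrophes (simply ignored) are assumptions made for the example.

```python
# Digit assigned to each consonant group; vowels, h, w and y have no digit.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for letter in letters:
        CODES[letter] = digit


def soundex(word: str) -> str:
    """Return the four-character Soundex code, or "" if there are no letters."""
    # Assumption: anything that is not an ASCII letter is ignored.
    letters = [c for c in word.lower() if c.isascii() and c.isalpha()]
    if not letters:
        return ""
    code = letters[0].upper()
    last = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters with the same digit
        digit = CODES.get(ch, "")
        if digit and digit != last:
            code += digit
            if len(code) == 4:
                break
        last = digit  # a vowel resets this, so a repeated digit counts again
    return code.ljust(4, "0")


for word in ("sensation", "sunset", "smashed", "sneck-drawer"):
    print(word, soundex(word))  # each prints S523
```

Words that share a code, such as "sensation" and "sunset", are treated as neighbours even though they have little else in common, which is why the SoundEx list is so much longer than the others.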
Time to execute MetaPhone function - 0.072405 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.000876 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
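A lookup of this kind might be sketched in Python as below. The table contents are hypothetical and exist only for illustration (the page confirms only that "sensation" resolves to SENSATION); the names `LEXICON` and `lemma_for` are likewise invented for the example.

```python
from typing import Optional

# Hypothetical hand-built lexicon: spelling variant -> lemma.
# These entries are illustrative, not the site's actual table.
LEXICON = {
    "sensation": "SENSATION",
    "sensations": "SENSATION",
    "sensautioun": "SENSATION",
}


def lemma_for(word: str) -> Optional[str]:
    """Return the curated lemma, or None when the word is not covered."""
    return LEXICON.get(word.lower())


print(lemma_for("sensation"))  # SENSATION
print(lemma_for("ablow"))      # None, because this sketch has no entry for it
```

A single dictionary lookup does no per-character work, which is consistent with this method having the shortest execution time reported above.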