A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to smoking in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
smoking (0) - 13 freq
sooking (1) - 3 freq
smokin (1) - 92 freq
smokin' (1) - 4 freq
smorning (2) - 1 freq
smokan (2) - 2 freq
seeking (2) - 6 freq
smiling (2) - 16 freq
choking (2) - 4 freq
stocking (2) - 5 freq
mocking (2) - 1 freq
smokies (2) - 1 freq
slowing (2) - 2 freq
joking (2) - 1 freq
storing (2) - 2 freq
sookin' (2) - 1 freq
snowing (2) - 4 freq
making (2) - 57 freq
smokit (2) - 6 freq
shoving (2) - 2 freq
yoking (2) - 2 freq
sacking (2) - 2 freq
sookins (2) - 1 freq
fooking (2) - 1 freq
smorin (2) - 5 freq
smoking (0) - 13 freq
smokin (2) - 92 freq
smokin' (2) - 4 freq
sooking (2) - 3 freq
spiking (3) - 9 freq
smookin (3) - 3 freq
sooming (3) - 2 freq
shaking (3) - 8 freq
sucking (3) - 5 freq
sacking (3) - 2 freq
sinking (3) - 5 freq
making (3) - 57 freq
smiling (3) - 16 freq
seeking (3) - 6 freq
smokan (3) - 2 freq
moving (4) - 23 freq
boking (4) - 1 freq
smeakin (4) - 1 freq
stroking (4) - 1 freq
sumkin (4) - 1 freq
looking (4) - 160 freq
poking (4) - 5 freq
somehing (4) - 6 freq
booking (4) - 5 freq
sookin (4) - 52 freq
SoundEx code - S525
sunshine - 78 freq
sinkin - 25 freq
singin - 278 freq
swingin - 26 freq
snowkin - 7 freq
sinsyne - 30 freq
smoking - 13 freq
smokin - 92 freq
sneezin - 27 freq
sneezing - 4 freq
singing - 29 freq
sensins - 1 freq
snoozin - 9 freq
sneckin - 7 freq
somecunt's - 1 freq
smashin - 58 freq
'somecunt - 1 freq
smugness - 2 freq
sensyne - 7 freq
scancin - 6 freq
shangshan - 2 freq
somecunt - 6 freq
smashing - 6 freq
some-cunt - 1 freq
sneakin - 6 freq
samson - 8 freq
sneeshin - 8 freq
smeekin - 2 freq
sing-sang - 4 freq
synsyne's - 1 freq
smokkin - 3 freq
sunsheen - 6 freq
smashinest - 1 freq
sensin - 9 freq
sunken - 4 freq
smookan - 2 freq
singan - 12 freq
singin' - 2 freq
sinkin' - 2 freq
smokin' - 4 freq
simson's - 1 freq
smakken - 1 freq
simson - 1 freq
simsen's - 1 freq
sumkeyn - 1 freq
smeakin - 1 freq
scansouns - 1 freq
sinking - 5 freq
smookin - 3 freq
smackin - 4 freq
snashin - 1 freq
swinging - 3 freq
sneexin - 1 freq
sangsmiths - 1 freq
snoggin - 2 freq
somchin - 1 freq
snogyin - 1 freq
skinkin - 1 freq
sing-sangie - 1 freq
snoozan - 1 freq
smokeen - 2 freq
sneakan - 2 freq
snokin - 2 freq
scancein - 1 freq
sing-sangy - 3 freq
singsangy - 1 freq
scansin - 1 freq
sunnschein - 2 freq
sweengan - 2 freq
swung'im - 1 freq
snushan - 2 freq
sinkan - 2 freq
swingan - 1 freq
smokan - 2 freq
sanson - 1 freq
sensan - 1 freq
sangmaakers - 1 freq
sungkin - 1 freq
singkin - 1 freq
swinkin - 1 freq
sin-sheen - 1 freq
samson's - 3 freq
sumkin - 1 freq
smockan - 1 freq
sneezan - 6 freq
sinseen - 1 freq
sinshene - 1 freq
singin't - 1 freq
snoukin-saats - 1 freq
sangsmith - 4 freq
smushin - 4 freq
snowcem- - 1 freq
smeikin - 1 freq
sinshine - 10 freq
sunsheeny - 1 freq
sinsheeny - 1 freq
'singin - 1 freq
sunshein - 1 freq
snakin - 2 freq
singsong - 1 freq
smeegin - 1 freq
swankin' - 1 freq
sneakin' - 1 freq
€œsmashin - 1 freq
snoozing - 1 freq
semi-sympathetic - 1 freq
sensing - 1 freq
sense-o-humour - 1 freq
sangam - 1 freq
synesen - 1 freq
sneckins - 1 freq
sweengin - 2 freq
€˜sinsyne - 1 freq
€˜smashing - 1 freq
€˜smashin - 1 freq
semi-conscious - 1 freq
singjn' - 1 freq
shaunkennedy - 1 freq
smashin' - 1 freq
schmazing - 1 freq
semi-juniors - 1 freq
samheughan - 1 freq
smnzm - 1 freq
singinÂ’ - 1 freq
sinkn - 1 freq
snewsma - 1 freq
samsung - 2 freq
samhowson - 1 freq
snsmw - 1 freq
sjnjmzhrfc - 1 freq
swingingthelead - 1 freq
swanswan - 1 freq
sneaking - 1 freq
snecking - 1 freq
sunshinekid - 1 freq
seanmcm - 1 freq
smoochin - 1 freq
seancampbell - 5 freq
sonnyswanson - 2 freq
shamshoum - 1 freq
snacking - 1 freq
syngenta - 1 freq
snowyscene - 1 freq
MetaPhone code - SMKNK
smoking - 13 freq
SMOKING
Time to execute Levenshtein function - 0.467941 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.517326 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027612 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.068944 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000836 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.