A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to smokin in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
smokin (0) - 97 freq
smokan (1) - 2 freq
spokin (1) - 5 freq
smokit (1) - 6 freq
smokie (1) - 1 freq
smorin (1) - 5 freq
smookin (1) - 3 freq
smokin' (1) - 4 freq
smokkin (1) - 3 freq
smoking (1) - 13 freq
smok'n (1) - 1 freq
sookin (1) - 52 freq
snokin (1) - 2 freq
sooken (2) - 1 freq
smeakin (2) - 1 freq
shakin (2) - 107 freq
pokin (2) - 19 freq
sookan (2) - 3 freq
bokin (2) - 5 freq
sulkin (2) - 5 freq
stotin (2) - 2 freq
ookin (2) - 1 freq
sinkin (2) - 25 freq
soolin (2) - 2 freq
skoukin (2) - 2 freq
smokin (0) - 97 freq
smokan (1) - 2 freq
smookin (1) - 3 freq
smokeen (2) - 2 freq
smeekin (2) - 2 freq
smeakin (2) - 1 freq
sumkin (2) - 1 freq
smookan (2) - 2 freq
snokin (2) - 2 freq
smeikin (2) - 1 freq
sookin (2) - 52 freq
spokin (2) - 5 freq
smokin' (2) - 4 freq
smorin (2) - 5 freq
smokit (2) - 6 freq
smokkin (2) - 3 freq
smok'n (2) - 1 freq
smoking (2) - 13 freq
smokie (2) - 1 freq
spokan (3) - 3 freq
smokes (3) - 8 freq
sikin (3) - 2 freq
sekin (3) - 2 freq
smokey (3) - 5 freq
smirkin (3) - 11 freq
SoundEx code - S525
sunshine - 79 freq
sinkin - 25 freq
singin - 286 freq
swingin - 27 freq
snowkin - 7 freq
sinsyne - 30 freq
smoking - 13 freq
smokin - 97 freq
sneezin - 27 freq
sneezing - 4 freq
singing - 30 freq
sensins - 1 freq
snoozin - 9 freq
sneckin - 7 freq
somecunt's - 1 freq
smashin - 60 freq
'somecunt - 1 freq
smugness - 2 freq
sensyne - 7 freq
scancin - 6 freq
shangshan - 2 freq
somecunt - 6 freq
smashing - 6 freq
some-cunt - 1 freq
sneakin - 6 freq
samson - 8 freq
sneeshin - 8 freq
smeekin - 2 freq
sing-sang - 4 freq
synsyne's - 1 freq
smokkin - 3 freq
sunsheen - 6 freq
smashinest - 1 freq
sensin - 9 freq
sunken - 4 freq
smookan - 2 freq
singan - 12 freq
singin' - 2 freq
sinkin' - 2 freq
smokin' - 4 freq
simson's - 1 freq
smakken - 1 freq
simson - 1 freq
simsen's - 1 freq
sumkeyn - 1 freq
smok'n - 1 freq
smeakin - 1 freq
scansouns - 1 freq
sinking - 5 freq
smookin - 3 freq
smackin - 4 freq
snashin - 1 freq
swinging - 3 freq
sneexin - 1 freq
sangsmiths - 1 freq
snoggin - 2 freq
somchin - 1 freq
snogyin - 1 freq
skinkin - 1 freq
sing-sangie - 1 freq
snoozan - 1 freq
smokeen - 2 freq
sneakan - 2 freq
snokin - 2 freq
scancein - 1 freq
sing-sangy - 3 freq
singsangy - 1 freq
scansin - 1 freq
sunnschein - 2 freq
sweengan - 2 freq
swung'im - 1 freq
snushan - 2 freq
sinkan - 2 freq
swingan - 1 freq
smokan - 2 freq
sanson - 1 freq
sensan - 1 freq
sangmaakers - 1 freq
sungkin - 1 freq
singkin - 1 freq
swinkin - 1 freq
sin-sheen - 1 freq
samson's - 3 freq
sumkin - 1 freq
smockan - 1 freq
sneezan - 6 freq
sinseen - 1 freq
sinshene - 1 freq
singin't - 1 freq
snoukin-saats - 1 freq
sangsmith - 4 freq
smushin - 4 freq
snowcem- - 1 freq
smeikin - 1 freq
sinshine - 10 freq
sunsheeny - 1 freq
sinsheeny - 1 freq
'singin - 1 freq
sunshein - 1 freq
snakin - 2 freq
singsong - 1 freq
smeegin - 1 freq
swankin' - 1 freq
sneakin' - 1 freq
€œsmashin - 1 freq
snoozing - 1 freq
semi-sympathetic - 1 freq
sensing - 1 freq
sense-o-humour - 1 freq
sangam - 1 freq
synesen - 1 freq
sneckins - 1 freq
sweengin - 2 freq
€˜sinsyne - 1 freq
€˜smashing - 1 freq
€˜smashin - 1 freq
semi-conscious - 1 freq
singjn' - 1 freq
shaunkennedy - 1 freq
smashin' - 1 freq
schmazing - 1 freq
semi-juniors - 1 freq
samheughan - 1 freq
smnzm - 1 freq
singinÂ’ - 1 freq
sinkn - 1 freq
snewsma - 1 freq
samsung - 2 freq
samhowson - 1 freq
snsmw - 1 freq
sjnjmzhrfc - 1 freq
swingingthelead - 1 freq
swanswan - 1 freq
sneaking - 1 freq
snecking - 1 freq
sunshinekid - 1 freq
seanmcm - 1 freq
smoochin - 1 freq
seancampbell - 5 freq
sonnyswanson - 2 freq
shamshoum - 1 freq
snacking - 1 freq
syngenta - 1 freq
snowyscene - 1 freq
MetaPhone code - SMKN
smokin - 97 freq
smeekin - 2 freq
smokkin - 3 freq
smookan - 2 freq
smokin' - 4 freq
smakken - 1 freq
sumkeyn - 1 freq
smok'n - 1 freq
smeakin - 1 freq
smookin - 3 freq
smackin - 4 freq
smokeen - 2 freq
smokan - 2 freq
sumkin - 1 freq
smockan - 1 freq
smeikin - 1 freq
SMOKIN
Time to execute Levenshtein function - 0.192386 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.459025 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.030948 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037542 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000888 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.