A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to smoker in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
smoker (0) - 2 freq
smoke (1) - 116 freq
smoked (1) - 24 freq
smokey (1) - 5 freq
stoker (1) - 4 freq
smokes (1) - 8 freq
sooker (1) - 2 freq
smokers (1) - 8 freq
sober (2) - 31 freq
mower (2) - 15 freq
moder (2) - 2 freq
scorer (2) - 3 freq
smore (2) - 7 freq
smawer (2) - 9 freq
looker (2) - 8 freq
snoke (2) - 9 freq
smolder (2) - 1 freq
shoner (2) - 1 freq
stouer (2) - 1 freq
snokes (2) - 1 freq
smother (2) - 3 freq
smiler (2) - 1 freq
spaker (2) - 3 freq
smote (2) - 2 freq
sikker (2) - 1 freq
smoker (0) - 2 freq
smokes (2) - 8 freq
smokers (2) - 8 freq
stoker (2) - 4 freq
sooker (2) - 2 freq
smokey (2) - 5 freq
smoked (2) - 24 freq
smoke (2) - 116 freq
smaer (3) - 3 freq
smoor (3) - 28 freq
sma-er (3) - 1 freq
snooker (3) - 16 freq
smoky (3) - 5 freq
smokit (3) - 6 freq
seeker (3) - 2 freq
maker (3) - 18 freq
shaker (3) - 1 freq
smokin (3) - 97 freq
smaaer (3) - 7 freq
smok (3) - 2 freq
smokies (3) - 1 freq
smacker (3) - 3 freq
smokan (3) - 2 freq
sucker (3) - 3 freq
smokie (3) - 1 freq
SoundEx code - S526
snickerin - 5 freq
smacher - 2 freq
singers - 55 freq
singer - 47 freq
singer's - 2 freq
snocherin - 7 freq
smokers' - 4 freq
sincerely - 8 freq
sincere - 13 freq
sneegart - 1 freq
sneegert - 1 freq
sniggerin - 6 freq
sincerity - 3 freq
snochert - 1 freq
snickert - 1 freq
smoker - 2 freq
smasher - 4 freq
sanquhar - 9 freq
semi-circles - 1 freq
smickered - 1 freq
smickert - 1 freq
snicheren - 2 freq
smokers - 8 freq
sing-greet - 1 freq
sniggers - 5 freq
smugger - 1 freq
smickerin - 2 freq
sangria - 2 freq
snichered - 2 freq
snicher - 3 freq
skincare - 1 freq
smachrie - 2 freq
sniggered - 4 freq
singers' - 2 freq
sniggern - 9 freq
sniggert - 5 freq
smoker's - 2 freq
snooker - 16 freq
synchronised - 1 freq
snocher - 6 freq
snochered - 1 freq
smackaroonies - 2 freq
snicker - 1 freq
shae-makker - 1 freq
shae-maakers - 1 freq
smaikrie - 1 freq
sensory-seutid - 1 freq
sneegired - 1 freq
sneegirs - 1 freq
sanger - 1 freq
sheanchara - 1 freq
snicheran - 1 freq
snicherin - 1 freq
snickered - 1 freq
snigger' - 1 freq
snichert - 2 freq
sanchar - 1 freq
smacker - 3 freq
syncretism - 1 freq
snashers - 1 freq
snickerin' - 1 freq
shoemaker-levy - 1 freq
smacherie - 1 freq
synchronise - 1 freq
snookert - 1 freq
sensoryattachmentintervention - 1 freq
sensory - 4 freq
snochrie - 12 freq
singer-sangwriters - 1 freq
sanskrit - 1 freq
semi-circle - 1 freq
somequhaar - 1 freq
smicker - 1 freq
€œsincerely - 1 freq
synchronically - 1 freq
smacherry - 2 freq
sunscreem - 6 freq
smokiered - 16 freq
samcornwell - 1 freq
skinnycortado - 1 freq
sincerest - 1 freq
shingaurds - 1 freq
smoocher - 3 freq
shaunwkearney - 1 freq
sunnygradio - 1 freq
sensored - 1 freq
sonjahern - 1 freq
sinker - 1 freq
samgray - 29 freq
seanmccrory - 1 freq
snookers - 1 freq
sensors - 1 freq
snickers - 1 freq
sammgreer - 14 freq
suncream - 1 freq
MetaPhone code - SMKR
smoker - 2 freq
smugger - 1 freq
smaikrie - 1 freq
smacker - 3 freq
smicker - 1 freq
samgray - 29 freq
SMOKER
Time to execute Levenshtein function - 0.280662 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.471280 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.030399 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.047086 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000879 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.