A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

Similar words to smug in Corpus

Methods: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated
Levenshtein (distance in parentheses)
smug (0) - 17 freq
mug (1) - 45 freq
scug (1) - 5 freq
shug (1) - 21 freq
smue (1) - 2 freq
smg (1) - 1 freq
smog (1) - 4 freq
stug (1) - 1 freq
spug (1) - 1 freq
smig (1) - 1 freq
slug (1) - 11 freq
smur (1) - 3 freq
snug (1) - 30 freq
sru (2) - 1 freq
sur (2) - 22 freq
mrg (2) - 1 freq
Ÿmur (2) - 1 freq
sma' (2) - 13 freq
mut (2) - 3 freq
soun (2) - 163 freq
mugs (2) - 17 freq
'meg (2) - 1 freq
mus (2) - 1 freq
song (2) - 132 freq
stub (2) - 1 freq
Double Levenshtein (distance in parentheses)
smug (0) - 17 freq
smog (1) - 4 freq
smig (1) - 1 freq
smg (1) - 1 freq
slug (2) - 11 freq
snug (2) - 30 freq
smeeg (2) - 5 freq
smur (2) - 3 freq
sumg (2) - 1 freq
scug (2) - 5 freq
mug (2) - 45 freq
shug (2) - 21 freq
spug (2) - 1 freq
stug (2) - 1 freq
smue (2) - 2 freq
slag (3) - 16 freq
skag (3) - 3 freq
sma (3) - 266 freq
smout (3) - 1 freq
sg (3) - 15 freq
smoo (3) - 1 freq
sneg (3) - 1 freq
mig (3) - 2 freq
smaw (3) - 49 freq
shag (3) - 14 freq
SoundEx code - S520
suns - 15 freq
sink - 96 freq
swung - 62 freq
smoky - 4 freq
swings - 35 freq
since - 561 freq
sense - 513 freq
swans' - 1 freq
sang - 611 freq
sing - 344 freq
siem's - 1 freq
soons - 31 freq
sons - 89 freq
sung - 68 freq
smoke - 111 freq
sneeze - 24 freq
shanks - 74 freq
seems - 517 freq
snake - 75 freq
sins - 56 freq
sank - 34 freq
seein's - 7 freq
seeing - 59 freq
song - 132 freq
smack - 25 freq
saying - 107 freq
smeuk - 6 freq
swinge - 1 freq
scaums - 1 freq
snaik - 2 freq
snecks - 7 freq
sangs - 256 freq
seams - 6 freq
schames - 2 freq
sheens - 10 freq
sangshaw - 1 freq
shimmies - 2 freq
singe - 3 freq
sunheich - 1 freq
swing - 68 freq
smick - 1 freq
swines - 4 freq
'smoke - 2 freq
sync - 6 freq
smug - 17 freq
skin's - 2 freq
'sneaky - 1 freq
smash - 26 freq
snooze - 17 freq
skins - 21 freq
smaas - 1 freq
sneck - 41 freq
sonsie - 53 freq
scance - 31 freq
souns - 52 freq
sconce - 2 freq
sinse - 19 freq
snaws - 15 freq
sayin's - 2 freq
sings - 53 freq
sunk - 26 freq
sums - 43 freq
séance - 1 freq
schemes - 42 freq
sonic - 6 freq
sneak - 23 freq
shanks' - 1 freq
science - 76 freq
snacks - 4 freq
snack - 7 freq
scones - 46 freq
sun's - 20 freq
snook - 2 freq
same's - 4 freq
schaims - 1 freq
snash - 24 freq
shuin's - 2 freq
sneesh - 5 freq
sonsy - 10 freq
'seein's - 1 freq
seyins - 6 freq
snoke - 9 freq
'shines - 1 freq
snowk - 6 freq
'seence - 1 freq
scanse - 4 freq
suin's - 3 freq
symes - 2 freq
syme's - 1 freq
swank - 9 freq
scenes - 45 freq
smeik - 6 freq
sens - 6 freq
sains - 4 freq
snug - 30 freq
shewing - 1 freq
smokey - 5 freq
scans - 4 freq
shines - 37 freq
smoochy - 2 freq
seamus - 3 freq
songs - 44 freq
snog - 7 freq
swang - 5 freq
sinks - 23 freq
sneaky - 9 freq
seamaws - 2 freq
synes - 1 freq
sauns - 3 freq
seen's - 2 freq
sweeng - 11 freq
sang' - 2 freq
sanwich - 1 freq
sowens - 1 freq
snag - 1 freq
skiing - 2 freq
showing - 16 freq
seem's - 3 freq
smoak - 2 freq
seyn's - 2 freq
saan's - 1 freq
syne's - 1 freq
seink - 1 freq
skim's - 2 freq
smacks - 2 freq
shahnk - 1 freq
soon's - 1 freq
snek - 11 freq
swans - 34 freq
smog - 4 freq
senga - 35 freq
swaying - 2 freq
sea-maws - 4 freq
shank - 22 freq
scene's - 1 freq
siine's - 1 freq
seance - 2 freq
somchow - 1 freq
seeins - 2 freq
skanks - 1 freq
smock - 14 freq
swankie - 6 freq
sammie's - 1 freq
sawney's - 3 freq
sweems - 3 freq
scanes - 1 freq
shins - 3 freq
saimness - 1 freq
swanky - 11 freq
sammy's - 4 freq
scheme's - 2 freq
sayins - 4 freq
snush - 2 freq
shanesie - 2 freq
shanes - 1 freq
seemes - 1 freq
squeamish - 3 freq
singiy - 1 freq
schemies - 4 freq
snuck - 2 freq
singh - 6 freq
sciencey - 1 freq
sincé - 1 freq
sooms - 3 freq
snicks - 1 freq
sïns - 12 freq
sinns - 4 freq
senns - 2 freq
sing' - 2 freq
skïns - 3 freq
sen's - 2 freq
sam's - 2 freq
smeek - 5 freq
smg - 1 freq
snags - 1 freq
sneaks - 3 freq
sneck's - 1 freq
saunnies - 3 freq
snugs - 1 freq
smeeg - 5 freq
senzie - 1 freq
'sangshaw' - 1 freq
sonja - 3 freq
soums - 3 freq
song' - 1 freq
sans - 7 freq
shanks's - 1 freq
shames - 2 freq
'skinny's' - 1 freq
sankey - 1 freq
semi's - 1 freq
shank' - 2 freq
sunks - 3 freq
sonk - 1 freq
semes - 2 freq
snackie - 1 freq
smoosie - 2 freq
sunny's - 9 freq
sheena's - 3 freq
sannies - 7 freq
soun's - 1 freq
shunkie - 1 freq
snashy - 2 freq
sneuk - 2 freq
siamese - 1 freq
soang - 1 freq
sewing - 4 freq
smooch - 1 freq
semoc - 1 freq
sea-maws' - 1 freq
shenachie - 3 freq
skims - 1 freq
sannis - 1 freq
'seeing - 1 freq
smoks - 1 freq
æshnis - 2 freq
siems - 2 freq
smuck - 1 freq
shame's - 1 freq
son's - 3 freq
sangschaw - 13 freq
sum's - 1 freq
'sink' - 1 freq
smush - 13 freq
suins - 3 freq
sin's - 1 freq
some's - 1 freq
sweeing - 1 freq
seumas - 17 freq
skeens - 3 freq
skweenge - 1 freq
sinews - 2 freq
sonsi - 1 freq
sneug - 1 freq
swaans - 4 freq
shangie - 1 freq
swaan's - 1 freq
scunce - 6 freq
shannack - 2 freq
saains - 1 freq
sanns - 3 freq
sheemach - 1 freq
shang - 1 freq
skene's - 2 freq
sawins - 1 freq
sneg - 1 freq
'sennachie' - 2 freq
'songs - 1 freq
'syme's - 1 freq
snowks - 1 freq
skames - 2 freq
shannoch - 1 freq
swains - 1 freq
skynnis - 2 freq
sune's - 1 freq
sowans - 1 freq
sonnis - 1 freq
sayings - 8 freq
sie-mews - 1 freq
sneish - 1 freq
shanghai - 2 freq
sions - 1 freq
‘sang - 2 freq
smaws - 1 freq
shawing - 1 freq
scams - 1 freq
sunsh- - 1 freq
smok - 2 freq
'science - 1 freq
snhs - 3 freq
‘since - 4 freq
‘smack - 1 freq
“senga - 2 freq
“sengaaa - 1 freq
‘sneak - 1 freq
shansi - 1 freq
ski-ing - 1 freq
sommies - 3 freq
song's - 1 freq
“since - 1 freq
smocks - 2 freq
schons - 1 freq
sonse - 5 freq
shanzi - 4 freq
snigs - 1 freq
so-an-so - 1 freq
“suhing - 1 freq
sean's - 1 freq
'suenos' - 1 freq
songe - 1 freq
skanky - 1 freq
shankie - 1 freq
singhia - 1 freq
smig - 1 freq
swinkie - 1 freq
saems - 1 freq
sinky - 1 freq
shinzzaa - 1 freq
snoozey - 1 freq
snkkwi - 1 freq
son’s - 2 freq
smx - 1 freq
smoochie - 1 freq
'skins - 1 freq
sence - 1 freq
skynews - 6 freq
seamyc - 1 freq
smaic - 1 freq
sneckie - 3 freq
snakey - 1 freq
semis - 4 freq
sines - 1 freq
swansea - 1 freq
shauniex - 1 freq
sony's - 1 freq
smokie - 1 freq
sange - 1 freq
sumg - 1 freq
simmykay - 2 freq
sheenz - 37 freq
sheena’z - 1 freq
sqanews - 15 freq
sunak - 6 freq
simcha - 2 freq
shonky - 1 freq
saunies - 1 freq
shane’s - 1 freq
skink - 1 freq
snc - 1 freq
shona's - 1 freq
shank‘s - 1 freq
sneeky - 1 freq
snow's - 1 freq
sma's - 1 freq
sssscomic - 4 freq
snx - 1 freq
songs' - 1 freq
sngio - 1 freq
shaunmcc - 57 freq
shanksy - 1 freq
shomac - 8 freq
souness - 1 freq
MetaPhone code - SMK
smoky - 4 freq
smoke - 111 freq
smack - 25 freq
smeuk - 6 freq
smick - 1 freq
'smoke - 2 freq
smug - 17 freq
smeik - 6 freq
smokey - 5 freq
smoak - 2 freq
smog - 4 freq
smock - 14 freq
smeek - 5 freq
smg - 1 freq
smeeg - 5 freq
semoc - 1 freq
smuck - 1 freq
smok - 2 freq
‘smack - 1 freq
smig - 1 freq
seamyc - 1 freq
smaic - 1 freq
ysmq - 1 freq
smokie - 1 freq
sumg - 1 freq
simmykay - 2 freq
Manually curated - SMUG
Time to execute Levenshtein function - 0.183236 milliseconds
The Levenshtein distance is the minimum number of single-character replacements, insertions or deletions needed to transform one word into another. It is useful for detecting typos and alternative spellings.
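The definition above can be sketched in a few lines of Python. This is a minimal illustration of the standard dynamic-programming approach, not the code this site runs; the function name `levenshtein` and the sample word list are chosen here for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Return the minimum number of single-character edits turning a into b."""
    # Keep the shorter word in the inner loop so the rows stay small.
    if len(a) < len(b):
        a, b = b, a
    # previous[j] holds the distance between the processed prefix of a and b[:j].
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]  # turning a[:i] into the empty string takes i deletions
        for j, char_b in enumerate(b, start=1):
            substitution_cost = 0 if char_a == char_b else 1
            current.append(min(
                previous[j] + 1,                      # delete char_a
                current[j - 1] + 1,                   # insert char_b
                previous[j - 1] + substitution_cost,  # replace or keep
            ))
        previous = current
    return previous[-1]


if __name__ == "__main__":
    # A handful of words taken from the result list on this page.
    for candidate in ["smug", "mug", "smog", "snug", "sur", "song"]:
        print(candidate, levenshtein("smug", candidate))
```

Sorting a vocabulary by this score and keeping the lowest values gives a nearest-neighbour list of the kind shown on this page.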
Time to execute Double Levenshtein function - 0.353236 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the words as written and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
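A rough sketch of that idea follows. It assumes the vowels are the letters a, e, i, o and u, which the page does not state; `strip_vowels` and `double_levenshtein` are illustrative names, not the site's own functions.

```python
VOWELS = set("aeiou")  # assumption: which letters count as vowels is not stated


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance (row-by-row dynamic programming)."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop the assumed vowel letters, leaving the consonant skeleton."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


if __name__ == "__main__":
    for candidate in ["smog", "mug", "smeeg", "slag", "sma"]:
        print(candidate, double_levenshtein("smug", candidate))
```

Under that vowel assumption, the sketch gives smog a score of 1 and mug a score of 2, matching the Double Levenshtein results listed for smug. A vowel change costs one edit, while a consonant change is counted in both passes.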
Time to execute SoundEx function - 0.027983 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
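As an illustration, here is a compact sketch of the widely used American Soundex rules: keep the first letter, map the remaining consonants to digits, collapse adjacent repeats, and pad to four characters. The site's own implementation is not shown, so this is an assumption about its behaviour; the `soundex` function and `CODES` table are written for this example.

```python
# Consonant groups and their Soundex digits; vowels and y carry no digit.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for letter in letters:
        CODES[letter] = digit


def soundex(word: str) -> str:
    """Return a four-character Soundex code, or "" if the word has no letters."""
    letters = [ch for ch in word.lower() if ch.isalpha()]
    if not letters:
        return ""
    result = letters[0].upper()
    previous = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate two consonants with the same digit
        code = CODES.get(ch, "")
        if code and code != previous:
            result += code
            if len(result) == 4:
                break
        previous = code  # a vowel resets this, so a repeated digit counts again
    return result.ljust(4, "0")


if __name__ == "__main__":
    for word in ["smug", "song", "science", "shanks", "sangschaw"]:
        print(word, soundex(word))
```

With these rules smug encodes as S520, the code shown above, and so do song, science and shanks, which is why such dissimilar-looking words share one list.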
Time to execute MetaPhone function - 0.039211 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001045 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
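In code, a lookup of this kind amounts to a dictionary keyed by word form. The sketch below is only illustrative: apart from smug mapping to SMUG, which mirrors the result shown on this page, the entries are hypothetical and are not taken from the real lexicon. `LEXICON` and `lemma_for` are names invented for the example.

```python
from typing import Optional

# Hand-built table mapping word forms to lemmas.
# Only "smug" -> "SMUG" mirrors this page; the other entries are hypothetical.
LEXICON = {
    "smug": "SMUG",
    "smugg": "SMUG",   # hypothetical typo
    "smugly": "SMUG",  # hypothetical derived form
}


def lemma_for(word: str) -> Optional[str]:
    """Return the curated lemma for a word, or None if the word is not covered."""
    return LEXICON.get(word.lower())


if __name__ == "__main__":
    for word in ["smug", "Smugg", "ablow"]:
        print(word, lemma_for(word))
```

A dictionary lookup does no distance calculation at all, which is consistent with this method having the shortest execution time reported above, and returning `None` reflects the caveat that not all words are covered.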