A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to somehin in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
somehin (0) - 30 freq
sometin (1) - 7 freq
somchin (1) - 1 freq
somethin (1) - 1119 freq
somehing (1) - 6 freq
somthin (1) - 7 freq
someyin (1) - 2 freq
somehin' (1) - 1 freq
somehin's (2) - 1 freq
someman (2) - 1 freq
something (2) - 451 freq
somiethin (2) - 1 freq
soberin (2) - 2 freq
simethin (2) - 1 freq
someean (2) - 3 freq
semein (2) - 4 freq
soughin (2) - 2 freq
soghin (2) - 1 freq
somin (2) - 1 freq
souchin (2) - 16 freq
somehoo (2) - 31 freq
soochin (2) - 1 freq
somewan (2) - 34 freq
somehou (2) - 20 freq
somethins (2) - 2 freq
somehin (0) - 30 freq
someyin (2) - 2 freq
somehin' (2) - 1 freq
somthin (2) - 7 freq
sumhin (2) - 37 freq
somehing (2) - 6 freq
somchin (2) - 1 freq
sometin (2) - 7 freq
somethin (2) - 1119 freq
somewhen (3) - 1 freq
someeen (3) - 1 freq
somehou (3) - 20 freq
somewan (3) - 34 freq
samhain (3) - 1 freq
simthin (3) - 11 freq
stehin (3) - 1 freq
spehin (3) - 1 freq
sumthin (3) - 97 freq
soochin (3) - 1 freq
soothin (3) - 5 freq
someen (3) - 36 freq
somehow (3) - 73 freq
someean (3) - 3 freq
semein (3) - 4 freq
simethin (3) - 1 freq
SoundEx code - S550
someone - 100 freq
saumon - 19 freq
shinin - 77 freq
snaain - 2 freq
sumhin - 37 freq
somehin - 30 freq
swimmin - 63 freq
skimmin - 4 freq
seemin - 15 freq
soumin - 2 freq
scannin - 8 freq
soomin - 7 freq
simone - 4 freq
snowin - 4 freq
snawin - 6 freq
syne-an - 1 freq
seamen - 5 freq
sweemin - 32 freq
sennin - 12 freq
some-whun - 1 freq
sheenin - 26 freq
summin - 36 freq
somewan - 34 freq
shamhna - 1 freq
somewhen - 1 freq
some-een - 4 freq
samin - 4 freq
schemin - 3 freq
summon - 10 freq
sunnin - 5 freq
sumwan - 8 freq
swimmin' - 1 freq
samen - 35 freq
shynen - 1 freq
skimmen - 1 freq
saemony - 1 freq
schenan - 1 freq
showman - 4 freq
shamin - 2 freq
skymin - 1 freq
shine-in - 1 freq
shinan - 10 freq
seaman - 4 freq
seenin - 1 freq
samhain - 1 freq
swannin - 2 freq
simon - 57 freq
©simon - 1 freq
simeon - 8 freq
semein - 4 freq
senin' - 1 freq
sounin - 8 freq
seemon - 17 freq
symeon - 1 freq
summeen - 1 freq
someen - 36 freq
some-ean - 28 freq
someean - 3 freq
'seemon' - 1 freq
'some-ean - 1 freq
sweeman - 3 freq
scannan - 1 freq
scheman - 1 freq
sieman - 1 freq
shimmyin - 1 freq
'someone - 1 freq
someane - 17 freq
soonin - 1 freq
shynan - 1 freq
shammin - 1 freq
summin' - 3 freq
shinin' - 1 freq
shumhin - 1 freq
simwan - 1 freq
sumyin - 1 freq
shynin - 1 freq
somene - 4 freq
swimman - 2 freq
skimman - 1 freq
sumeen - 1 freq
skyimmin - 1 freq
swoonin - 2 freq
someyin - 2 freq
sainin - 4 freq
sumien - 1 freq
somane - 1 freq
saemin - 1 freq
sinnin - 2 freq
sinnonie - 1 freq
sinhine - 1 freq
skinnin - 3 freq
samhuinn - 1 freq
shoomin - 1 freq
somin - 1 freq
simian - 1 freq
sannyin - 1 freq
shonin - 1 freq
sumane - 4 freq
samain - 1 freq
€œseemin - 1 freq
somehin' - 1 freq
€”simon - 1 freq
sheenan' - 1 freq
sumin - 6 freq
sweeminÂ’ - 1 freq
samoan - 1 freq
shannon - 2 freq
shannnnan - 6 freq
simiain - 1 freq
someeen - 1 freq
'sinin - 1 freq
sheenan - 1 freq
sumeane - 1 freq
MetaPhone code - SMHN
sumhin - 37 freq
somehin - 30 freq
some-whun - 1 freq
somewhen - 1 freq
samhain - 1 freq
samhuinn - 1 freq
somehin' - 1 freq
samheughan - 1 freq
SOMEHIN
Time to execute Levenshtein function - 0.187213 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.371479 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027598 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037218 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000915 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.