A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to socitie in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
socitie (0) - 2 freq
societie (1) - 13 freq
societies (2) - 14 freq
sortie (2) - 2 freq
boitie (2) - 2 freq
nochtie (2) - 3 freq
citie (2) - 10 freq
politie (2) - 2 freq
savitie (2) - 2 freq
solitrie (2) - 1 freq
skitie (2) - 4 freq
softie (2) - 3 freq
sichtie (2) - 1 freq
sanitie (2) - 1 freq
dochtie (2) - 3 freq
scotitie (2) - 1 freq
toatie (3) - 6 freq
skitit (3) - 15 freq
saitin (3) - 3 freq
staigie (3) - 1 freq
foostie (3) - 15 freq
sowie (3) - 1 freq
gowstie (3) - 7 freq
suitid (3) - 3 freq
michtie (3) - 54 freq
socitie (0) - 2 freq
societie (1) - 13 freq
sanitie (3) - 1 freq
sichtie (3) - 1 freq
scotitie (3) - 1 freq
skitie (3) - 4 freq
scoti (3) - 1 freq
scootie (3) - 2 freq
scotia (3) - 26 freq
softie (3) - 3 freq
citie (3) - 10 freq
society (3) - 177 freq
sortie (3) - 2 freq
societies (3) - 14 freq
savitie (3) - 2 freq
shiti (4) - 1 freq
solit (4) - 5 freq
suntie (4) - 6 freq
spite (4) - 73 freq
sautie (4) - 4 freq
cite (4) - 2 freq
sixtie (4) - 3 freq
cuitie (4) - 1 freq
shotie (4) - 3 freq
saxtie (4) - 6 freq
SoundEx code - S230
sooked - 23 freq
shocked - 38 freq
sicht - 621 freq
sought - 10 freq
socht - 143 freq
sixty - 58 freq
seched - 6 freq
succeed - 12 freq
sight - 134 freq
seaside - 10 freq
sighed - 115 freq
squeezed - 27 freq
squeaked - 9 freq
seekit - 1 freq
society - 177 freq
sixth - 18 freq
saxty - 24 freq
sized - 13 freq
soucht - 9 freq
'sixty - 1 freq
'saxty - 1 freq
sagged - 1 freq
souched - 6 freq
saxt - 8 freq
socket - 4 freq
squished - 2 freq
squeeked - 1 freq
shoggit - 3 freq
suist - 1 freq
souchd - 4 freq
sookit - 27 freq
sachet - 1 freq
soackit - 1 freq
swickit - 6 freq
saucht - 10 freq
swished - 2 freq
squashed - 12 freq
soocide - 1 freq
suicide - 10 freq
soukt - 3 freq
saxth - 4 freq
seased - 1 freq
seized - 6 freq
shicht - 2 freq
sea-side - 1 freq
soukit - 10 freq
soughit - 3 freq
saughit - 1 freq
sussed - 12 freq
seycht - 1 freq
soaket - 4 freq
shcydee - 1 freq
sect - 3 freq
susset - 1 freq
sucked - 6 freq
shoujd - 1 freq
seeched - 1 freq
shagged - 2 freq
swashed - 2 freq
skooshed - 6 freq
soused - 2 freq
sixtie - 3 freq
saxtie - 6 freq
societie - 13 freq
shockt - 5 freq
squeakt - 1 freq
soakit - 4 freq
segued - 1 freq
saiket - 1 freq
sughut - 2 freq
sist - 5 freq
sacked - 9 freq
secht - 3 freq
shoacked - 2 freq
squeakit - 4 freq
sees't - 1 freq
soaked - 11 freq
sackt - 1 freq
squaiked - 8 freq
shockit - 1 freq
seiched - 1 freq
shokkit - 2 freq
skoosht - 1 freq
sae-cawed - 8 freq
swigged - 1 freq
shocht - 1 freq
sweeshed - 3 freq
shakkit - 1 freq
sheuched - 1 freq
siched - 9 freq
'society' - 1 freq
society' - 1 freq
sackit - 1 freq
scoukit - 1 freq
shaakit - 1 freq
sixt - 5 freq
shaste - 1 freq
soched - 1 freq
saggit - 1 freq
shugyit - 1 freq
scoggit - 1 freq
sikkit - 1 freq
swikkit - 1 freq
squawkit - 1 freq
sext - 1 freq
succede - 1 freq
seuched - 1 freq
saught - 1 freq
swick't - 1 freq
schist - 1 freq
shaest - 1 freq
sookt - 1 freq
secked - 2 freq
squaashed - 1 freq
squeezt - 1 freq
soshietie - 2 freq
skooshit - 1 freq
skecht - 1 freq
siesta - 1 freq
saised - 1 freq
shakit - 4 freq
sae-caad - 1 freq
skyuggit - 1 freq
swooshed - 1 freq
sizzed - 1 freq
succoth - 1 freq
socitie - 2 freq
soughed - 1 freq
shooshed - 2 freq
seigit - 1 freq
scuggit - 1 freq
seceede - 1 freq
sqwashed - 1 freq
sooside - 2 freq
shooched - 1 freq
so-cawed - 2 freq
swecht - 1 freq
sichtie - 1 freq
sweechd - 1 freq
saksd - 1 freq
swicked - 1 freq
shoosht - 1 freq
shecht - 1 freq
sschzed - 1 freq
schect - 1 freq
schecht - 1 freq
skecked - 2 freq
sexty - 1 freq
MetaPhone code - SST
seaside - 10 freq
society - 177 freq
ceased - 7 freq
sized - 13 freq
suist - 1 freq
soocide - 1 freq
cist - 1 freq
suicide - 10 freq
seased - 1 freq
seized - 6 freq
sea-side - 1 freq
sussed - 12 freq
susset - 1 freq
soused - 2 freq
societie - 13 freq
sist - 5 freq
sees't - 1 freq
zest - 2 freq
'society' - 1 freq
society' - 1 freq
ceest - 1 freq
siesta - 1 freq
saised - 1 freq
sizzed - 1 freq
ciste - 2 freq
socitie - 2 freq
wycest - 1 freq
seceede - 1 freq
sooside - 2 freq
szd - 1 freq
cyst - 1 freq
xst - 1 freq
SOCITIE
Time to execute Levenshtein function - 0.260630 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.525520 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.062208 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037252 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000866 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.