A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to smithycroft in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
smithycroft (0) - 4 freq
smithycroftlt (2) - 2 freq
smithycrofteng (3) - 5 freq
smithyfor (4) - 1 freq
smithert (4) - 2 freq
swithert (5) - 5 freq
minecraft (5) - 3 freq
slithert (5) - 3 freq
smitherin (5) - 1 freq
smithbarryc (5) - 1 freq
mithert (5) - 1 freq
witchcraft (5) - 8 freq
ashcroft (5) - 6 freq
smithy (5) - 3 freq
switcht (6) - 1 freq
smithermann (6) - 1 freq
smitch (6) - 8 freq
€˜ithoot (6) - 3 freq
stitchwort (6) - 1 freq
swithering (6) - 1 freq
slaithert (6) - 2 freq
thyroid (6) - 1 freq
itchycoo (6) - 1 freq
undercroft (6) - 1 freq
€™ithoot (6) - 6 freq
smithycroft (0) - 4 freq
smithycroftlt (4) - 2 freq
smithycrofteng (5) - 5 freq
smithert (6) - 2 freq
ashcroft (7) - 6 freq
smithyfor (7) - 1 freq
mithert (8) - 1 freq
smithbarryc (8) - 1 freq
witchcraft (8) - 8 freq
minecraft (8) - 3 freq
swithert (8) - 5 freq
slithert (8) - 3 freq
smitherin (8) - 1 freq
smithereens (9) - 7 freq
shrift (9) - 2 freq
smutherin (9) - 1 freq
sowthert (9) - 2 freq
seacraft (9) - 1 freq
wanthrift (9) - 1 freq
slaithirt (9) - 1 freq
southert (9) - 1 freq
smuthick (9) - 1 freq
thrift (9) - 8 freq
smothered (9) - 6 freq
soothart (9) - 1 freq
SoundEx code - S532
sounds - 102 freq
snetchit - 1 freq
smudges - 2 freq
saunds - 5 freq
soonds - 248 freq
sundays - 15 freq
sends - 44 freq
snatch - 14 freq
soonds'll - 2 freq
sands - 24 freq
sandwiches - 35 freq
snatches - 7 freq
smitch - 8 freq
sandwich - 16 freq
saundstane - 5 freq
senatus - 2 freq
sandy's - 15 freq
saund-kelpie - 1 freq
saund-kelpies - 1 freq
'saund-kelpie' - 1 freq
'scents - 1 freq
scientists - 15 freq
syntax - 20 freq
synthesis - 3 freq
synthesisin - 1 freq
synthesin - 1 freq
saunts - 5 freq
somedy's - 2 freq
smudged - 4 freq
scientist - 11 freq
semmits - 9 freq
smiths - 3 freq
saints - 17 freq
smiddy's - 2 freq
snouts - 2 freq
smidgin - 2 freq
sonnets - 8 freq
semitic - 1 freq
santiago - 2 freq
smudge - 11 freq
sandsteen - 2 freq
snaw-white's - 1 freq
scents - 4 freq
shanties - 1 freq
saundcastles - 1 freq
santa's - 9 freq
sanitiser - 3 freq
sanitizer - 3 freq
snitches - 2 freq
smowts - 1 freq
saunt's - 1 freq
snatched - 9 freq
sandwiched - 2 freq
some'dy's - 2 freq
scandic - 3 freq
skin-ticht - 1 freq
sunday's - 3 freq
sumdy's - 3 freq
scants - 1 freq
sandstane - 6 freq
snootcloot - 2 freq
sneds - 3 freq
skinticht - 1 freq
suntie's - 2 freq
summits - 5 freq
saundshoe - 1 freq
sonnets' - 1 freq
snodcakes - 1 freq
snodcake - 1 freq
syndes - 1 freq
smoots - 2 freq
'sounds - 2 freq
smith's - 6 freq
sandwicht - 2 freq
smooths - 1 freq
simmets - 2 freq
snatchets - 1 freq
smoothy's - 1 freq
smuthick - 1 freq
sinths - 1 freq
sinthesised - 1 freq
sommat's - 2 freq
sandstone - 6 freq
smitts - 2 freq
sandisans - 1 freq
sandside - 1 freq
sandsend - 1 freq
sandsgarth - 1 freq
sanitised - 2 freq
snitch - 8 freq
smitsome - 1 freq
sun-dicht - 1 freq
simmet's - 1 freq
somedie's - 1 freq
sanitise - 1 freq
synds - 2 freq
snaitched - 1 freq
snaitchin - 1 freq
sand-stane - 1 freq
skinheids - 1 freq
snoots - 4 freq
smaads - 1 freq
seimits - 1 freq
saands - 1 freq
smutchack - 1 freq
schnitzel - 1 freq
sand-clogged - 1 freq
sandshun - 1 freq
soondscapes - 1 freq
sceintic - 1 freq
soond's - 1 freq
sunties - 2 freq
smeeth-caimbed - 1 freq
smits - 1 freq
smithsonian - 3 freq
smithsonianfolklifefestival - 1 freq
snatchan - 1 freq
sandsound - 3 freq
shindig - 3 freq
semiotics - 1 freq
squints - 1 freq
sandstrøm - 1 freq
sonatas - 2 freq
snatchers - 1 freq
sanitisin - 1 freq
snatchin - 1 freq
snatcher - 1 freq
syntactic - 1 freq
sandy-coloured - 1 freq
sandstorm - 1 freq
smatchet - 1 freq
smiddies - 1 freq
sanitising - 1 freq
€œsoonds - 1 freq
sumdys - 1 freq
sants - 1 freq
syndicalists - 1 freq
soand-so - 1 freq
somatic - 1 freq
snaw-dusted - 1 freq
shandwick - 1 freq
syntactical - 2 freq
somedae's - 1 freq
smootie's - 6 freq
'smootie's - 1 freq
smidgen - 1 freq
smtxabi - 1 freq
smtxjbdlt - 1 freq
sandiescot - 2 freq
syndicate - 1 freq
smithÂ’s - 1 freq
snoot-cloot - 1 freq
sundayÂ’s - 1 freq
snitching - 1 freq
shindog - 1 freq
shandys - 1 freq
shandies - 1 freq
sundaycreaking - 1 freq
semtex - 1 freq
soundcloud - 3 freq
snowthistle - 1 freq
sinethugcat - 2 freq
smithycroftlt - 2 freq
smithycroft - 4 freq
sannytizer - 1 freq
sandwick - 3 freq
santy's - 1 freq
smithycrofteng - 5 freq
'sandwich' - 1 freq
snitch' - 1 freq
sandstonepress - 2 freq
sandys - 8 freq
santas - 2 freq
sundayshoutsfc - 3 freq
's math sin - 1 freq
snettsbirder - 5 freq
sendsnowdayhelp - 1 freq
saintso - 1 freq
MetaPhone code - SM0KRFT
smithycroft - 4 freq
SMITHYCROFT
Time to execute Levenshtein function - 0.194644 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.399453 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028279 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.038363 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000881 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.