A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to scotbot in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
scotbot (0) - 8 freq
scotpol (2) - 14 freq
scott (2) - 290 freq
scotto (2) - 1 freq
scoto (2) - 1 freq
scoot (2) - 13 freq
scotlit (2) - 38 freq
scotgov (2) - 50 freq
scottyt (2) - 1 freq
scythit (3) - 1 freq
scowlt (3) - 2 freq
scooby (3) - 19 freq
scoter (3) - 1 freq
scotlan (3) - 63 freq
mcbot (3) - 1 freq
scoldit (3) - 1 freq
citbo (3) - 1 freq
scots (3) - 6761 freq
sotto (3) - 2 freq
scotia (3) - 26 freq
scotch (3) - 127 freq
scot (3) - 107 freq
scottr (3) - 2 freq
combo (3) - 5 freq
cscott (3) - 5 freq
scotbot (0) - 8 freq
scotlit (3) - 38 freq
scottyt (3) - 1 freq
scotto (3) - 1 freq
scott (3) - 290 freq
scootit (4) - 3 freq
scoutit (4) - 1 freq
scabbit (4) - 8 freq
scuttit (4) - 2 freq
scrybit (4) - 1 freq
scottie (4) - 3 freq
scotti (4) - 2 freq
scoot (4) - 13 freq
scoto (4) - 1 freq
scotpol (4) - 14 freq
scotgov (4) - 50 freq
scythit (4) - 1 freq
scotty (4) - 29 freq
mscott (5) - 1 freq
scotsla (5) - 1 freq
scoopit (5) - 3 freq
scoit (5) - 1 freq
sobbit (5) - 3 freq
scotmeg (5) - 1 freq
scottis (5) - 52 freq
SoundEx code - S313
stopped - 157 freq
stapped - 121 freq
stupit - 131 freq
stappit - 115 freq
stoapped - 18 freq
stoappt - 3 freq
stupitest - 2 freq
stupid - 57 freq
stepped - 65 freq
steppit - 41 freq
stabbed - 9 freq
stoppit - 100 freq
stuffed - 35 freq
stippit - 6 freq
stoppt - 11 freq
stoppd - 1 freq
stoaped - 15 freq
stoapit - 2 freq
sitivation - 1 freq
stepda - 7 freq
stepfaither - 1 freq
steepit - 8 freq
stept - 4 freq
steeped - 5 freq
stooped - 6 freq
steppt - 4 freq
stapt - 30 freq
stepfaither's - 1 freq
stoppet - 9 freq
stappet - 1 freq
stoapet - 3 freq
stupidly - 4 freq
'stepdad - 1 freq
stepdad - 1 freq
stepdas - 1 freq
stepdaughter - 1 freq
stoopit - 4 freq
stupidity - 6 freq
stuppit - 1 freq
stoppid - 1 freq
stoopeed - 1 freq
steppid - 3 freq
stoopid - 2 freq
staapt - 6 freq
stappt - 5 freq
stubbed - 3 freq
stufit-heidit - 1 freq
stop-waatch - 1 freq
stopt - 5 freq
stoupt - 1 freq
stupit-lookin - 1 freq
stupidest - 2 freq
sitivautioun - 6 freq
stoupit - 1 freq
settified - 1 freq
stabbit - 2 freq
stobbed - 1 freq
stappit-fu - 3 freq
stapp't - 1 freq
stap't - 1 freq
stoup't - 1 freq
setifeed - 1 freq
stipit - 7 freq
seetivation - 1 freq
stauppit - 4 freq
setified - 2 freq
scuddie-happit - 1 freq
staffed - 1 freq
staved - 2 freq
step-faither - 6 freq
'stupit-lookin' - 1 freq
stuffit - 1 freq
styoopit - 1 freq
stfdb - 1 freq
stipid - 3 freq
stpatrick - 1 freq
shitepatter - 1 freq
“switftkey - 1 freq
scot-baiting - 1 freq
satefeed - 11 freq
stfdbk - 1 freq
stephderm - 1 freq
scottfitzpatr - 7 freq
scotbot - 8 freq
stevedudley - 2 freq
stopitaggers - 1 freq
stevedickson - 1 freq
MetaPhone code - SKTBT
scotbot - 8 freq
SCOTBOT
Time to execute Levenshtein function - 0.206352 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.341315 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.030905 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037073 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000834 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.