A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to stub in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
stub (0) - 2 freq
sub (1) - 9 freq
stubs (1) - 1 freq
stut (1) - 1 freq
stun (1) - 2 freq
snub (1) - 2 freq
stuk (1) - 1 freq
tub (1) - 22 freq
stur (1) - 3 freq
stu (1) - 1 freq
stob (1) - 7 freq
stud (1) - 38 freq
stug (1) - 1 freq
stuc (1) - 2 freq
stab (1) - 12 freq
stue (1) - 11 freq
spun (2) - 28 freq
daub (2) - 3 freq
stra (2) - 1 freq
souk (2) - 14 freq
shum (2) - 1 freq
ston (2) - 9 freq
stunk (2) - 3 freq
steud (2) - 6 freq
stop (2) - 521 freq
stub (0) - 2 freq
stob (1) - 7 freq
stab (1) - 12 freq
stu (2) - 1 freq
stue (2) - 11 freq
stud (2) - 38 freq
stuc (2) - 2 freq
stur (2) - 3 freq
stug (2) - 1 freq
stut (2) - 1 freq
stubs (2) - 1 freq
tub (2) - 22 freq
stun (2) - 2 freq
sub (2) - 9 freq
snub (2) - 2 freq
stuk (2) - 1 freq
stobs (3) - 8 freq
scuba (3) - 1 freq
tube (3) - 47 freq
ktb (3) - 1 freq
tb (3) - 14 freq
stow (3) - 7 freq
snb (3) - 2 freq
sty (3) - 3 freq
sjb (3) - 1 freq
SoundEx code - S310
stop - 521 freq
step - 258 freq
stap - 293 freq
stowp - 5 freq
stiff - 55 freq
stoop - 16 freq
stoap - 142 freq
stuff - 624 freq
styfie - 1 freq
sadovaya - 1 freq
'stoap - 3 freq
staff - 109 freq
steep - 34 freq
'stop - 7 freq
stove - 63 freq
stab - 12 freq
stevie - 181 freq
steve - 46 freq
stieve - 28 freq
stob - 7 freq
staap - 1 freq
steive - 12 freq
stave - 1 freq
staive - 1 freq
shid've - 4 freq
stub - 2 freq
stoppy - 1 freq
shtuff - 1 freq
sedova - 1 freq
staffie - 2 freq
stuffie - 5 freq
stovie - 1 freq
stoppe - 1 freq
she'd've - 1 freq
shuid've - 1 freq
stav - 2 freq
'stap - 10 freq
shaddup - 3 freq
shut-up - 1 freq
shood've - 5 freq
shood''ve - 1 freq
'stop' - 2 freq
stoup - 6 freq
'stoup' - 1 freq
staup - 5 freq
staif - 1 freq
set-up - 8 freq
'stap' - 1 freq
shid'v - 1 freq
stoav - 2 freq
steave - 1 freq
sadwife - 1 freq
skaithfu - 4 freq
stiff' - 1 freq
stiv - 1 freq
stify - 1 freq
stowfie - 1 freq
stuffy - 3 freq
stv - 12 freq
stife - 1 freq
€œstop - 3 freq
staffa - 2 freq
stap-fu - 1 freq
stoiff - 2 freq
€œstap - 7 freq
€˜stop - 1 freq
setup - 1 freq
stope - 4 freq
€˜stevie - 5 freq
€˜stevo - 1 freq
€˜steve - 2 freq
€˜stoap - 1 freq
stubby - 2 freq
€œsteep - 1 freq
scotweb - 1 freq
stiffy - 1 freq
step' - 1 freq
stobbie - 1 freq
stuffÂ’ - 1 freq
steff - 1 freq
shud've - 1 freq
steph - 2 freq
sydb - 1 freq
MetaPhone code - STB
stab - 12 freq
stob - 7 freq
stub - 2 freq
stubby - 2 freq
stobbie - 1 freq
citbo - 1 freq
sydb - 1 freq
STUB
Time to execute Levenshtein function - 0.206184 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.406157 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.062997 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.040353 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001090 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.