A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to sub in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
sub (0) - 9 freq
sun (1) - 706 freq
suy (1) - 1 freq
qub (1) - 4 freq
stub (1) - 1 freq
hub (1) - 31 freq
sur (1) - 22 freq
snb (1) - 2 freq
sue (1) - 12 freq
gub (1) - 30 freq
subj (1) - 1 freq
dub (1) - 27 freq
sud (1) - 92 freq
fub (1) - 2 freq
sb (1) - 4 freq
tub (1) - 22 freq
nub (1) - 5 freq
sab (1) - 9 freq
sus (1) - 1 freq
sib (1) - 105 freq
pub (1) - 228 freq
sjb (1) - 1 freq
ub (1) - 3 freq
suk (1) - 2 freq
sut (1) - 161 freq
sub (0) - 9 freq
sb (1) - 4 freq
sib (1) - 105 freq
sab (1) - 9 freq
sob (1) - 7 freq
sut (2) - 161 freq
suk (2) - 2 freq
ub (2) - 3 freq
pub (2) - 228 freq
sjb (2) - 1 freq
subs (2) - 5 freq
sum (2) - 416 freq
cub (2) - 3 freq
usb (2) - 2 freq
wub (2) - 1 freq
rub (2) - 50 freq
sua (2) - 1 freq
su (2) - 8 freq
snub (2) - 2 freq
sup (2) - 53 freq
hub (2) - 31 freq
sur (2) - 22 freq
snb (2) - 2 freq
stub (2) - 1 freq
sun (2) - 706 freq
SoundEx code - S100
sheep - 295 freq
shape - 161 freq
sauf - 71 freq
safe - 280 freq
sepo - 34 freq
sib - 105 freq
soup - 241 freq
shoppie - 50 freq
swap - 17 freq
sob - 7 freq
shoap - 91 freq
s'up - 1 freq
soup' - 6 freq
soo-oop - 29 freq
save - 221 freq
sowf - 3 freq
sype - 4 freq
scuff - 4 freq
subway - 5 freq
soop - 17 freq
skivvy - 7 freq
sappy - 19 freq
sweep - 31 freq
scoop - 16 freq
spy - 35 freq
sofa - 33 freq
sip - 23 freq
shove - 34 freq
scabby - 14 freq
suppie - 24 freq
ship - 186 freq
syboe - 1 freq
sophie - 11 freq
shop - 369 freq
sub - 9 freq
sup - 53 freq
shuve - 2 freq
shap - 26 freq
shiv - 3 freq
sab - 9 freq
skip - 25 freq
swep - 6 freq
scobie - 16 freq
'scobie - 2 freq
skibo - 1 freq
scab - 12 freq
sappie - 12 freq
scabbie - 2 freq
soapy - 7 freq
sci-fi - 1 freq
scaffie - 9 freq
shabby - 7 freq
spew - 21 freq
seep - 5 freq
shoppe - 2 freq
sofie - 1 freq
skiff - 12 freq
skive - 5 freq
shp - 1 freq
scoobie - 5 freq
scuffy - 1 freq
swipe - 12 freq
soap - 45 freq
sef - 1 freq
squib - 2 freq
sif - 4 freq
sheaf - 4 freq
spie - 3 freq
sap - 10 freq
sowp - 3 freq
sufi - 1 freq
saif - 4 freq
seaview - 7 freq
seeve - 3 freq
shaip - 1 freq
shuv - 7 freq
saive - 1 freq
shef - 1 freq
soopaa - 2 freq
shayp - 1 freq
shave - 18 freq
spae - 5 freq
squeef - 1 freq
spee - 1 freq
spa - 6 freq
sp - 8 freq
sepia - 2 freq
scooby - 19 freq
'save - 6 freq
scaup - 5 freq
'safe - 1 freq
scuffie - 4 freq
scoof - 1 freq
soep - 1 freq
scoff - 12 freq
scope - 12 freq
speh - 1 freq
spey - 4 freq
shaef - 5 freq
sope - 3 freq
sheep' - 4 freq
sop - 4 freq
scuba - 1 freq
sovvy - 4 freq
safeway - 3 freq
sowvy - 1 freq
seef - 5 freq
seef' - 1 freq
safe' - 1 freq
scap - 2 freq
scubby - 2 freq
'scab' - 1 freq
sappho - 8 freq
sheba - 3 freq
'seaview' - 1 freq
saip - 11 freq
shopfu - 1 freq
shaav - 4 freq
scowff - 2 freq
suive - 1 freq
shaav' - 1 freq
sieve - 6 freq
see-foo - 1 freq
'shop' - 2 freq
shop' - 2 freq
scapa - 10 freq
schip - 1 freq
'skivvy - 1 freq
skype - 5 freq
shaep - 5 freq
swappy - 1 freq
swoop - 3 freq
skiba - 1 freq
shaiff - 3 freq
shoop - 1 freq
skoof - 2 freq
sheepy - 2 freq
savoy - 5 freq
shehp - 1 freq
scaff - 2 freq
swaabie - 1 freq
shæp - 1 freq
swab - 1 freq
skouf - 1 freq
soo--oop - 1 freq
'shabby' - 1 freq
sv - 6 freq
scuf - 1 freq
sauve - 1 freq
shavie - 1 freq
skyiff - 1 freq
sopp - 1 freq
scowp - 6 freq
supp - 2 freq
€˜spey - 1 freq
sapie - 1 freq
sév - 1 freq
sofae - 3 freq
shapp - 1 freq
€œshiv - 1 freq
skiffie - 6 freq
skippy - 2 freq
shaif - 5 freq
skiffy - 3 freq
'sup - 1 freq
suppy - 1 freq
schop - 1 freq
skabbie - 1 freq
swype - 4 freq
sheip - 1 freq
schep - 2 freq
swoup - 1 freq
scqf - 1 freq
shee-eep - 1 freq
€˜shoap - 2 freq
€˜shop - 1 freq
sibb - 2 freq
scheip - 1 freq
€œsafe - 1 freq
skep - 1 freq
spaee - 1 freq
soupy - 1 freq
ssp - 8 freq
shippy - 1 freq
shippie - 1 freq
skaffie - 1 freq
squeeb - 1 freq
souve - 1 freq
shope - 1 freq
skaffy - 1 freq
€œsup - 1 freq
shopppie - 1 freq
shive - 1 freq
sheepie - 1 freq
sfp - 4 freq
sfa - 28 freq
sssba - 1 freq
€˜sapho - 2 freq
sapho - 1 freq
skave - 4 freq
skew-wheef - 2 freq
swaab - 2 freq
skepp - 2 freq
skew-whiff - 1 freq
sibh - 1 freq
subwey - 1 freq
sowff - 2 freq
sf - 6 freq
spewy - 1 freq
sb - 4 freq
svph - 1 freq
savvy - 2 freq
swufbb - 1 freq
spÂ’y - 1 freq
seaf - 1 freq
schofe - 6 freq
sjb - 1 freq
ssoap - 1 freq
sub-po - 1 freq
'scoff' - 1 freq
spo - 1 freq
spi - 1 freq
shyv - 1 freq
scape - 1 freq
'scabby' - 2 freq
ship' - 1 freq
shep - 5 freq
szf - 1 freq
shfui - 1 freq
spf - 1 freq
svbhi - 1 freq
szjjp - 1 freq
spu - 1 freq
swf - 1 freq
sophia - 2 freq
saf - 1 freq
sgqkp - 1 freq
spow - 1 freq
scifi - 1 freq
sjp - 1 freq
supw - 1 freq
MetaPhone code - SB
sib - 105 freq
sob - 7 freq
syboe - 1 freq
sub - 9 freq
sab - 9 freq
sibb - 2 freq
sssba - 1 freq
sibh - 1 freq
zhbh - 1 freq
xhb - 1 freq
zbu - 1 freq
sb - 4 freq
xb - 3 freq
zob - 1 freq
zb - 3 freq
xbo - 1 freq
xeb - 1 freq
SUB
Time to execute Levenshtein function - 0.168567 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.318958 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027962 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.036729 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000812 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.