A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to both in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
both (0) - 198 freq
doth (1) - 9 freq
bott (1) - 3 freq
goth (1) - 10 freq
booth (1) - 10 freq
broth (1) - 70 freq
beth (1) - 26 freq
borth (1) - 73 freq
bots (1) - 4 freq
boath (1) - 2 freq
byth (1) - 1 freq
moth (1) - 7 freq
roth (1) - 2 freq
bath (1) - 98 freq
bothy (1) - 49 freq
noth (1) - 5 freq
bot (1) - 437 freq
woth (1) - 1 freq
borh (1) - 1 freq
botch (1) - 1 freq
boyz (2) - 11 freq
boke (2) - 31 freq
cot (2) - 44 freq
boom (2) - 30 freq
boab (2) - 53 freq
both (0) - 198 freq
beth (1) - 26 freq
boath (1) - 2 freq
bath (1) - 98 freq
booth (1) - 10 freq
byth (1) - 1 freq
bothy (1) - 49 freq
borh (2) - 1 freq
bythe (2) - 1 freq
botch (2) - 1 freq
beith (2) - 5 freq
bothie (2) - 9 freq
bathy (2) - 2 freq
bathe (2) - 7 freq
beath (2) - 3 freq
baith (2) - 1601 freq
woth (2) - 1 freq
buith (2) - 3 freq
goth (2) - 10 freq
bott (2) - 3 freq
roth (2) - 2 freq
broth (2) - 70 freq
moth (2) - 7 freq
doth (2) - 9 freq
bots (2) - 4 freq
SoundEx code - B300
but - 13379 freq
bite - 158 freq
bit - 7597 freq
'but - 301 freq
bide - 830 freq
bad - 949 freq
bed - 930 freq
boot - 112 freq
body - 755 freq
bade - 237 freq
bate - 66 freq
boat - 354 freq
bodie - 443 freq
boyhood - 6 freq
bath - 98 freq
bothy - 49 freq
baith - 1601 freq
bittie - 555 freq
baud - 12 freq
bitty - 57 freq
-bit - 1 freq
bat - 50 freq
beauty - 128 freq
bet - 158 freq
bowed - 38 freq
beat - 212 freq
boady - 94 freq
beheid - 3 freq
by-the - 2 freq
bid - 77 freq
buid - 6 freq
byde - 62 freq
bait - 32 freq
buddie - 28 freq
bawdie - 2 freq
'baith - 4 freq
bead - 7 freq
both - 198 freq
'bad - 6 freq
bedae - 1 freq
boued - 22 freq
behaud - 9 freq
'bide - 5 freq
bud - 36 freq
beaut - 5 freq
but' - 1 freq
ba-heid - 1 freq
--but - 1 freq
buit - 32 freq
bawtie - 2 freq
boddie - 2 freq
biddy - 12 freq
'bit - 24 freq
baid - 7 freq
b-a-t - 1 freq
bot - 437 freq
'bet - 2 freq
butt - 20 freq
bode - 6 freq
bothie - 9 freq
bout - 14 freq
beet - 18 freq
betty - 82 freq
baddie - 2 freq
beatha - 5 freq
buddy - 56 freq
buy't - 3 freq
boo't - 2 freq
baa't - 1 freq
b-but - 3 freq
baet - 27 freq
beady - 14 freq
beatty - 1 freq
beth - 26 freq
bootie - 1 freq
bod - 2 freq
bute - 13 freq
boatie - 124 freq
beautie - 3 freq
bawheid - 6 freq
boaty - 7 freq
beyd - 10 freq
beit - 13 freq
bathie - 2 freq
baty - 1 freq
bita - 1 freq
bodee - 2 freq
buty - 3 freq
bathey - 1 freq
bathy - 2 freq
bayd - 1 freq
baad - 20 freq
'b-but - 1 freq
bow'd - 1 freq
buddha - 16 freq
biood - 1 freq
boattie - 1 freq
'but' - 2 freq
beddie - 3 freq
booed - 40 freq
bathe - 7 freq
'behaud - 1 freq
bowtie - 1 freq
boadie - 48 freq
boyd - 13 freq
be'd - 1 freq
be't - 3 freq
bitta - 5 freq
bitoa - 2 freq
bad' - 4 freq
bawd - 8 freq
by-th - 1 freq
boath - 2 freq
-but - 2 freq
'-but - 1 freq
bae-the-wye - 1 freq
baeheid - 1 freq
bittae - 2 freq
bete - 4 freq
bett - 4 freq
bwat - 4 freq
bow-tie - 1 freq
bette - 3 freq
bt - 8 freq
bood - 2 freq
bóat - 5 freq
béat - 1 freq
bïd - 10 freq
bït - 7 freq
beed - 22 freq
'boot - 2 freq
boy'd - 1 freq
baw-heid - 1 freq
bowt - 23 freq
buddoo - 1 freq
boo'd - 1 freq
'beauty - 1 freq
baaed - 1 freq
bouet - 2 freq
bewtie - 4 freq
boit - 11 freq
boitie - 2 freq
buddo - 3 freq
but-the - 1 freq
bott - 3 freq
body- - 1 freq
bawed - 2 freq
booth - 10 freq
bettie - 50 freq
'bettie - 4 freq
'bittie - 1 freq
bit- - 1 freq
bythe - 1 freq
boddy - 7 freq
bedd - 56 freq
bawth - 1 freq
byd - 1 freq
biød - 1 freq
baed - 3 freq
byt - 4 freq
bød - 3 freq
bøt - 2 freq
buidhe - 1 freq
bou'd - 5 freq
bude - 14 freq
bowhead - 2 freq
boadi - 1 freq
bodi - 1 freq
'boady - 1 freq
böddie - 1 freq
baatie - 1 freq
but- - 2 freq
byte - 1 freq
boud - 1 freq
batt - 1 freq
badd - 1 freq
'bout - 2 freq
byth - 1 freq
beattie - 6 freq
boat' - 1 freq
bide-awee - 1 freq
biddie - 1 freq
buidy - 1 freq
€˜bide - 1 freq
€œbit - 41 freq
beta - 3 freq
€˜beta - 1 freq
€˜but - 32 freq
€“but - 1 freq
büde - 2 freq
€¦but - 5 freq
€œbot - 5 freq
bawd' - 1 freq
body' - 1 freq
bowdie - 1 freq
buith - 3 freq
b-b-b-but - 1 freq
€œbut - 38 freq
bede - 1 freq
btw - 110 freq
€˜bit - 6 freq
ba'ht - 1 freq
batty - 2 freq
bidy - 1 freq
€œbuddo - 1 freq
€˜bad - 3 freq
€œbad - 2 freq
€˜baet - 1 freq
bed-o - 1 freq
€˜buddha - 1 freq
beath - 3 freq
'baith' - 1 freq
€™budy - 2 freq
€¦bed - 1 freq
b-day - 1 freq
€œbet - 1 freq
biit - 1 freq
€™body - 1 freq
bittiie - 1 freq
bade--- - 1 freq
beid - 3 freq
€œbide - 2 freq
båt - 1 freq
bitt - 1 freq
beat- - 2 freq
€œbaith - 1 freq
'bad' - 1 freq
€œbath - 1 freq
€˜bayth - 1 freq
baithe - 3 freq
budy - 1 freq
€˜bot - 1 freq
€œbed - 1 freq
beud - 1 freq
€™but - 9 freq
buttie - 2 freq
bd - 6 freq
btay - 1 freq
butty - 2 freq
bidey - 3 freq
bidie - 3 freq
bday - 1 freq
beith - 5 freq
buddh - 1 freq
b'day - 2 freq
btdy - 1 freq
boyata - 1 freq
booty - 1 freq
baddy - 1 freq
bitey - 1 freq
“bit - 1 freq
buddy” - 1 freq
buaidh - 1 freq
“but - 1 freq
bbt - 1 freq
boyywood - 1 freq
bodwhu - 1 freq
bewty - 5 freq
bewty- - 1 freq
byddi - 3 freq
beatha” - 1 freq
bttaweiiaw - 1 freq
MetaPhone code - B0
bath - 98 freq
bothy - 49 freq
baith - 1601 freq
by-the - 2 freq
'baith - 4 freq
both - 198 freq
bothie - 9 freq
beatha - 5 freq
beth - 26 freq
bathie - 2 freq
bathey - 1 freq
bathy - 2 freq
bathe - 7 freq
by-th - 1 freq
boath - 2 freq
booth - 10 freq
bythe - 1 freq
bawth - 1 freq
byth - 1 freq
buith - 3 freq
beath - 3 freq
'baith' - 1 freq
€œbaith - 1 freq
€œbath - 1 freq
€˜bayth - 1 freq
baithe - 3 freq
beith - 5 freq
beatha” - 1 freq
BOTH
baith - 1601 freq
both - 198 freq
Time to execute Levenshtein function - 0.264530 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.593924 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028510 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.079636 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001128 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.