A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie


Similar words to bath in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
Levenshtein
bath (0) - 98 freq
beath (1) - 3 freq
both (1) - 198 freq
batch (1) - 13 freq
dath (1) - 1 freq
beth (1) - 26 freq
hath (1) - 3 freq
kath (1) - 3 freq
path (1) - 178 freq
bah (1) - 4 freq
bats (1) - 34 freq
bate (1) - 66 freq
vath (1) - 1 freq
baths (1) - 35 freq
baty (1) - 1 freq
brath (1) - 1 freq
math (1) - 6 freq
boath (1) - 2 freq
bat (1) - 50 freq
bach (1) - 2 freq
bash (1) - 20 freq
bathe (1) - 7 freq
bathy (1) - 2 freq
cath (1) - 4 freq
baith (1) - 1601 freq
Double Levenshtein
bath (0) - 98 freq
boath (1) - 2 freq
byth (1) - 1 freq
bathe (1) - 7 freq
beth (1) - 26 freq
baith (1) - 1601 freq
bathy (1) - 2 freq
beath (1) - 3 freq
both (1) - 198 freq
bawth (2) - 1 freq
batt (2) - 1 freq
cath (2) - 4 freq
oath (2) - 13 freq
bathey (2) - 1 freq
baithe (2) - 3 freq
beatha (2) - 5 freq
booth (2) - 10 freq
beith (2) - 5 freq
bathie (2) - 2 freq
buith (2) - 3 freq
bythe (2) - 1 freq
bothy (2) - 49 freq
kath (2) - 3 freq
path (2) - 178 freq
batch (2) - 13 freq
SoundEx code - B300
but - 13379 freq
bite - 158 freq
bit - 7597 freq
'but - 301 freq
bide - 830 freq
bad - 949 freq
bed - 930 freq
boot - 112 freq
body - 755 freq
bade - 237 freq
bate - 66 freq
boat - 354 freq
bodie - 443 freq
boyhood - 6 freq
bath - 98 freq
bothy - 49 freq
baith - 1601 freq
bittie - 555 freq
baud - 12 freq
bitty - 57 freq
-bit - 1 freq
bat - 50 freq
beauty - 128 freq
bet - 158 freq
bowed - 38 freq
beat - 212 freq
boady - 94 freq
beheid - 3 freq
by-the - 2 freq
bid - 77 freq
buid - 6 freq
byde - 62 freq
bait - 32 freq
buddie - 28 freq
bawdie - 2 freq
'baith - 4 freq
bead - 7 freq
both - 198 freq
'bad - 6 freq
bedae - 1 freq
boued - 22 freq
behaud - 9 freq
'bide - 5 freq
bud - 36 freq
beaut - 5 freq
but' - 1 freq
ba-heid - 1 freq
--but - 1 freq
buit - 32 freq
bawtie - 2 freq
boddie - 2 freq
biddy - 12 freq
'bit - 24 freq
baid - 7 freq
b-a-t - 1 freq
bot - 437 freq
'bet - 2 freq
butt - 20 freq
bode - 6 freq
bothie - 9 freq
bout - 14 freq
beet - 18 freq
betty - 82 freq
baddie - 2 freq
beatha - 5 freq
buddy - 56 freq
buy't - 3 freq
boo't - 2 freq
baa't - 1 freq
b-but - 3 freq
baet - 27 freq
beady - 14 freq
beatty - 1 freq
beth - 26 freq
bootie - 1 freq
bod - 2 freq
bute - 13 freq
boatie - 124 freq
beautie - 3 freq
bawheid - 6 freq
boaty - 7 freq
beyd - 10 freq
beit - 13 freq
bathie - 2 freq
baty - 1 freq
bita - 1 freq
bodee - 2 freq
buty - 3 freq
bathey - 1 freq
bathy - 2 freq
bayd - 1 freq
baad - 20 freq
'b-but - 1 freq
bow'd - 1 freq
buddha - 16 freq
biood - 1 freq
boattie - 1 freq
'but' - 2 freq
beddie - 3 freq
booed - 40 freq
bathe - 7 freq
'behaud - 1 freq
bowtie - 1 freq
boadie - 48 freq
boyd - 13 freq
be'd - 1 freq
be't - 3 freq
bitta - 5 freq
bitoa - 2 freq
bad' - 4 freq
bawd - 8 freq
by-th - 1 freq
boath - 2 freq
-but - 2 freq
'-but - 1 freq
bae-the-wye - 1 freq
baeheid - 1 freq
bittae - 2 freq
bete - 4 freq
bett - 4 freq
bwat - 4 freq
bow-tie - 1 freq
bette - 3 freq
bt - 8 freq
bood - 2 freq
bóat - 5 freq
béat - 1 freq
bïd - 10 freq
bït - 7 freq
beed - 22 freq
'boot - 2 freq
boy'd - 1 freq
baw-heid - 1 freq
bowt - 23 freq
buddoo - 1 freq
boo'd - 1 freq
'beauty - 1 freq
baaed - 1 freq
bouet - 2 freq
bewtie - 4 freq
boit - 11 freq
boitie - 2 freq
buddo - 3 freq
but-the - 1 freq
bott - 3 freq
body- - 1 freq
bawed - 2 freq
booth - 10 freq
bettie - 50 freq
'bettie - 4 freq
'bittie - 1 freq
bit- - 1 freq
bythe - 1 freq
boddy - 7 freq
bedd - 56 freq
bawth - 1 freq
byd - 1 freq
biød - 1 freq
baed - 3 freq
byt - 4 freq
bød - 3 freq
bøt - 2 freq
buidhe - 1 freq
bou'd - 5 freq
bude - 14 freq
bowhead - 2 freq
boadi - 1 freq
bodi - 1 freq
'boady - 1 freq
böddie - 1 freq
baatie - 1 freq
but- - 2 freq
byte - 1 freq
boud - 1 freq
batt - 1 freq
badd - 1 freq
'bout - 2 freq
byth - 1 freq
beattie - 6 freq
boat' - 1 freq
bide-awee - 1 freq
biddie - 1 freq
buidy - 1 freq
‘bide - 1 freq
“bit - 41 freq
beta - 3 freq
‘beta - 1 freq
‘but - 32 freq
–but - 1 freq
büde - 2 freq
…but - 5 freq
“bot - 5 freq
bawd' - 1 freq
body' - 1 freq
bowdie - 1 freq
buith - 3 freq
b-b-b-but - 1 freq
“but - 38 freq
bede - 1 freq
btw - 110 freq
‘bit - 6 freq
ba'ht - 1 freq
batty - 2 freq
bidy - 1 freq
“buddo - 1 freq
‘bad - 3 freq
“bad - 2 freq
‘baet - 1 freq
bed-o - 1 freq
‘buddha - 1 freq
beath - 3 freq
'baith' - 1 freq
’budy - 2 freq
…bed - 1 freq
b-day - 1 freq
“bet - 1 freq
biit - 1 freq
’body - 1 freq
bittiie - 1 freq
bade--- - 1 freq
beid - 3 freq
“bide - 2 freq
båt - 1 freq
bitt - 1 freq
beat- - 2 freq
“baith - 1 freq
'bad' - 1 freq
“bath - 1 freq
‘bayth - 1 freq
baithe - 3 freq
budy - 1 freq
‘bot - 1 freq
“bed - 1 freq
beud - 1 freq
’but - 9 freq
buttie - 2 freq
bd - 6 freq
btay - 1 freq
butty - 2 freq
bidey - 3 freq
bidie - 3 freq
bday - 1 freq
beith - 5 freq
buddh - 1 freq
b'day - 2 freq
btdy - 1 freq
boyata - 1 freq
booty - 1 freq
baddy - 1 freq
bitey - 1 freq
“bit - 1 freq
buddy” - 1 freq
buaidh - 1 freq
“but - 1 freq
bbt - 1 freq
boyywood - 1 freq
bodwhu - 1 freq
bewty - 5 freq
bewty- - 1 freq
byddi - 3 freq
beatha” - 1 freq
bttaweiiaw - 1 freq
MetaPhone code - B0
bath - 98 freq
bothy - 49 freq
baith - 1601 freq
by-the - 2 freq
'baith - 4 freq
both - 198 freq
bothie - 9 freq
beatha - 5 freq
beth - 26 freq
bathie - 2 freq
bathey - 1 freq
bathy - 2 freq
bathe - 7 freq
by-th - 1 freq
boath - 2 freq
booth - 10 freq
bythe - 1 freq
bawth - 1 freq
byth - 1 freq
buith - 3 freq
beath - 3 freq
'baith' - 1 freq
“baith - 1 freq
“bath - 1 freq
‘bayth - 1 freq
baithe - 3 freq
beith - 5 freq
beatha” - 1 freq
Manually curated - BATH
Time to execute Levenshtein function - 0.288366 milliseconds
The Levenshtein distance is the minimum number of single-character replacements, insertions or deletions needed to transform one word into another. It is useful for detecting typos and alternative spellings.
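As a rough illustration of the definition above, here is a minimal Python sketch of the classic dynamic-programming calculation. It is not the code this site runs; the function name `levenshtein` is chosen here for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits that turn a into b."""
    # previous[j] = distance between the prefix of a processed so far
    # (before the current character) and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning i characters of a into "" takes i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb, or keep it
            ))
        previous = current
    return previous[-1]


# Words taken from the listing for "bath".
for word in ["bath", "baith", "bat", "both"]:
    print(word, levenshtein("bath", word))
# bath 0
# baith 1
# bat 1
# both 1
```

The scores agree with the numbers in brackets in the Levenshtein listing.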
Time to execute Double Levenshtein function - 0.708391 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
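One reading of the description above, sketched in Python. This is an illustration rather than the site's own code. The names `strip_vowels` and `double_levenshtein` are made up for the example, and treating y as a vowel is an assumption inferred from the listing, where byth scores 1.

```python
VOWELS = set("aeiouy")  # assumption: y is treated as a vowel


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance (row-by-row dynamic programming)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop every vowel, keeping only the consonant skeleton."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the vowel-less words."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


for word in ["baith", "byth", "cath", "bothy"]:
    print(word, double_levenshtein("bath", word))
```

Under these assumptions baith and byth score 1 (only vowels differ), while cath and bothy score 2, in line with the Double Levenshtein listing. A changed consonant is counted in both passes, which is where the double weight comes from.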
Time to execute SoundEx function - 0.065709 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
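For illustration, a compact Python sketch of the standard American Soundex rules (keep the first letter, code the following consonants as digits, pad to four characters). Whether the site's function handles edge cases in exactly this way is an assumption; the name `soundex` and the choice to ignore non a-z characters are choices made for this example.

```python
# Digit assigned to each coded consonant; vowels, h, w and y have no digit.
CODES = {}
for letters, digit in [("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")]:
    for ch in letters:
        CODES[ch] = digit


def soundex(word: str) -> str:
    """Return the four-character Soundex code, or "" if no a-z letters."""
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    first = letters[0]
    digits = []
    previous = CODES.get(first, "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters with the same code
        code = CODES.get(ch, "")
        if code and code != previous:
            digits.append(code)
        previous = code  # a vowel resets this, so a repeated code counts again
    return (first.upper() + "".join(digits) + "000")[:4]


for word in ["bath", "baith", "but", "body"]:
    print(word, soundex(word))
```

All four words come out as B300, the code shown for bath, which is why such a long and varied list shares one Soundex bucket.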
Time to execute MetaPhone function - 0.043965 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar. In the code B0 above, the 0 stands for the "th" sound.
Time to execute Manually curated function - 0.001084 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
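A lookup of this kind can be pictured as a plain dictionary. The sketch below is only an illustration: the entries in `LEXICON` are hypothetical examples invented here, not rows from the site's real lexicon, and `lemma_for` is a made-up helper name.

```python
from typing import Optional

# Hypothetical entries, invented for illustration only.
LEXICON = {
    "bath": "BATH",
    "baths": "BATH",
    "bathe": "BATHE",
}


def lemma_for(word: str) -> Optional[str]:
    """Return the curated lemma, or None when the word is not covered."""
    return LEXICON.get(word.lower())


print(lemma_for("baths"))
print(lemma_for("sonsie"))
```

A dictionary lookup does no string comparison against the rest of the vocabulary, which fits with this being the fastest of the five timings. The price is coverage: a word missing from the table simply returns nothing.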