A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to baith in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
baith (0) - 1582 freq
daith (1) - 286 freq
aith (1) - 23 freq
baithe (1) - 3 freq
bawth (1) - 1 freq
beith (1) - 5 freq
faith (1) - 161 freq
braith (1) - 215 freq
saith (1) - 2 freq
brith (1) - 4 freq
baits (1) - 3 freq
buith (1) - 3 freq
waith (1) - 2 freq
laith (1) - 25 freq
bairth (1) - 1 freq
'baith (1) - 4 freq
raith (1) - 9 freq
paith (1) - 4 freq
bait (1) - 32 freq
bath (1) - 98 freq
claith (2) - 59 freq
beit (2) - 13 freq
beath (2) - 3 freq
baird (2) - 30 freq
daeth (2) - 17 freq
baith (0) - 1582 freq
buith (1) - 3 freq
beith (1) - 5 freq
bath (1) - 98 freq
baithe (1) - 3 freq
beth (2) - 26 freq
beath (2) - 3 freq
bait (2) - 32 freq
daith (2) - 286 freq
bawth (2) - 1 freq
both (2) - 195 freq
bathe (2) - 7 freq
bathy (2) - 2 freq
boath (2) - 2 freq
paith (2) - 4 freq
booth (2) - 10 freq
byth (2) - 1 freq
raith (2) - 9 freq
waith (2) - 2 freq
aith (2) - 23 freq
baits (2) - 3 freq
brith (2) - 4 freq
braith (2) - 215 freq
saith (2) - 2 freq
'baith (2) - 4 freq
SoundEx code - B300
but - 13122 freq
bite - 156 freq
bit - 7493 freq
'but - 298 freq
bide - 826 freq
bad - 925 freq
bed - 908 freq
boot - 111 freq
body - 752 freq
bade - 234 freq
bate - 66 freq
boat - 350 freq
bodie - 440 freq
boyhood - 6 freq
bath - 98 freq
bothy - 48 freq
baith - 1582 freq
bittie - 551 freq
baud - 12 freq
bitty - 57 freq
-bit - 1 freq
bat - 50 freq
beauty - 125 freq
bet - 155 freq
bowed - 37 freq
beat - 207 freq
boady - 84 freq
beheid - 3 freq
by-the - 2 freq
bid - 76 freq
buid - 6 freq
byde - 62 freq
bait - 32 freq
buddie - 28 freq
bawdie - 2 freq
'baith - 4 freq
bead - 7 freq
both - 195 freq
'bad - 6 freq
bedae - 1 freq
boued - 22 freq
behaud - 9 freq
'bide - 5 freq
bud - 36 freq
beaut - 5 freq
but' - 1 freq
ba-heid - 1 freq
--but - 1 freq
buit - 32 freq
bawtie - 2 freq
boddie - 2 freq
biddy - 12 freq
'bit - 24 freq
baid - 7 freq
b-a-t - 1 freq
bot - 437 freq
'bet - 2 freq
butt - 20 freq
bode - 6 freq
bothie - 9 freq
bout - 14 freq
beet - 18 freq
betty - 82 freq
baddie - 2 freq
beatha - 5 freq
buddy - 56 freq
buy't - 3 freq
boo't - 2 freq
baa't - 1 freq
b-but - 3 freq
baet - 27 freq
beady - 14 freq
beatty - 1 freq
beth - 26 freq
bootie - 1 freq
bod - 2 freq
bute - 13 freq
boatie - 124 freq
beautie - 3 freq
bawheid - 6 freq
boaty - 7 freq
beyd - 10 freq
beit - 13 freq
bathie - 2 freq
baty - 1 freq
bita - 1 freq
bodee - 2 freq
buty - 3 freq
bathey - 1 freq
bathy - 2 freq
bayd - 1 freq
baad - 20 freq
'b-but - 1 freq
bow'd - 1 freq
booed - 40 freq
bathe - 7 freq
'behaud - 1 freq
bowtie - 1 freq
boadie - 48 freq
boyd - 13 freq
be'd - 1 freq
be't - 3 freq
bitta - 5 freq
bitoa - 2 freq
bad' - 4 freq
bawd - 8 freq
by-th - 1 freq
boath - 2 freq
-but - 2 freq
'-but - 1 freq
bae-the-wye - 1 freq
baeheid - 1 freq
bittae - 2 freq
bete - 4 freq
bett - 4 freq
buddha - 15 freq
bwat - 4 freq
bow-tie - 1 freq
bette - 3 freq
bt - 8 freq
bood - 2 freq
bóat - 5 freq
béat - 1 freq
bïd - 10 freq
bït - 7 freq
beed - 22 freq
'boot - 2 freq
boy'd - 1 freq
baw-heid - 1 freq
bowt - 23 freq
buddoo - 1 freq
boo'd - 1 freq
'beauty - 1 freq
baaed - 1 freq
bouet - 2 freq
bewtie - 4 freq
boit - 11 freq
boitie - 2 freq
buddo - 3 freq
but-the - 1 freq
bott - 3 freq
body- - 1 freq
bawed - 2 freq
booth - 10 freq
bettie - 50 freq
'bettie - 4 freq
'bittie - 1 freq
bit- - 1 freq
bythe - 1 freq
boddy - 7 freq
bedd - 56 freq
bawth - 1 freq
byd - 1 freq
biød - 1 freq
baed - 3 freq
byt - 4 freq
bød - 3 freq
bøt - 2 freq
buidhe - 1 freq
bou'd - 5 freq
bude - 14 freq
bowhead - 2 freq
boadi - 1 freq
bodi - 1 freq
'boady - 1 freq
böddie - 1 freq
baatie - 1 freq
but- - 2 freq
byte - 1 freq
boud - 1 freq
batt - 1 freq
badd - 1 freq
'bout - 2 freq
byth - 1 freq
beattie - 6 freq
boat' - 1 freq
bide-awee - 1 freq
biddie - 1 freq
buidy - 1 freq
€˜bide - 1 freq
€œbit - 41 freq
beta - 3 freq
€˜beta - 1 freq
€˜but - 32 freq
€“but - 1 freq
büde - 2 freq
€¦but - 5 freq
€œbot - 5 freq
bawd' - 1 freq
body' - 1 freq
beddie - 2 freq
bowdie - 1 freq
buith - 3 freq
b-b-b-but - 1 freq
€œbut - 38 freq
bede - 1 freq
btw - 110 freq
€˜bit - 6 freq
ba'ht - 1 freq
batty - 2 freq
bidy - 1 freq
€œbuddo - 1 freq
€˜bad - 3 freq
€œbad - 2 freq
€˜baet - 1 freq
bed-o - 1 freq
€˜buddha - 1 freq
beath - 3 freq
'baith' - 1 freq
€™budy - 2 freq
€¦bed - 1 freq
b-day - 1 freq
€œbet - 1 freq
biit - 1 freq
€™body - 1 freq
bittiie - 1 freq
bade--- - 1 freq
beid - 3 freq
€œbide - 2 freq
båt - 1 freq
bitt - 1 freq
beat- - 2 freq
€œbaith - 1 freq
'bad' - 1 freq
€œbath - 1 freq
€˜bayth - 1 freq
baithe - 3 freq
budy - 1 freq
€˜bot - 1 freq
€œbed - 1 freq
beud - 1 freq
€™but - 9 freq
buttie - 2 freq
bd - 6 freq
btay - 1 freq
butty - 2 freq
bidey - 3 freq
bidie - 3 freq
bday - 1 freq
beith - 5 freq
buddh - 1 freq
b'day - 2 freq
btdy - 1 freq
boyata - 1 freq
booty - 1 freq
baddy - 1 freq
bitey - 1 freq
“bit - 1 freq
buddy” - 1 freq
buaidh - 1 freq
“but - 1 freq
bbt - 1 freq
'but' - 1 freq
boyywood - 1 freq
bodwhu - 1 freq
bewty - 5 freq
bewty- - 1 freq
byddi - 3 freq
beatha” - 1 freq
bttaweiiaw - 1 freq
MetaPhone code - B0
bath - 98 freq
bothy - 48 freq
baith - 1582 freq
by-the - 2 freq
'baith - 4 freq
both - 195 freq
bothie - 9 freq
beatha - 5 freq
beth - 26 freq
bathie - 2 freq
bathey - 1 freq
bathy - 2 freq
bathe - 7 freq
by-th - 1 freq
boath - 2 freq
booth - 10 freq
bythe - 1 freq
bawth - 1 freq
byth - 1 freq
buith - 3 freq
beath - 3 freq
'baith' - 1 freq
€œbaith - 1 freq
€œbath - 1 freq
€˜bayth - 1 freq
baithe - 3 freq
beith - 5 freq
beatha” - 1 freq
BAITH
baith - 1582 freq
both - 195 freq
Time to execute Levenshtein function - 0.642361 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 1.138945 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.071279 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.127898 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000978 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.