A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to baw in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
baw (0) - 132 freq
taw (1) - 10 freq
waw (1) - 155 freq
gaw (1) - 5 freq
maw (1) - 392 freq
baa (1) - 74 freq
aw (1) - 8237 freq
bew (1) - 7 freq
baq (1) - 1 freq
paw (1) - 47 freq
baw' (1) - 1 freq
caw (1) - 192 freq
bal (1) - 6 freq
bas (1) - 9 freq
ba' (1) - 8 freq
bawl (1) - 4 freq
braw (1) - 1237 freq
blaw (1) - 200 freq
bmw (1) - 4 freq
bw (1) - 3 freq
bar (1) - 493 freq
saw (1) - 1038 freq
bay (1) - 82 freq
bnw (1) - 1 freq
bac (1) - 11 freq
baw (0) - 132 freq
bew (1) - 7 freq
bw (1) - 3 freq
bow (1) - 74 freq
bawd (2) - 8 freq
bak (2) - 376 freq
baf (2) - 1 freq
bowe (2) - 3 freq
ba (2) - 140 freq
haw (2) - 106 freq
law (2) - 295 freq
bae (2) - 855 freq
iaw (2) - 1 freq
bah (2) - 4 freq
'aw (2) - 69 freq
daw (2) - 20 freq
jaw (2) - 58 freq
bai (2) - 10 freq
ban (2) - 48 freq
bat (2) - 50 freq
abow (2) - 1 freq
naw (2) - 909 freq
biow (2) - 1 freq
btw (2) - 110 freq
baws (2) - 93 freq
SoundEx code - B000
by - 4600 freq
be - 15063 freq
boy - 530 freq
bei - 55 freq
bi - 3287 freq
bay - 82 freq
buy - 385 freq
baa - 74 freq
bee' - 6 freq
boay - 211 freq
baw - 132 freq
bh - 13 freq
bi- - 5 freq
bye - 115 freq
bou - 12 freq
byeee - 1 freq
bee - 51 freq
bow - 74 freq
bo - 18 freq
be' - 4 freq
bai - 10 freq
bowe - 3 freq
boa - 16 freq
bey - 24 freq
b- - 3 freq
'be - 13 freq
b - 751 freq
bhoy-' - 1 freq
biow - 1 freq
boyo - 46 freq
'bi - 4 freq
ba - 140 freq
buoy - 3 freq
b' - 196 freq
ba' - 8 freq
booooo - 1 freq
boo - 58 freq
bb - 9 freq
beuy - 64 freq
bae - 855 freq
bia - 5 freq
buiy - 5 freq
baiy - 1 freq
booee - 1 freq
bhoy - 9 freq
'by - 9 freq
bp - 9 freq
boohoo - 1 freq
boo-hoo - 2 freq
biueee - 1 freq
bowie - 16 freq
baia - 1 freq
'b' - 2 freq
beh - 18 freq
'bae - 1 freq
bae' - 1 freq
bae- - 1 freq
bi' - 3 freq
buyiy - 1 freq
buh - 3 freq
bah - 4 freq
bwaahahaha - 1 freq
'b - 1 freq
bie - 15 freq
'boy - 1 freq
boy' - 3 freq
'boy' - 3 freq
'be' - 2 freq
bu - 13 freq
by' - 6 freq
'buy' - 1 freq
bue - 1 freq
bew - 7 freq
'beuy' - 1 freq
'buey' - 1 freq
buey - 10 freq
'buey - 4 freq
buey' - 1 freq
'bye - 1 freq
bæ - 1 freq
b'wye - 1 freq
biy - 3 freq
by-e - 2 freq
by- - 3 freq
'by' - 2 freq
bî - 1 freq
be-e-ehh - 1 freq
b'aa - 1 freq
buyy - 1 freq
€œby - 2 freq
€˜bi - 1 freq
bí - 1 freq
béþ - 1 freq
bayou - 1 freq
boye - 1 freq
€¦be - 1 freq
€˜boa - 2 freq
€˜b - 1 freq
€˜by - 1 freq
€˜beuy - 1 freq
€œbe - 6 freq
€œbye - 1 freq
boiy - 4 freq
€˜bye - 1 freq
€˜boo - 1 freq
bayo - 1 freq
€œb - 1 freq
bio - 10 freq
bea - 2 freq
byue - 2 freq
bff - 1 freq
bwu - 1 freq
bf - 6 freq
bhe - 1 freq
bv - 2 freq
bba - 1 freq
bfhio - 1 freq
bw - 3 freq
boi - 3 freq
bbfa - 1 freq
byeeee - 1 freq
baw' - 1 freq
buaai - 1 freq
“by - 1 freq
bfu - 1 freq
beÂ’ - 1 freq
byÂ’ - 1 freq
beee - 3 freq
bhuy - 1 freq
biae - 1 freq
buhi - 1 freq
MetaPhone code - B
by - 4600 freq
be - 15063 freq
boy - 530 freq
bei - 55 freq
bi - 3287 freq
bay - 82 freq
buy - 385 freq
baa - 74 freq
bee' - 6 freq
boay - 211 freq
baw - 132 freq
bh - 13 freq
bi- - 5 freq
bou - 12 freq
bee - 51 freq
bow - 74 freq
bo - 18 freq
be' - 4 freq
bai - 10 freq
boa - 16 freq
bey - 24 freq
b- - 3 freq
'be - 13 freq
b - 751 freq
biow - 1 freq
'bi - 4 freq
ba - 140 freq
buoy - 3 freq
b' - 196 freq
ba' - 8 freq
booooo - 1 freq
boo - 58 freq
bb - 9 freq
beuy - 64 freq
bae - 855 freq
bia - 5 freq
buiy - 5 freq
baiy - 1 freq
booee - 1 freq
'by - 9 freq
bough - 5 freq
biueee - 1 freq
baia - 1 freq
hbo - 1 freq
'b' - 2 freq
beh - 18 freq
'bae - 1 freq
bae' - 1 freq
bae- - 1 freq
hb - 9 freq
bi' - 3 freq
buh - 3 freq
bah - 4 freq
'b - 1 freq
bie - 15 freq
'boy - 1 freq
boy' - 3 freq
'boy' - 3 freq
'be' - 2 freq
bu - 13 freq
by' - 6 freq
'buy' - 1 freq
bue - 1 freq
bew - 7 freq
'beuy' - 1 freq
'buey' - 1 freq
buey - 10 freq
'buey - 4 freq
buey' - 1 freq
bæ - 1 freq
biy - 3 freq
by-e - 2 freq
by- - 3 freq
'by' - 2 freq
bî - 1 freq
be-e-ehh - 1 freq
b'aa - 1 freq
buyy - 1 freq
€œby - 2 freq
€˜bi - 1 freq
bí - 1 freq
béþ - 1 freq
€¦be - 1 freq
€˜boa - 2 freq
€˜b - 1 freq
€˜by - 1 freq
€˜beuy - 1 freq
€œbe - 6 freq
boiy - 4 freq
€˜boo - 1 freq
€œb - 1 freq
bio - 10 freq
bea - 2 freq
bba - 1 freq
wb - 4 freq
bw - 3 freq
boi - 3 freq
yb - 3 freq
hbui - 1 freq
baw' - 1 freq
buaai - 1 freq
“by - 1 freq
ybi - 1 freq
hbu - 1 freq
beÂ’ - 1 freq
byÂ’ - 1 freq
beee - 3 freq
biae - 1 freq
wbeo - 1 freq
BAW
ba - 140 freq
ball - 62 freq
baa - 74 freq
baw - 132 freq
balls - 19 freq
baws - 93 freq
baas - 30 freq
Time to execute Levenshtein function - 0.173092 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.333154 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028402 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.038096 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000960 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.