A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to cbeath in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
cbeath (0) - 1 freq
beath (1) - 3 freq
mcbeath (1) - 1 freq
aneath (2) - 105 freq
breath (2) - 233 freq
beach (2) - 169 freq
neath (2) - 4 freq
beith (2) - 5 freq
wreath (2) - 14 freq
cheats (2) - 2 freq
create (2) - 50 freq
heath (2) - 2 freq
brath (2) - 1 freq
sheath (2) - 3 freq
'neath (2) - 2 freq
beats (2) - 30 freq
teath (2) - 1 freq
beth (2) - 26 freq
beat- (2) - 2 freq
cath (2) - 4 freq
death (2) - 168 freq
caeth (2) - 1 freq
bath (2) - 98 freq
beatha (2) - 5 freq
boath (2) - 2 freq
cbeath (0) - 1 freq
beath (2) - 3 freq
mcbeath (2) - 1 freq
caeth (3) - 1 freq
bath (3) - 98 freq
beith (3) - 5 freq
boath (3) - 2 freq
cath (3) - 4 freq
beth (3) - 26 freq
beatha (3) - 5 freq
cloath (3) - 1 freq
cloth (4) - 16 freq
clathe (4) - 1 freq
caputh (4) - 1 freq
i'bath (4) - 1 freq
clathy (4) - 1 freq
claith (4) - 59 freq
buith (4) - 3 freq
cbtk (4) - 1 freq
cometh (4) - 6 freq
'baith (4) - 4 freq
bathy (4) - 2 freq
cathy (4) - 112 freq
booth (4) - 10 freq
cbh (4) - 1 freq
SoundEx code - C130
cowpit - 35 freq
chappit - 45 freq
cowpt - 21 freq
cept - 25 freq
chuffed - 84 freq
chapped - 40 freq
coupt - 3 freq
copied - 13 freq
cowped - 48 freq
cupped - 9 freq
cooped - 7 freq
caved - 9 freq
chipped - 14 freq
caped - 2 freq
chippit - 7 freq
chappt - 3 freq
cuffed - 1 freq
chuffd - 2 freq
coft - 9 freq
cupid - 3 freq
'cept - 9 freq
coupit - 16 freq
covid - 73 freq
couped - 9 freq
chaaved - 3 freq
chaved - 1 freq
chuffit - 2 freq
chaft - 13 freq
chuff't - 3 freq
chappet - 2 freq
cheviot - 3 freq
chibbed - 3 freq
coppit - 3 freq
capita - 1 freq
choppt - 2 freq
coopit - 3 freq
covet - 3 freq
covid- - 10 freq
chipt - 1 freq
chopped - 6 freq
chippid - 1 freq
cheepit - 3 freq
cuppid - 1 freq
caveat - 1 freq
cavity - 2 freq
cop-oot - 1 freq
coftee - 1 freq
ciabatta - 1 freq
cave-heid - 2 freq
chaffed - 2 freq
chaift - 1 freq
chuft - 2 freq
cofft - 1 freq
cuppit - 3 freq
chap't - 1 freq
co'peth - 2 freq
cappit - 3 freq
cowboy-hat - 1 freq
cpd - 3 freq
€œcept - 1 freq
caputh - 1 freq
chapit - 1 freq
chafft - 1 freq
chafed - 1 freq
choped - 2 freq
coped - 2 freq
chaffit - 1 freq
chauvit - 1 freq
chippt - 1 freq
chufft - 2 freq
cvht - 1 freq
cbd - 1 freq
coovid - 1 freq
cowpat - 1 freq
cvda - 1 freq
cpt - 1 freq
cbeath - 1 freq
covid” - 1 freq
chuffedÂ… - 1 freq
MetaPhone code - KB0
cbeath - 1 freq
CBEATH
Time to execute Levenshtein function - 0.234274 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.438227 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.036517 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037276 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000823 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.