A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to covid in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
covid (0) - 73 freq
couid (1) - 1 freq
coovid (1) - 1 freq
covid- (1) - 10 freq
covids (1) - 1 freq
corvid (1) - 1 freq
cowid (1) - 3 freq
ovid (1) - 1 freq
cuid (2) - 810 freq
copie (2) - 17 freq
coi (2) - 1 freq
codd (2) - 7 freq
ovoid (2) - 1 freq
jovi (2) - 1 freq
cosie (2) - 23 freq
coukd (2) - 2 freq
cooin (2) - 6 freq
could (2) - 2677 freq
cleid (2) - 10 freq
govit (2) - 1 freq
coard (2) - 3 freq
cokvi (2) - 1 freq
copied (2) - 13 freq
covvy (2) - 16 freq
govin (2) - 9 freq
covid (0) - 73 freq
coovid (1) - 1 freq
cowid (2) - 3 freq
caved (2) - 9 freq
corvid (2) - 1 freq
ovid (2) - 1 freq
covids (2) - 1 freq
covid- (2) - 10 freq
couid (2) - 1 freq
goved (3) - 2 freq
coled (3) - 1 freq
cod (3) - 20 freq
cowed (3) - 7 freq
coped (3) - 2 freq
hoved (3) - 4 freq
cavin (3) - 1 freq
moved (3) - 192 freq
cavie (3) - 2 freq
coud (3) - 152 freq
ovd (3) - 3 freq
comed (3) - 12 freq
coked (3) - 5 freq
cwid (3) - 99 freq
livid (3) - 4 freq
movd (3) - 1 freq
SoundEx code - C130
cowpit - 35 freq
chappit - 45 freq
cowpt - 21 freq
cept - 25 freq
chuffed - 85 freq
chapped - 40 freq
coupt - 3 freq
copied - 13 freq
cowped - 48 freq
cupped - 9 freq
cooped - 7 freq
caved - 9 freq
chipped - 14 freq
caped - 2 freq
chippit - 7 freq
chappt - 3 freq
cuffed - 1 freq
chuffd - 2 freq
coft - 9 freq
cupid - 3 freq
'cept - 9 freq
coupit - 16 freq
covid - 73 freq
couped - 9 freq
chaaved - 3 freq
chaved - 1 freq
chuffit - 2 freq
chaft - 13 freq
chuff't - 3 freq
chappet - 2 freq
cheviot - 3 freq
chopped - 7 freq
chibbed - 3 freq
coppit - 3 freq
capita - 1 freq
choppt - 2 freq
coopit - 3 freq
covet - 3 freq
covid- - 10 freq
chipt - 1 freq
chippid - 1 freq
cheepit - 3 freq
cuppid - 1 freq
caveat - 1 freq
cavity - 2 freq
cop-oot - 1 freq
coftee - 1 freq
ciabatta - 1 freq
cave-heid - 2 freq
chaffed - 2 freq
chaift - 1 freq
chuft - 2 freq
cofft - 1 freq
cuppit - 3 freq
chap't - 1 freq
co'peth - 2 freq
cappit - 3 freq
cowboy-hat - 1 freq
cpd - 3 freq
€œcept - 1 freq
caputh - 1 freq
chapit - 1 freq
chafft - 1 freq
chafed - 1 freq
choped - 2 freq
coped - 2 freq
chaffit - 1 freq
chauvit - 1 freq
chippt - 1 freq
chufft - 2 freq
cvht - 1 freq
cbd - 1 freq
coovid - 1 freq
cowpat - 1 freq
cvda - 1 freq
cpt - 1 freq
cbeath - 1 freq
covid” - 1 freq
chuffedÂ… - 1 freq
MetaPhone code - KFT
caught - 191 freq
goaved - 7 freq
caved - 9 freq
cuffed - 1 freq
coft - 9 freq
coughed - 15 freq
covid - 73 freq
'caught' - 1 freq
gïft - 1 freq
covet - 3 freq
guffed - 4 freq
covid- - 10 freq
gaffed - 13 freq
goved - 2 freq
caveat - 1 freq
cavity - 2 freq
coftee - 1 freq
cofft - 1 freq
govit - 1 freq
gavotte - 1 freq
€˜caught - 1 freq
cvht - 1 freq
coovid - 1 freq
qvt - 1 freq
cvda - 1 freq
govt - 7 freq
covid” - 1 freq
gvht - 1 freq
qfd - 1 freq
COVID
Time to execute Levenshtein function - 0.270334 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.508720 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.058931 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037494 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000871 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.