A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to covet in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
covet (0) - 3 freq
coven (1) - 4 freq
comet (1) - 12 freq
covert (1) - 39 freq
cover (1) - 147 freq
cove (1) - 20 freq
coist (2) - 4 freq
lovel (2) - 1 freq
coupt (2) - 3 freq
cet (2) - 1 freq
poket (2) - 4 freq
toves (2) - 1 freq
coost (2) - 1 freq
co'en (2) - 2 freq
cott (2) - 4 freq
love' (2) - 2 freq
coded (2) - 2 freq
rover (2) - 12 freq
goved (2) - 2 freq
coit (2) - 7 freq
oven (2) - 51 freq
coatet (2) - 3 freq
corer (2) - 1 freq
coney (2) - 2 freq
comeat (2) - 1 freq
covet (0) - 3 freq
caveat (2) - 1 freq
coven (2) - 4 freq
cove (2) - 20 freq
cover (2) - 147 freq
comet (2) - 12 freq
covert (2) - 39 freq
duvet (3) - 20 freq
court (3) - 154 freq
cost (3) - 114 freq
movit (3) - 6 freq
count (3) - 45 freq
govt (3) - 7 freq
livet (3) - 1 freq
cave (3) - 79 freq
govit (3) - 1 freq
cote (3) - 4 freq
covid (3) - 73 freq
coot (3) - 2 freq
cvht (3) - 1 freq
cov (3) - 1 freq
vet (3) - 37 freq
coveret (3) - 2 freq
cont (3) - 2 freq
colt (3) - 1 freq
SoundEx code - C130
cowpit - 35 freq
chappit - 45 freq
cowpt - 21 freq
cept - 25 freq
chuffed - 85 freq
chapped - 40 freq
coupt - 3 freq
copied - 13 freq
cowped - 48 freq
cupped - 9 freq
cooped - 7 freq
caved - 9 freq
chipped - 14 freq
caped - 2 freq
chippit - 7 freq
chappt - 3 freq
cuffed - 1 freq
chuffd - 2 freq
coft - 9 freq
cupid - 3 freq
'cept - 9 freq
coupit - 16 freq
covid - 73 freq
couped - 9 freq
chaaved - 3 freq
chaved - 1 freq
chuffit - 2 freq
chaft - 13 freq
chuff't - 3 freq
chappet - 2 freq
cheviot - 3 freq
chopped - 7 freq
chibbed - 3 freq
coppit - 3 freq
capita - 1 freq
choppt - 2 freq
coopit - 3 freq
covet - 3 freq
covid- - 10 freq
chipt - 1 freq
chippid - 1 freq
cheepit - 3 freq
cuppid - 1 freq
caveat - 1 freq
cavity - 2 freq
cop-oot - 1 freq
coftee - 1 freq
ciabatta - 1 freq
cave-heid - 2 freq
chaffed - 2 freq
chaift - 1 freq
chuft - 2 freq
cofft - 1 freq
cuppit - 3 freq
chap't - 1 freq
co'peth - 2 freq
cappit - 3 freq
cowboy-hat - 1 freq
cpd - 3 freq
€œcept - 1 freq
caputh - 1 freq
chapit - 1 freq
chafft - 1 freq
chafed - 1 freq
choped - 2 freq
coped - 2 freq
chaffit - 1 freq
chauvit - 1 freq
chippt - 1 freq
chufft - 2 freq
cvht - 1 freq
cbd - 1 freq
coovid - 1 freq
cowpat - 1 freq
cvda - 1 freq
cpt - 1 freq
cbeath - 1 freq
covid” - 1 freq
chuffedÂ… - 1 freq
MetaPhone code - KFT
caught - 191 freq
goaved - 7 freq
caved - 9 freq
cuffed - 1 freq
coft - 9 freq
coughed - 15 freq
covid - 73 freq
'caught' - 1 freq
gïft - 1 freq
covet - 3 freq
guffed - 4 freq
covid- - 10 freq
gaffed - 13 freq
goved - 2 freq
caveat - 1 freq
cavity - 2 freq
coftee - 1 freq
cofft - 1 freq
govit - 1 freq
gavotte - 1 freq
€˜caught - 1 freq
cvht - 1 freq
coovid - 1 freq
qvt - 1 freq
cvda - 1 freq
govt - 7 freq
covid” - 1 freq
gvht - 1 freq
qfd - 1 freq
COVET
Time to execute Levenshtein function - 0.194167 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.373845 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027573 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037920 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000923 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.