A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to caved in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
caved (0) - 9 freq
carved (1) - 21 freq
cared (1) - 32 freq
caped (1) - 2 freq
ca-ed (1) - 1 freq
maved (1) - 1 freq
cave (1) - 79 freq
calved (1) - 1 freq
cawed (1) - 262 freq
caed (1) - 4 freq
saved (1) - 83 freq
paved (1) - 4 freq
caves (1) - 16 freq
chaved (1) - 1 freq
ca'ed (1) - 76 freq
caaed (1) - 95 freq
waved (1) - 62 freq
caged (1) - 3 freq
caven (1) - 1 freq
caked (1) - 6 freq
craved (1) - 3 freq
raved (1) - 3 freq
taped (2) - 4 freq
cases (2) - 54 freq
cauve (2) - 1 freq
caved (0) - 9 freq
waved (2) - 62 freq
caaed (2) - 95 freq
ca'ed (2) - 76 freq
caven (2) - 1 freq
chaved (2) - 1 freq
caked (2) - 6 freq
covid (2) - 73 freq
raved (2) - 3 freq
craved (2) - 3 freq
caves (2) - 16 freq
caged (2) - 3 freq
caped (2) - 2 freq
maved (2) - 1 freq
cared (2) - 32 freq
paved (2) - 4 freq
carved (2) - 21 freq
cave (2) - 79 freq
ca-ed (2) - 1 freq
saved (2) - 83 freq
cawed (2) - 262 freq
caed (2) - 4 freq
calved (2) - 1 freq
covet (3) - 3 freq
cavie (3) - 2 freq
SoundEx code - C130
cowpit - 35 freq
chappit - 45 freq
cowpt - 21 freq
cept - 25 freq
chuffed - 84 freq
chapped - 40 freq
coupt - 3 freq
copied - 13 freq
cowped - 48 freq
cupped - 9 freq
cooped - 7 freq
caved - 9 freq
chipped - 14 freq
caped - 2 freq
chippit - 7 freq
chappt - 3 freq
cuffed - 1 freq
chuffd - 2 freq
coft - 9 freq
cupid - 3 freq
'cept - 9 freq
coupit - 16 freq
covid - 73 freq
couped - 9 freq
chaaved - 3 freq
chaved - 1 freq
chuffit - 2 freq
chaft - 13 freq
chuff't - 3 freq
chappet - 2 freq
cheviot - 3 freq
chibbed - 3 freq
coppit - 3 freq
capita - 1 freq
choppt - 2 freq
coopit - 3 freq
covet - 3 freq
covid- - 10 freq
chipt - 1 freq
chopped - 6 freq
chippid - 1 freq
cheepit - 3 freq
cuppid - 1 freq
caveat - 1 freq
cavity - 2 freq
cop-oot - 1 freq
coftee - 1 freq
ciabatta - 1 freq
cave-heid - 2 freq
chaffed - 2 freq
chaift - 1 freq
chuft - 2 freq
cofft - 1 freq
cuppit - 3 freq
chap't - 1 freq
co'peth - 2 freq
cappit - 3 freq
cowboy-hat - 1 freq
cpd - 3 freq
€œcept - 1 freq
caputh - 1 freq
chapit - 1 freq
chafft - 1 freq
chafed - 1 freq
choped - 2 freq
coped - 2 freq
chaffit - 1 freq
chauvit - 1 freq
chippt - 1 freq
chufft - 2 freq
cvht - 1 freq
cbd - 1 freq
coovid - 1 freq
cowpat - 1 freq
cvda - 1 freq
cpt - 1 freq
cbeath - 1 freq
covid” - 1 freq
chuffedÂ… - 1 freq
MetaPhone code - KFT
caught - 191 freq
goaved - 7 freq
caved - 9 freq
cuffed - 1 freq
coft - 9 freq
coughed - 15 freq
covid - 73 freq
'caught' - 1 freq
gïft - 1 freq
covet - 3 freq
guffed - 4 freq
covid- - 10 freq
gaffed - 13 freq
goved - 2 freq
caveat - 1 freq
cavity - 2 freq
coftee - 1 freq
cofft - 1 freq
govit - 1 freq
gavotte - 1 freq
€˜caught - 1 freq
cvht - 1 freq
coovid - 1 freq
qvt - 1 freq
cvda - 1 freq
govt - 7 freq
covid” - 1 freq
gvht - 1 freq
qfd - 1 freq
CAVED
Time to execute Levenshtein function - 0.294097 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.504546 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027504 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.070644 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000785 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.