A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to cigale in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
cigale (0) - 1 freq
regale (2) - 2 freq
aigle (2) - 4 freq
cirele (2) - 2 freq
circle (2) - 87 freq
€˜cigale (2) - 1 freq
ciggie (2) - 4 freq
finale (2) - 4 freq
diggle (2) - 1 freq
giggle (2) - 21 freq
sighle (2) - 1 freq
bigake (2) - 5 freq
cigars (2) - 5 freq
gale (2) - 64 freq
cigar (2) - 19 freq
cele (3) - 1 freq
diane (3) - 9 freq
craalt (3) - 3 freq
cage (3) - 59 freq
ciggies (3) - 2 freq
vigil (3) - 53 freq
caalt (3) - 24 freq
yale (3) - 7 freq
icicle (3) - 3 freq
hiddle (3) - 10 freq
cigale (0) - 1 freq
ciggie (3) - 4 freq
cgl (3) - 2 freq
cigar (3) - 19 freq
cagoule (3) - 2 freq
gale (3) - 64 freq
cirele (3) - 2 freq
aigle (3) - 4 freq
regale (3) - 2 freq
creole (4) - 1 freq
coalie (4) - 1 freq
bogle (4) - 45 freq
caall (4) - 2 freq
cal (4) - 29 freq
wigly (4) - 1 freq
nigel (4) - 49 freq
argyle (4) - 5 freq
bigly (4) - 2 freq
coyle (4) - 53 freq
gle (4) - 4 freq
coggie (4) - 5 freq
coggly (4) - 1 freq
angle (4) - 22 freq
cule (4) - 1 freq
cghee (4) - 1 freq
SoundEx code - C240
casually - 16 freq
chuckle - 12 freq
cockle - 4 freq
cheisel - 2 freq
co-equally - 1 freq
casual - 22 freq
cycle - 24 freq
chisel - 10 freq
cassle - 1 freq
cackle - 7 freq
chickle - 1 freq
cosla - 1 freq
cheekily - 3 freq
chessel - 1 freq
cajole - 1 freq
casel - 1 freq
chicl - 1 freq
chook'll - 1 freq
cosily - 1 freq
chesell - 1 freq
coaxial - 1 freq
cochlea - 2 freq
cheesel - 1 freq
coggly - 1 freq
case-law - 1 freq
coocil - 1 freq
cigale - 1 freq
€˜cigale - 1 freq
€œcysill - 1 freq
chagall - 1 freq
cagoule - 2 freq
coklay - 1 freq
cyexul - 1 freq
cashley - 2 freq
cecilia - 2 freq
MetaPhone code - SKL
skail - 51 freq
scuil - 71 freq
skull - 45 freq
skill - 55 freq
scale - 54 freq
scaly - 13 freq
seagull - 9 freq
skeelie - 32 freq
skeely - 22 freq
scowly - 1 freq
scowl - 10 freq
skuil - 39 freq
skale - 1 freq
skeill - 11 freq
squeal - 13 freq
skule - 10 freq
skeel - 45 freq
skelly - 10 freq
cycle - 24 freq
scayle - 1 freq
skool - 3 freq
sqeel - 1 freq
sickle - 4 freq
sequel - 8 freq
sickly - 6 freq
scaaly - 1 freq
saicl - 1 freq
scully - 2 freq
scoul - 2 freq
sea-eagle - 2 freq
skaill - 2 freq
skeillie - 10 freq
skeul - 8 freq
scael - 4 freq
scala - 1 freq
skely - 1 freq
skael - 1 freq
skellie - 4 freq
skeilie - 2 freq
skeil - 2 freq
scaley - 3 freq
skül - 3 freq
sickly' - 1 freq
squill - 4 freq
scöl - 9 freq
scol - 1 freq
seekly - 1 freq
scaal - 1 freq
skeily - 1 freq
seiklie - 1 freq
cigale - 1 freq
€˜cigale - 1 freq
squally - 1 freq
€˜skl - 1 freq
squeel - 4 freq
€˜skull - 1 freq
skil - 3 freq
sculie - 1 freq
scuill - 1 freq
squall - 1 freq
sgoil - 1 freq
skl - 1 freq
ziklie - 1 freq
scull - 1 freq
suql - 1 freq
skuill - 10 freq
xgayl - 2 freq
scl - 1 freq
seacole - 1 freq
skal - 1 freq
zwqil - 1 freq
CIGALE
Time to execute Levenshtein function - 0.206253 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.398749 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028802 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.038840 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000840 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.