A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

Similar words to cicada in Corpus

Methods: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated
Levenshtein (distance in brackets)
cicada (0) - 2 freq
cicadas (1) - 1 freq
canada (2) - 38 freq
hirada (2) - 1 freq
mikado (3) - 1 freq
vada (3) - 1 freq
scala (3) - 1 freq
gicd (3) - 1 freq
kanada (3) - 1 freq
cara (3) - 3 freq
iced (3) - 14 freq
wada (3) - 1 freq
micah (3) - 2 freq
wida (3) - 5 freq
arcana (3) - 1 freq
cata (3) - 1 freq
“caa (3) - 1 freq
'cawa (3) - 1 freq
caad (3) - 306 freq
ricardo (3) - 6 freq
cricd (3) - 1 freq
cinema (3) - 8 freq
cctd (3) - 1 freq
ida (3) - 127 freq
arcade (3) - 8 freq
Double Levenshtein (distance in brackets)
cicada (0) - 2 freq
cicadas (2) - 1 freq
canada (3) - 38 freq
facade (4) - 4 freq
corda (4) - 1 freq
chad (4) - 3 freq
ccea (4) - 1 freq
cockad (4) - 1 freq
clad (4) - 14 freq
decade (4) - 30 freq
cudda (4) - 2 freq
cad (4) - 40 freq
cvda (4) - 1 freq
arcadia (4) - 7 freq
cindy (4) - 6 freq
cind (4) - 2 freq
diced (4) - 4 freq
scad (4) - 6 freq
canadae (4) - 6 freq
ticed (4) - 2 freq
cocoa (4) - 3 freq
cicero (4) - 1 freq
craad (4) - 2 freq
ca'ad (4) - 11 freq
cscd (4) - 1 freq
SoundEx code - C230
caught - 191 freq
chased - 50 freq
cockit - 30 freq
cast - 226 freq
chist - 68 freq
chokit - 18 freq
cooked - 27 freq
chest - 97 freq
checked - 84 freq
coast - 115 freq
caused - 78 freq
choked - 19 freq
cook'd - 1 freq
cost - 113 freq
cicada - 2 freq
ceased - 7 freq
chucked - 28 freq
cuist - 14 freq
cachet - 1 freq
caucht - 32 freq
chase't - 1 freq
caked - 6 freq
cocht - 11 freq
chuckt - 6 freq
checkt - 5 freq
cackit - 1 freq
cheekit - 2 freq
chikked - 1 freq
cocked - 10 freq
coushie-dou - 1 freq
coughed - 15 freq
check-oot - 8 freq
chuckit - 3 freq
cassidy - 4 freq
chackt - 4 freq
cassette - 3 freq
costa - 7 freq
chacked - 2 freq
cist - 1 freq
cockade - 3 freq
checkit - 20 freq
chocht - 1 freq
chicked - 1 freq
chasit - 2 freq
coaxit - 1 freq
caist - 4 freq
choakit - 4 freq
causit - 3 freq
checkout - 2 freq
cushie-doo - 7 freq
checket - 2 freq
choaket - 1 freq
chukket - 1 freq
chucket - 3 freq
chaist - 3 freq
casset - 2 freq
cogged - 2 freq
chakked - 1 freq
coogate - 2 freq
'c'est - 1 freq
cascade - 3 freq
caste - 2 freq
checkoot - 20 freq
coked - 5 freq
cassat - 1 freq
chaak-white - 2 freq
'cost - 1 freq
'caught' - 1 freq
cockatoo - 1 freq
caged - 3 freq
cheust - 150 freq
caased - 6 freq
choke't - 3 freq
cookit - 3 freq
chuckst - 1 freq
coaxed - 3 freq
chokt - 2 freq
coaxt - 1 freq
chuggte - 1 freq
cowkit - 1 freq
'chest - 1 freq
cashed - 3 freq
chyst - 1 freq
chockit - 2 freq
choost - 12 freq
cosset - 1 freq
casket - 1 freq
c'est - 1 freq
'cheust - 1 freq
coist - 4 freq
cock-eyed - 1 freq
'chist - 1 freq
cocotte - 1 freq
chukkit - 1 freq
costo - 1 freq
coost - 1 freq
cushat' - 1 freq
cakit - 1 freq
chocked - 3 freq
chowked - 2 freq
cosst - 1 freq
chowkit - 2 freq
chusit - 1 freq
coasta - 1 freq
ceest - 1 freq
cushat - 1 freq
chaste - 1 freq
cock-ee'd - 1 freq
chiste - 1 freq
chugged - 1 freq
cowked - 1 freq
cokkit - 1 freq
cuikit - 2 freq
chuisit - 2 freq
cousteau - 1 freq
coucht - 1 freq
chisty - 1 freq
cowgate - 2 freq
‘caught - 1 freq
ciste - 2 freq
chackit - 1 freq
coukit - 1 freq
coste - 1 freq
costie - 1 freq
“caiket - 1 freq
cwgd - 1 freq
coukd - 2 freq
coached - 1 freq
cyst - 1 freq
'cast' - 1 freq
chicity - 146 freq
chesty - 1 freq
cockad - 1 freq
choaked - 1 freq
MetaPhone code - SKT
sooked - 23 freq
scot - 107 freq
scott - 290 freq
zact - 4 freq
seekit - 1 freq
skid - 2 freq
cicada - 2 freq
skite - 42 freq
scootie - 2 freq
squad - 38 freq
scout - 11 freq
sagged - 1 freq
skied - 3 freq
socket - 4 freq
sookit - 27 freq
soackit - 1 freq
scaud - 3 freq
scda - 2 freq
scud - 30 freq
s'guid - 1 freq
skitey - 5 freq
scoot - 13 freq
skyte - 4 freq
soukt - 3 freq
soukit - 10 freq
scawt - 2 freq
soaket - 4 freq
skait - 1 freq
sct - 1 freq
sect - 3 freq
skit - 3 freq
squat - 7 freq
scad - 6 freq
scatty - 1 freq
scoit - 1 freq
scuddy - 2 freq
skoda - 2 freq
soakit - 4 freq
segued - 1 freq
saiket - 1 freq
skate - 10 freq
sacked - 9 freq
skytie - 3 freq
skud - 2 freq
soaked - 11 freq
sackt - 1 freq
skoit - 12 freq
squoot - 1 freq
squeed - 1 freq
skoot - 1 freq
skæt - 1 freq
skiddy - 2 freq
scooty - 2 freq
sqiud - 1 freq
skitie - 4 freq
sackit - 1 freq
scottie - 3 freq
saggit - 1 freq
skett - 1 freq
scota - 2 freq
sikkit - 1 freq
scott' - 7 freq
scuddie - 3 freq
scotty - 29 freq
sookt - 1 freq
secked - 2 freq
skyty - 3 freq
skawtie - 1 freq
…scud - 2 freq
skiyt - 1 freq
sscd - 4 freq
sucked - 5 freq
sae-caad - 1 freq
scata - 2 freq
zigged - 1 freq
zagged - 2 freq
“scott - 2 freq
skeet - 1 freq
scoti - 1 freq
sgt - 1 freq
scottyw - 3 freq
‘scot’ - 1 freq
zukt - 1 freq
squid - 1 freq
skdw - 1 freq
s’got - 1 freq
scotto - 1 freq
scoto - 1 freq
zct - 1 freq
xkd - 1 freq
skooty - 2 freq
xcd - 1 freq
zzuwgwt - 1 freq
'skaddi' - 1 freq
zgd - 1 freq
squaddie - 1 freq
scotti - 2 freq
skad - 1 freq
Manually curated
CICADA
Time to execute Levenshtein function - 0.206058 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
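As an illustration of the definition above, here is a minimal Python sketch of the edit distance, using the standard row-by-row dynamic programming approach. It is not the code behind this page; the names `levenshtein` and `nearest` are hypothetical and chosen only for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Count the single-character replacements, insertions and
    deletions needed to turn word a into word b."""
    # previous[j] holds the distance between the first i-1 letters of a
    # and the first j letters of b.
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning i letters into nothing takes i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb, or keep a match
            ))
        previous = current
    return previous[-1]


def nearest(target: str, vocabulary: list[str], limit: int = 5) -> list[str]:
    """Rank vocabulary words by distance to target (ties broken alphabetically)."""
    return sorted(vocabulary, key=lambda w: (levenshtein(target, w), w))[:limit]


print(levenshtein("cicada", "cicadas"))  # 1 (one insertion)
print(levenshtein("cicada", "canada"))   # 2 (two replacements)
```

These two values agree with the distances shown in brackets in the Levenshtein results for cicada.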
Time to execute Double Levenshtein function - 0.364789 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
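The idea can be sketched in Python as follows. This is an assumption-laden illustration rather than the site's own code: the page does not say which letters count as vowels, so the set a, e, i, o, u used here is a guess, and the names `strip_vowels` and `double_levenshtein` are hypothetical.

```python
VOWELS = set("aeiou")  # assumption: the page does not state its vowel set


def levenshtein(a: str, b: str) -> int:
    """Plain edit distance (replace, insert, delete), computed row by row."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop the vowels so that only the consonant skeleton remains."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("cicada", "cicadas"))  # 1 + 1 = 2
print(double_levenshtein("cicada", "canada"))   # 2 + 1 = 3
```

A consonant difference is counted in both passes while a vowel difference is counted only once, which is where the extra weight on consonants comes from.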
Time to execute SoundEx function - 0.029351 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
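For readers who want to see how such a code is built, here is a minimal Python sketch of the commonly described American Soundex rules: keep the first letter, map the remaining consonants to digits, skip vowels, collapse adjacent letters with the same digit, and pad to four characters. Whether the site uses exactly this variant is an assumption, and the function name `soundex` is chosen only for the example.

```python
# Digit groups of the commonly described American Soundex scheme.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for letter in letters:
        CODES[letter] = digit


def soundex(word: str) -> str:
    """Return the four-character Soundex code, or "" if the word has no letters."""
    letters = [ch for ch in word.lower() if ch.isascii() and ch.isalpha()]
    if not letters:
        return ""
    result = letters[0].upper()
    last = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters that share a digit
        code = CODES.get(ch, "")
        if code and code != last:
            result += code
        last = code  # a vowel resets this, so a repeated digit is coded again
    return (result + "000")[:4]


print(soundex("cicada"))  # C230
print(soundex("caught"))  # C230
```

Both words produce C230, the SoundEx code shown for cicada above, which is why caught appears in its SoundEx list.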
Time to execute MetaPhone function - 0.044178 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001408 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
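A lookup of this kind can be sketched as a plain dictionary from word form to lemma. The sketch below is only illustrative: the entry for cicada matches the result shown above, but the entry for cicadas and the names `LEXICON`, `lemma_for` and `variants_of` are hypothetical.

```python
from typing import Optional

# Hypothetical hand-made lexicon: word form -> lemma.
LEXICON = {
    "cicada": "CICADA",
    "cicadas": "CICADA",  # assumed entry, for illustration only
}


def lemma_for(word: str) -> Optional[str]:
    """Return the curated lemma, or None when the word is not covered."""
    return LEXICON.get(word.lower())


def variants_of(word: str) -> list[str]:
    """List every word form that shares a lemma with the given word."""
    lemma = lemma_for(word)
    if lemma is None:
        return []
    return sorted(form for form, entry in LEXICON.items() if entry == lemma)


print(lemma_for("cicada"))    # CICADA
print(variants_of("cicada"))  # ['cicada', 'cicadas']
print(lemma_for("sonsie"))    # None (not in this toy lexicon)
```

Because the lookup is a single dictionary access it is very cheap, but it can only answer for words someone has already entered.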