A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to cydonia in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
cydonia (0) - 2 freq
cadona (2) - 6 freq
cidni (3) - 1 freq
antonia (3) - 8 freq
colonic (3) - 1 freq
lydon (3) - 1 freq
ammonia (3) - 8 freq
sonia (3) - 2 freq
adonis (3) - 3 freq
cidna (3) - 2 freq
cronie (3) - 2 freq
cudna (3) - 165 freq
coia (3) - 1 freq
codonas (3) - 1 freq
macedonia (3) - 5 freq
yona (3) - 12 freq
madonna (3) - 4 freq
cyclonic (3) - 1 freq
colonial (3) - 8 freq
crdon (3) - 1 freq
donna (3) - 51 freq
caledonia (3) - 16 freq
pysonin (3) - 3 freq
corona (3) - 19 freq
lydia (3) - 1 freq
cydonia (0) - 2 freq
cadona (2) - 6 freq
cudna (3) - 165 freq
cidna (3) - 2 freq
cudnea (3) - 2 freq
cidni (3) - 1 freq
caledonia (4) - 16 freq
crdon (4) - 1 freq
corona (4) - 19 freq
coudna (4) - 47 freq
coodna (4) - 71 freq
cuidna (4) - 94 freq
cudnae (4) - 144 freq
cidnae (4) - 2 freq
lydon (4) - 1 freq
dona (4) - 1 freq
cronie (4) - 2 freq
codonas (4) - 1 freq
macedonia (4) - 5 freq
academia (5) - 9 freq
wadna (5) - 223 freq
croodin (5) - 2 freq
doein (5) - 1 freq
cuddie (5) - 43 freq
cadence (5) - 2 freq
SoundEx code - C350
cuttin - 73 freq
cut-doon - 2 freq
cuidnae - 135 freq
caution - 9 freq
coudna - 47 freq
cotton - 25 freq
cuidna - 94 freq
cudnae - 144 freq
cudna - 165 freq
chattin - 22 freq
cheatin - 4 freq
cotton-woo - 2 freq
cidna - 2 freq
cidnae - 2 freq
cwidna - 47 freq
chidin - 1 freq
cuttin' - 1 freq
coodna - 71 freq
coddin - 2 freq
chaitin - 3 freq
cud'nae - 1 freq
'cudna - 1 freq
coudnae - 19 freq
cuttan - 4 freq
chaetin - 1 freq
cheatan - 1 freq
coudno - 3 freq
cadona - 6 freq
coodnae - 52 freq
cuddie-an - 1 freq
chatham - 1 freq
cydonia - 2 freq
cweedna - 2 freq
cidni - 1 freq
cottown - 1 freq
chattan - 1 freq
€œcudna - 1 freq
coatin - 1 freq
citin - 1 freq
chutney - 4 freq
cowden - 7 freq
codeine - 1 freq
ctyem - 1 freq
cudnea - 2 freq
MetaPhone code - STN
stane - 414 freq
sittin - 721 freq
staun - 216 freq
stany - 5 freq
sudden - 210 freq
steen - 114 freq
staan - 13 freq
settin - 142 freq
hsten - 1 freq
saitin - 3 freq
settan - 14 freq
sydney - 7 freq
stone - 83 freq
suttin - 81 freq
seatoun - 1 freq
setten - 22 freq
sitten - 12 freq
sittin' - 19 freq
sidden - 16 freq
cidna - 2 freq
wystin - 2 freq
stan - 149 freq
cidnae - 2 freq
sodden - 11 freq
stoon - 9 freq
stein - 22 freq
staney - 6 freq
sidn - 1 freq
steenie - 25 freq
stan¢ - 1 freq
satan - 24 freq
seton - 18 freq
settin' - 12 freq
seitten - 1 freq
suddin - 1 freq
syden - 1 freq
staine - 2 freq
sa'tin' - 1 freq
sedn - 1 freq
stine - 1 freq
stain - 26 freq
seatin - 4 freq
satin - 12 freq
sutten - 8 freq
steeny - 7 freq
stawn - 4 freq
sidney - 23 freq
sidon - 11 freq
sïttin - 8 freq
stane' - 2 freq
sautin - 1 freq
stan' - 6 freq
'stan - 2 freq
side-on - 1 freq
sit-in - 2 freq
steen' - 2 freq
sea-aeten - 1 freq
sittan - 32 freq
soodna - 7 freq
stown - 7 freq
ston - 9 freq
stun - 2 freq
soddan - 1 freq
staen - 2 freq
situn - 1 freq
steyn - 8 freq
stony - 3 freq
suden - 1 freq
setteen - 2 freq
stoun - 2 freq
seteen - 1 freq
sudna - 13 freq
stonn - 1 freq
sateen - 1 freq
said-na - 1 freq
soudna - 2 freq
seedin - 1 freq
sautan - 1 freq
saidna - 1 freq
cydonia - 2 freq
cidni - 1 freq
suitin - 1 freq
swytin - 1 freq
'staun - 1 freq
€˜staun - 1 freq
suidna - 3 freq
sawtan - 1 freq
stanie - 2 freq
staun' - 2 freq
stane- - 1 freq
soddin' - 1 freq
suiden - 1 freq
€œstaan - 1 freq
sudn - 1 freq
siden - 2 freq
€˜siden - 1 freq
citin - 1 freq
stenn - 1 freq
suidnae - 1 freq
sutton - 6 freq
sittn - 1 freq
sitin - 1 freq
set-in - 1 freq
stoney - 3 freq
seaton - 3 freq
setn - 1 freq
stane” - 1 freq
settn - 1 freq
sudan - 1 freq
CYDONIA
Time to execute Levenshtein function - 0.220141 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.547079 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.033440 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.065089 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000753 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.