A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

Similar words to charity in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
charity (0) - 41 freq
clarity (1) - 13 freq
chairity (1) - 3 freq
chait (2) - 1 freq
clarify (2) - 5 freq
cariy (2) - 1 freq
chariot (2) - 4 freq
charts (2) - 8 freq
rarity (2) - 3 freq
charitie (2) - 2 freq
chatty (2) - 6 freq
christy (2) - 1 freq
clarty (2) - 56 freq
chart (2) - 4 freq
parity (2) - 2 freq
chargit (2) - 2 freq
harit (2) - 1 freq
cavity (2) - 2 freq
chapit (2) - 1 freq
chasit (2) - 2 freq
charley (2) - 1 freq
chicity (2) - 146 freq
chariots (2) - 1 freq
chanty (2) - 10 freq
chitty (3) - 1 freq
Double Levenshtein
charity (0) - 41 freq
chairity (1) - 3 freq
chariot (2) - 4 freq
clarity (2) - 13 freq
charitie (2) - 2 freq
chart (2) - 4 freq
chicity (3) - 146 freq
charley (3) - 1 freq
chasit (3) - 2 freq
chariots (3) - 1 freq
chert (3) - 2 freq
cheritie (3) - 3 freq
cheerit (3) - 1 freq
chapit (3) - 1 freq
cheriot (3) - 2 freq
chanty (3) - 10 freq
chairiot (3) - 2 freq
christy (3) - 1 freq
chairt (3) - 17 freq
chait (3) - 1 freq
clarty (3) - 56 freq
chatty (3) - 6 freq
charts (3) - 8 freq
chargit (3) - 2 freq
harit (3) - 1 freq
SoundEx code - C630
cried - 983 freq
carrot - 19 freq
cooried - 63 freq
cairt - 92 freq
cairried - 144 freq
carried - 35 freq
caird - 96 freq
cheered - 27 freq
crood - 111 freq
cairrit - 27 freq
crowd - 140 freq
court - 154 freq
creaut - 4 freq
coward - 12 freq
cryed - 97 freq
cairiet - 7 freq
carte - 4 freq
create - 51 freq
caird' - 1 freq
criet - 1 freq
corat - 1 freq
cared - 34 freq
cured - 29 freq
chord - 11 freq
coort - 125 freq
curd - 5 freq
courit - 7 freq
crawed - 7 freq
charity - 41 freq
crude - 5 freq
cryit - 35 freq
cheritie - 3 freq
cairriet - 70 freq
couardie - 3 freq
crathie - 1 freq
craad - 2 freq
cairit - 6 freq
crowdie - 4 freq
cardie - 2 freq
card - 61 freq
coor't - 1 freq
coo'ard - 1 freq
curate - 7 freq
charade - 2 freq
cord - 18 freq
coard - 3 freq
cairied - 12 freq
croud - 8 freq
chairt - 17 freq
croudie - 2 freq
cooriet - 8 freq
cheerit - 1 freq
charitie - 2 freq
chirrie-wuid - 1 freq
chariot - 4 freq
crooed - 1 freq
cure't - 2 freq
crowed - 2 freq
carret - 3 freq
caireyt - 1 freq
cheert - 1 freq
cardee - 1 freq
couriet - 1 freq
creed - 10 freq
cerd - 7 freq
cart - 8 freq
chored - 9 freq
couried - 6 freq
crout - 1 freq
cert - 1 freq
cairtie - 12 freq
'cairry-oot' - 2 freq
cairdie - 1 freq
cairred - 2 freq
corrodi - 1 freq
cooardy - 3 freq
cairryit - 2 freq
chart - 4 freq
crate - 6 freq
car'ied - 1 freq
cairret - 2 freq
cairry-oot - 4 freq
carred - 1 freq
cowert - 2 freq
cyaard - 35 freq
cairryt - 1 freq
chert - 2 freq
cooardie - 1 freq
c--road - 1 freq
cardi - 2 freq
cyard - 1 freq
cerried - 3 freq
cartie - 1 freq
carriet - 15 freq
chaired - 6 freq
croat - 1 freq
coured - 1 freq
certy - 3 freq
cooerd - 6 freq
cooard - 1 freq
coorit - 2 freq
certie - 4 freq
crait - 1 freq
corda - 1 freq
curt - 1 freq
crete - 14 freq
cheriot - 2 freq
croatia - 7 freq
curried - 2 freq
courie't - 1 freq
curdoo - 1 freq
curday - 1 freq
couart - 1 freq
'cyaard - 1 freq
cairry'd - 1 freq
coored - 2 freq
'cured' - 1 freq
cairie-oot - 1 freq
crehd - 2 freq
caiort - 1 freq
courried - 1 freq
cairrie-out - 1 freq
couered - 1 freq
—croatia - 1 freq
chard - 1 freq
cardy - 1 freq
chairity - 3 freq
cardio - 1 freq
carruth - 1 freq
cígarette - 1 freq
chairiot - 2 freq
coordy - 1 freq
cohort - 4 freq
cerry-oot - 1 freq
craa-tae - 1 freq
charred - 1 freq
chord' - 1 freq
carryout - 1 freq
cairt” - 1 freq
'court - 1 freq
crit - 1 freq
charaid - 1 freq
carat - 1 freq
MetaPhone code - XRT
shroud - 10 freq
cheered - 27 freq
short - 323 freq
shared - 87 freq
sheared - 6 freq
shirt - 75 freq
shard - 2 freq
chord - 11 freq
charity - 41 freq
cheritie - 3 freq
shirty - 1 freq
charade - 2 freq
chairt - 17 freq
shrood - 5 freq
cheerit - 1 freq
charitie - 2 freq
shooert - 2 freq
chariot - 4 freq
cheert - 1 freq
chored - 9 freq
shoart - 41 freq
shired - 5 freq
shred - 1 freq
shrewd - 2 freq
shird - 1 freq
chart - 4 freq
'shorty' - 1 freq
chert - 2 freq
shoured - 1 freq
shaired - 3 freq
chaired - 6 freq
cheriot - 2 freq
shoard - 4 freq
sharet - 3 freq
chard - 1 freq
chairity - 3 freq
chairiot - 2 freq
shooered - 3 freq
‘short - 1 freq
shurt - 1 freq
charred - 1 freq
shaerd - 2 freq
shired' - 1 freq
chord' - 1 freq
charaid - 1 freq
Manually curated - CHARITY
Time to execute Levenshtein function - 0.174308 milliseconds
The Levenshtein distance is the number of single-character replacements, insertions or deletions needed to transform one word into another. It's useful for detecting typos and alternative spellings.
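As a rough illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming edit distance with unit costs. It is not the code behind this page; the function name `levenshtein` is chosen for the example only.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance with unit cost for replace, insert and delete."""
    # Keep the shorter word in the inner loop so the rows stay small.
    if len(a) < len(b):
        a, b = b, a
    previous = list(range(len(b) + 1))  # distances from "" to each prefix of b
    for i, ch_a in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to ""
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # delete ch_a
                current[j - 1] + 1,      # insert ch_b
                previous[j - 1] + cost,  # replace ch_a with ch_b (or keep it)
            ))
        previous = current
    return previous[-1]


# A few words from the list above, ranked by distance from "charity".
for word in sorted(["chitty", "chariot", "clarity", "chairity"],
                   key=lambda w: levenshtein("charity", w)):
    print(word, levenshtein("charity", word))
```

The distances this gives for clarity (1), chairity (1), chariot (2) and chitty (3) agree with the figures in the Levenshtein list.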
Time to execute Double Levenshtein function - 0.328334 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
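The idea can be sketched in Python as below. This is an assumption-laden reconstruction rather than the page's own code: the helper names are invented, and treating y as a vowel is inferred from the scores listed above (chariot scores 2, which only works out if the y in charity is dropped).

```python
def levenshtein(a: str, b: str) -> int:
    """Plain edit distance (unit costs), computed row by row."""
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


# Assumption: y counts as a vowel, inferred from the listed scores.
VOWELS = set("aeiouy")


def strip_vowels(word: str) -> str:
    """Return the consonant skeleton of a word."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Full-word distance plus distance between consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


for word in ["chairity", "clarity", "chariot", "chicity"]:
    print(word, double_levenshtein("charity", word))
```

Under these assumptions chairity scores 1, because its consonant skeleton is identical to that of charity, while clarity rises from 1 to 2 because the changed letter is a consonant and is counted twice.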
Time to execute SoundEx function - 0.027427 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
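For illustration, a minimal Python sketch of the classic American Soundex rules follows. It is an assumption that the page uses exactly these rules; the function name `soundex` and the decision to ignore non-ASCII letters and punctuation are choices made for this example.

```python
def soundex(word: str) -> str:
    """Classic four-character Soundex code, e.g. 'charity' -> 'C630'."""
    groups = (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
              ("l", "4"), ("mn", "5"), ("r", "6"))
    codes = {ch: digit for letters, digit in groups for ch in letters}

    # Assumption for this sketch: keep only the letters a-z.
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""

    result = letters[0].upper()
    previous = codes.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate consonants with the same code
        digit = codes.get(ch, "")  # vowels and y have no code
        if digit and digit != previous:
            result += digit
        previous = digit  # a vowel resets this, so a repeated code counts again
    return (result + "000")[:4]  # pad with zeros, truncate to four characters


for word in ["charity", "cried", "court", "cheritie"]:
    print(word, soundex(word))
```

Each of these four words encodes to C630, which is why such different-looking words as cried and court appear alongside charity in the SoundEx list.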
Time to execute MetaPhone function - 0.037286 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.000844 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
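A lookup of this kind can be sketched in Python as a plain dictionary. The entries below are hypothetical and chosen only for illustration, since the corpus's real table is not shown here; the names `LEXICON`, `lemma_for` and `variants_of` are likewise invented for the example.

```python
from typing import List, Optional

# Hypothetical entries for illustration only; not the corpus's actual lexicon.
LEXICON = {
    "charity": "CHARITY",
    "chairity": "CHARITY",
    "charitie": "CHARITY",
    "cheritie": "CHARITY",
}


def lemma_for(word: str) -> Optional[str]:
    """Return the curated lemma for a word, or None if it is not covered."""
    return LEXICON.get(word.lower())


def variants_of(lemma: str) -> List[str]:
    """Return every listed spelling that maps to the given lemma."""
    return sorted(w for w, l in LEXICON.items() if l == lemma)


print(lemma_for("chairity"))
print(lemma_for("clarty"))  # not in this toy table, so the lookup returns None
print(variants_of("CHARITY"))
```

Because this is a single dictionary lookup with no distance computation, it is consistent with the manually curated function being by far the fastest of the five timings reported above.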