A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to canal in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
canal (0) - 27 freq
anal (1) - 1 freq
cabal (1) - 2 freq
canali (1) - 2 freq
canals (1) - 3 freq
cana (1) - 29 freq
caal (1) - 99 freq
canae (1) - 28 freq
canon (2) - 11 freq
anae (2) - 1 freq
canni (2) - 6 freq
mahal (2) - 3 freq
daal (2) - 9 freq
canary (2) - 14 freq
final (2) - 162 freq
natal (2) - 2 freq
cainan (2) - 4 freq
cany (2) - 5 freq
saal (2) - 3 freq
cata (2) - 1 freq
paal (2) - 2 freq
caaf (2) - 4 freq
manual (2) - 6 freq
lanl (2) - 1 freq
donal (2) - 7 freq
canal (0) - 27 freq
canali (1) - 2 freq
canae (2) - 28 freq
canle (2) - 2 freq
caunil (2) - 1 freq
caal (2) - 99 freq
caunel (2) - 6 freq
cana (2) - 29 freq
canals (2) - 3 freq
anal (2) - 1 freq
cabal (2) - 2 freq
caul (3) - 77 freq
nal (3) - 1 freq
cani (3) - 2 freq
cantle (3) - 2 freq
caall (3) - 2 freq
cral (3) - 1 freq
canada (3) - 38 freq
canaan (3) - 3 freq
cans (3) - 45 freq
cannel (3) - 18 freq
cant (3) - 40 freq
canio (3) - 1 freq
connal (3) - 1 freq
craal (3) - 5 freq
SoundEx code - C540
chimley - 11 freq
cannily - 65 freq
caumly - 8 freq
canle - 2 freq
caunle - 26 freq
canal - 27 freq
chimlie - 1 freq
camel - 7 freq
channel - 41 freq
comlie - 1 freq
cunnle - 3 freq
cannel - 18 freq
channel' - 2 freq
cannle - 7 freq
chenille - 1 freq
cumuli - 1 freq
chanel - 1 freq
cummel - 1 freq
chon'll - 1 freq
chon'il - 1 freq
cunnel - 1 freq
caunill - 1 freq
cannilie - 5 freq
comelie - 2 freq
caumel - 1 freq
caunil - 1 freq
canali - 2 freq
can'll - 1 freq
chimla - 3 freq
caunnil - 1 freq
camille - 1 freq
comely - 4 freq
connolly - 2 freq
caunel - 6 freq
cawmill - 2 freq
connul - 2 freq
caunnle - 1 freq
camilla - 2 freq
connal - 1 freq
connelly - 1 freq
camillo - 1 freq
MetaPhone code - KNL
cannily - 65 freq
keenly - 8 freq
queen'll - 3 freq
canle - 2 freq
caunle - 26 freq
canal - 27 freq
kennel - 11 freq
kennle - 2 freq
gunnel - 2 freq
gunnul - 2 freq
kinelie - 1 freq
cunnle - 3 freq
gunnal - 1 freq
cannel - 18 freq
kinely - 4 freq
cannle - 7 freq
kïnnle - 1 freq
ken'll - 1 freq
goeen'll - 1 freq
cunnel - 1 freq
caunill - 1 freq
cannilie - 5 freq
kynlie - 3 freq
caunil - 1 freq
canali - 2 freq
can'll - 1 freq
kinnle - 3 freq
caunnil - 1 freq
connolly - 2 freq
caunel - 6 freq
gun-ile - 1 freq
connul - 2 freq
kunal - 11 freq
caunnle - 1 freq
connal - 1 freq
connelly - 1 freq
CANAL
Time to execute Levenshtein function - 0.199125 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.328182 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027577 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037115 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000827 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.