A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to carlton in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
carleton (1) - 1 freq
calton (1) - 10 freq
carton (1) - 7 freq
charlton (1) - 2 freq
parton (2) - 4 freq
carlos (2) - 3 freq
caxton (2) - 1 freq
carlgo (2) - 1 freq
carbon (2) - 7 freq
cartoon (2) - 13 freq
charltons (2) - 4 freq
carlo (2) - 16 freq
carrion (2) - 1 freq
carlin (2) - 6 freq
canton (2) - 2 freq
carson (2) - 1 freq
cardoon (2) - 1 freq
dalton (2) - 1 freq
barton (2) - 3 freq
cartin (2) - 1 freq
carron (2) - 4 freq
carryon (2) - 1 freq
cornton (2) - 2 freq
cairton (2) - 3 freq
cartons (2) - 1 freq
carleton (1) - 1 freq
calton (2) - 10 freq
charlton (2) - 2 freq
carton (2) - 7 freq
cartin (3) - 1 freq
cartoon (3) - 13 freq
chilton (3) - 1 freq
carlin (3) - 6 freq
cornton (3) - 2 freq
cairton (3) - 3 freq
corleone (4) - 1 freq
cairtin (4) - 7 freq
carnation (4) - 3 freq
carlene (4) - 1 freq
crouton (4) - 2 freq
charlatan (4) - 3 freq
cratin (4) - 1 freq
corntin (4) - 1 freq
cretin (4) - 3 freq
carntyne (4) - 1 freq
cairtoun (4) - 1 freq
curlan (4) - 2 freq
crueton (4) - 1 freq
cairtoon (4) - 1 freq
curlin (4) - 17 freq
SoundEx code - C643
cruelty - 14 freq
curlt - 7 freq
crawlt - 3 freq
curled - 44 freq
crawled - 26 freq
cheerleads - 1 freq
crueltt - 1 freq
crowlt - 1 freq
craalt - 3 freq
crawl't - 1 freq
carlton's - 1 freq
charlotte - 9 freq
craaled - 8 freq
curlit - 1 freq
carolled - 1 freq
curlie-dodie - 1 freq
cheerleaders - 1 freq
crowled - 2 freq
charltons - 4 freq
craw-leid - 1 freq
corralled - 2 freq
curl-doddie - 1 freq
charlatan - 3 freq
curly-heidit - 1 freq
cruelties - 1 freq
charlatans - 1 freq
carltonkirby - 2 freq
charlottegshore - 1 freq
charlton - 2 freq
carleton - 1 freq
curlytalebooks - 11 freq
correlation - 1 freq
charltonleonie’s - 1 freq
MetaPhone code - KRLTN
carleton - 1 freq
CARLTON
Time to execute Levenshtein function - 0.214303 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.397557 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.038323 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037951 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001014 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.