A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to clarted in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
clarted (0) - 14 freq
claried (1) - 1 freq
clarten (1) - 1 freq
carted (1) - 2 freq
clerted (1) - 1 freq
clartet (1) - 3 freq
clartie (2) - 4 freq
claithed (2) - 6 freq
clarried (2) - 3 freq
clapped (2) - 28 freq
blarged (2) - 1 freq
clatter (2) - 61 freq
clart (2) - 40 freq
plaited (2) - 3 freq
flared (2) - 4 freq
clarts (2) - 2 freq
clartin (2) - 4 freq
claesed (2) - 2 freq
blurted (2) - 6 freq
charmed (2) - 1 freq
started (2) - 362 freq
clased (2) - 2 freq
cleared (2) - 56 freq
charred (2) - 1 freq
coorted (2) - 2 freq
clarted (0) - 14 freq
clerted (1) - 1 freq
carted (2) - 2 freq
clartet (2) - 3 freq
claried (2) - 1 freq
clarten (2) - 1 freq
cairted (3) - 5 freq
clotted (3) - 1 freq
coorted (3) - 2 freq
clouted (3) - 1 freq
cleared (3) - 56 freq
blirted (3) - 1 freq
clarty (3) - 56 freq
claret (3) - 8 freq
blurted (3) - 6 freq
alerted (3) - 2 freq
chairted (3) - 1 freq
clartit (3) - 15 freq
clart (3) - 40 freq
clarried (3) - 3 freq
clartin (3) - 4 freq
clarts (3) - 2 freq
clartie (3) - 4 freq
chatted (4) - 3 freq
classed (4) - 4 freq
SoundEx code - C463
coloured - 35 freq
cleart - 9 freq
cleared - 56 freq
cloored - 3 freq
colourt - 6 freq
clairty - 8 freq
clartie - 4 freq
clartit - 15 freq
clarts - 2 freq
claart - 1 freq
clarty - 56 freq
clartet - 3 freq
clarted - 14 freq
clarten - 1 freq
clart - 40 freq
chuilyaird's - 1 freq
claret - 8 freq
clort - 3 freq
clortit - 2 freq
cleert - 1 freq
coulered - 1 freq
clert - 1 freq
clarity - 13 freq
'clart - 1 freq
clartier-lookin - 1 freq
clairtit - 3 freq
clearoot - 1 freq
clairt - 5 freq
colourit - 1 freq
clearit - 2 freq
caleeried - 1 freq
clartlessly - 1 freq
clartians - 2 freq
cloured - 3 freq
cloort - 1 freq
collared - 1 freq
cellardyke - 1 freq
clorty - 5 freq
clairtie - 2 freq
clarried - 3 freq
clairts - 1 freq
clartin - 4 freq
clerty - 2 freq
colorado - 1 freq
cullourit - 1 freq
clourt - 1 freq
clairet - 1 freq
cailyardis - 1 freq
€œcoloured - 1 freq
coleridge - 1 freq
culort - 1 freq
clortiest - 1 freq
clertit - 1 freq
€œclart - 1 freq
clerted - 1 freq
claried - 1 freq
claireeadam - 1 freq
cllrdownie - 1 freq
clareadamsonsnp - 2 freq
collart - 1 freq
clerty” - 1 freq
clear-thinking - 1 freq
MetaPhone code - KLRTT
clartit - 15 freq
clartet - 3 freq
clarted - 14 freq
clortit - 2 freq
clairtit - 3 freq
clertit - 1 freq
clerted - 1 freq
CLARTED
Time to execute Levenshtein function - 0.235108 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.372960 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028165 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037704 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000905 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.