A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to clerted in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
clerted (0) - 1 freq
alerted (1) - 2 freq
clarted (1) - 14 freq
lested (2) - 4 freq
clouted (2) - 1 freq
herted (2) - 6 freq
elected (2) - 10 freq
clertit (2) - 1 freq
bleated (2) - 1 freq
cairted (2) - 5 freq
cerved (2) - 1 freq
glented (2) - 3 freq
cleared (2) - 56 freq
clotted (2) - 1 freq
cherged (2) - 4 freq
clert (2) - 1 freq
cheated (2) - 2 freq
exerted (2) - 1 freq
coorted (2) - 2 freq
created (2) - 27 freq
cleshed (2) - 1 freq
clerty (2) - 2 freq
leeted (2) - 1 freq
cleed (2) - 2 freq
clartet (2) - 3 freq
clerted (0) - 1 freq
clarted (1) - 14 freq
alerted (2) - 2 freq
clartet (3) - 3 freq
created (3) - 27 freq
clert (3) - 1 freq
claried (3) - 1 freq
coorted (3) - 2 freq
carted (3) - 2 freq
blurted (3) - 6 freq
blirted (3) - 1 freq
clotted (3) - 1 freq
clarten (3) - 1 freq
clerty (3) - 2 freq
cairted (3) - 5 freq
clouted (3) - 1 freq
clertit (3) - 1 freq
cleared (3) - 56 freq
clarty (4) - 56 freq
cleart (4) - 9 freq
curated (4) - 1 freq
tolerated (4) - 5 freq
coulered (4) - 1 freq
caleeried (4) - 1 freq
collated (4) - 1 freq
SoundEx code - C463
coloured - 35 freq
cleart - 9 freq
cleared - 56 freq
cloored - 3 freq
colourt - 6 freq
clairty - 8 freq
clartie - 4 freq
clartit - 15 freq
clarts - 2 freq
claart - 1 freq
clarty - 56 freq
clartet - 3 freq
clarted - 14 freq
clarten - 1 freq
clart - 40 freq
chuilyaird's - 1 freq
claret - 8 freq
clort - 3 freq
clortit - 2 freq
cleert - 1 freq
coulered - 1 freq
clert - 1 freq
clarity - 13 freq
'clart - 1 freq
clartier-lookin - 1 freq
clairtit - 3 freq
clearoot - 1 freq
clairt - 5 freq
colourit - 1 freq
clearit - 2 freq
caleeried - 1 freq
clartlessly - 1 freq
clartians - 2 freq
cloured - 3 freq
cloort - 1 freq
collared - 1 freq
cellardyke - 1 freq
clorty - 5 freq
clairtie - 2 freq
clarried - 3 freq
clairts - 1 freq
clartin - 4 freq
clerty - 2 freq
colorado - 1 freq
cullourit - 1 freq
clourt - 1 freq
clairet - 1 freq
cailyardis - 1 freq
€œcoloured - 1 freq
coleridge - 1 freq
culort - 1 freq
clortiest - 1 freq
clertit - 1 freq
€œclart - 1 freq
clerted - 1 freq
claried - 1 freq
claireeadam - 1 freq
cllrdownie - 1 freq
clareadamsonsnp - 2 freq
collart - 1 freq
clerty” - 1 freq
clear-thinking - 1 freq
MetaPhone code - KLRTT
clartit - 15 freq
clartet - 3 freq
clarted - 14 freq
clortit - 2 freq
clairtit - 3 freq
clertit - 1 freq
clerted - 1 freq
CLERTED
Time to execute Levenshtein function - 0.217456 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.350512 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027761 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037594 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000985 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.