A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to clarity in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
clarity (0) - 13 freq
clarify (1) - 5 freq
charity (1) - 41 freq
clarty (1) - 56 freq
parity (2) - 2 freq
clarky (2) - 2 freq
clairty (2) - 8 freq
alacrity (2) - 1 freq
clart (2) - 40 freq
claret (2) - 8 freq
claith (2) - 59 freq
clawit (2) - 1 freq
cariy (2) - 1 freq
chairity (2) - 3 freq
calamity (2) - 5 freq
clearit (2) - 2 freq
clarry (2) - 2 freq
claried (2) - 1 freq
hilarity (2) - 5 freq
clartit (2) - 15 freq
clarion (2) - 1 freq
clatty (2) - 26 freq
cavity (2) - 2 freq
clorty (2) - 5 freq
rarity (2) - 2 freq
clarity (0) - 13 freq
clarty (1) - 56 freq
clerty (2) - 2 freq
clearit (2) - 2 freq
clorty (2) - 5 freq
clart (2) - 40 freq
claret (2) - 8 freq
charity (2) - 41 freq
clarify (2) - 5 freq
clairty (2) - 8 freq
clarts (3) - 2 freq
clairt (3) - 5 freq
clairet (3) - 1 freq
clatty (3) - 26 freq
clargy (3) - 1 freq
clarion (3) - 1 freq
cleart (3) - 9 freq
clartie (3) - 4 freq
claart (3) - 1 freq
clort (3) - 3 freq
claried (3) - 1 freq
clartit (3) - 15 freq
clawit (3) - 1 freq
clarky (3) - 2 freq
clert (3) - 1 freq
SoundEx code - C463
coloured - 35 freq
cleart - 9 freq
cleared - 56 freq
cloored - 3 freq
colourt - 6 freq
clairty - 8 freq
clartie - 4 freq
clartit - 15 freq
clarts - 2 freq
claart - 1 freq
clarty - 56 freq
clartet - 3 freq
clarted - 14 freq
clarten - 1 freq
clart - 40 freq
chuilyaird's - 1 freq
claret - 8 freq
clort - 3 freq
clortit - 2 freq
cleert - 1 freq
clert - 1 freq
clarity - 13 freq
'clart - 1 freq
clartier-lookin - 1 freq
clairtit - 3 freq
clearoot - 1 freq
clairt - 5 freq
colourit - 1 freq
clearit - 2 freq
caleeried - 1 freq
clartlessly - 1 freq
clartians - 2 freq
cloured - 3 freq
cloort - 1 freq
collared - 1 freq
cellardyke - 1 freq
clorty - 5 freq
clairtie - 2 freq
clarried - 3 freq
clairts - 1 freq
clartin - 4 freq
clerty - 2 freq
colorado - 1 freq
cullourit - 1 freq
clourt - 1 freq
clairet - 1 freq
cailyardis - 1 freq
€œcoloured - 1 freq
coleridge - 1 freq
culort - 1 freq
clortiest - 1 freq
clertit - 1 freq
€œclart - 1 freq
clerted - 1 freq
claried - 1 freq
claireeadam - 1 freq
cllrdownie - 1 freq
clareadamsonsnp - 2 freq
collart - 1 freq
clerty” - 1 freq
clear-thinking - 1 freq
MetaPhone code - KLRT
coloured - 35 freq
gollered - 11 freq
cleart - 9 freq
cleared - 56 freq
cloored - 3 freq
colourt - 6 freq
clairty - 8 freq
gollert - 7 freq
clartie - 4 freq
claart - 1 freq
clarty - 56 freq
clart - 40 freq
claret - 8 freq
clort - 3 freq
cleert - 1 freq
glaurt - 1 freq
queelrod - 6 freq
clert - 1 freq
clarity - 13 freq
'clart - 1 freq
clearoot - 1 freq
clairt - 5 freq
colourit - 1 freq
galleried - 1 freq
clearit - 2 freq
glared - 2 freq
gloried - 1 freq
caleeried - 1 freq
cloured - 3 freq
cloort - 1 freq
collared - 1 freq
clorty - 5 freq
clairtie - 2 freq
clarried - 3 freq
kulirt - 1 freq
gollared - 1 freq
gullert - 18 freq
glert - 1 freq
guller't - 2 freq
clerty - 2 freq
kullirt - 1 freq
klerty - 2 freq
colorado - 1 freq
cullourit - 1 freq
clourt - 1 freq
clairet - 1 freq
€œcoloured - 1 freq
culort - 1 freq
€œclart - 1 freq
claried - 1 freq
collart - 1 freq
clerty” - 1 freq
CLARITY
Time to execute Levenshtein function - 0.196092 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.353897 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028228 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037983 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000809 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.