A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to clartit in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
clartit (0) - 15 freq
clertit (1) - 1 freq
clartie (1) - 4 freq
clortit (1) - 2 freq
clartet (1) - 3 freq
clairtit (1) - 3 freq
clartin (1) - 4 freq
claitht (2) - 1 freq
elatit (2) - 1 freq
dartit (2) - 2 freq
chargit (2) - 2 freq
claimit (2) - 9 freq
courtit (2) - 1 freq
tartit (2) - 1 freq
claspit (2) - 4 freq
spartit (2) - 2 freq
clatt (2) - 5 freq
coortit (2) - 5 freq
slatit (2) - 1 freq
cartin (2) - 1 freq
claikit (2) - 2 freq
lastit (2) - 13 freq
clarity (2) - 13 freq
plantit (2) - 35 freq
clearit (2) - 2 freq
clartit (0) - 15 freq
clartet (1) - 3 freq
clortit (1) - 2 freq
clairtit (1) - 3 freq
clertit (1) - 1 freq
clartie (2) - 4 freq
clartin (2) - 4 freq
cairtit (3) - 11 freq
clearit (3) - 2 freq
clart (3) - 40 freq
flirtit (3) - 1 freq
clarity (3) - 13 freq
alertit (3) - 2 freq
clarts (3) - 2 freq
clarty (3) - 56 freq
clairtie (3) - 2 freq
clarten (3) - 1 freq
claret (3) - 8 freq
coortit (3) - 5 freq
cloutit (3) - 2 freq
chirtit (3) - 1 freq
clarted (3) - 14 freq
clattie (3) - 11 freq
blurtit (3) - 2 freq
courtit (3) - 1 freq
SoundEx code - C463
coloured - 35 freq
cleart - 9 freq
cleared - 56 freq
cloored - 3 freq
colourt - 6 freq
clairty - 8 freq
clartie - 4 freq
clartit - 15 freq
clarts - 2 freq
claart - 1 freq
clarty - 56 freq
clartet - 3 freq
clarted - 14 freq
clarten - 1 freq
clart - 40 freq
chuilyaird's - 1 freq
claret - 8 freq
clort - 3 freq
clortit - 2 freq
cleert - 1 freq
clert - 1 freq
clarity - 13 freq
'clart - 1 freq
clartier-lookin - 1 freq
clairtit - 3 freq
clearoot - 1 freq
clairt - 5 freq
colourit - 1 freq
clearit - 2 freq
caleeried - 1 freq
clartlessly - 1 freq
clartians - 2 freq
cloured - 3 freq
cloort - 1 freq
collared - 1 freq
cellardyke - 1 freq
clorty - 5 freq
clairtie - 2 freq
clarried - 3 freq
clairts - 1 freq
clartin - 4 freq
clerty - 2 freq
colorado - 1 freq
cullourit - 1 freq
clourt - 1 freq
clairet - 1 freq
cailyardis - 1 freq
€œcoloured - 1 freq
coleridge - 1 freq
culort - 1 freq
clortiest - 1 freq
clertit - 1 freq
€œclart - 1 freq
clerted - 1 freq
claried - 1 freq
claireeadam - 1 freq
cllrdownie - 1 freq
clareadamsonsnp - 2 freq
collart - 1 freq
clerty” - 1 freq
clear-thinking - 1 freq
MetaPhone code - KLRTT
clartit - 15 freq
clartet - 3 freq
clarted - 14 freq
clortit - 2 freq
clairtit - 3 freq
clertit - 1 freq
clerted - 1 freq
CLARTIT
Time to execute Levenshtein function - 0.301202 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.553407 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.059643 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.039557 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000792 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.