A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

Similar words to cluster in Corpus

Methods: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated

Levenshtein
cluster (0) - 5 freq
fluster (1) - 1 freq
clusters (1) - 5 freq
bluster (1) - 8 freq
clister (1) - 1 freq
clutter (1) - 2 freq
blustery (2) - 15 freq
coster (2) - 1 freq
ulster (2) - 420 freq
flusters (2) - 1 freq
muster (2) - 7 freq
blouster (2) - 5 freq
clugger (2) - 4 freq
caster (2) - 2 freq
chunter (2) - 1 freq
flutter (2) - 9 freq
bouster (2) - 1 freq
glister (2) - 8 freq
plister (2) - 1 freq
houster (2) - 1 freq
coulter (2) - 6 freq
gouster (2) - 5 freq
couser (2) - 2 freq
culter (2) - 1 freq
clustered (2) - 1 freq
Double Levenshtein
cluster (0) - 5 freq
clister (1) - 1 freq
cloister (2) - 2 freq
clusters (2) - 5 freq
clutter (2) - 2 freq
bluster (2) - 8 freq
fluster (2) - 1 freq
alister (3) - 1 freq
closser (3) - 13 freq
culter (3) - 1 freq
closer (3) - 138 freq
clustered (3) - 1 freq
slester (3) - 4 freq
lister (3) - 6 freq
plaster (3) - 14 freq
blaster (3) - 3 freq
plester (3) - 2 freq
coaster (3) - 18 freq
clatter (3) - 61 freq
blister (3) - 3 freq
chester (3) - 1 freq
blustery (3) - 15 freq
caster (3) - 2 freq
glister (3) - 8 freq
coster (3) - 1 freq
SoundEx code - C423
clocked - 32 freq
cleekit - 23 freq
claggit - 6 freq
clased - 2 freq
claucht - 54 freq
cloacked - 11 freq
cleeked - 11 freq
cleckit - 6 freq
clasht - 4 freq
chiels-that - 1 freq
collect - 29 freq
clicked - 12 freq
chalked - 3 freq
collected - 16 freq
closed - 126 freq
cleekt - 2 freq
calloused - 2 freq
clouston - 4 freq
celeste - 3 freq
claustrophobic - 1 freq
collection - 81 freq
cloakit - 2 freq
cloggit - 2 freq
clickt - 2 freq
claesed - 2 freq
collective - 31 freq
closet - 11 freq
collectin't - 1 freq
clookit - 2 freq
closset - 1 freq
collections - 26 freq
clauchts - 2 freq
cloister - 2 freq
claikit - 2 freq
cleikit - 12 freq
clusters - 5 freq
celestal - 1 freq
cluster - 5 freq
cullecten - 1 freq
clocket - 1 freq
collegiate - 1 freq
collectin - 17 freq
coalesced - 2 freq
collectit - 8 freq
celestial - 4 freq
cloistert - 2 freq
collogued - 4 freq
colgate - 2 freq
cloaked - 2 freq
clauchtin - 5 freq
collectors - 9 freq
claised - 1 freq
calcutta - 5 freq
collector - 7 freq
clockit - 2 freq
claustrophobia - 2 freq
coolest - 1 freq
collects - 2 freq
clacked - 1 freq
clashed - 4 freq
'closed' - 1 freq
clickit - 7 freq
colleckit - 5 freq
collectors' - 1 freq
cailst - 3 freq
chalkt - 1 freq
coalcutting - 1 freq
clouston's - 1 freq
clestered - 1 freq
cellist - 1 freq
coal-shade - 1 freq
claught - 49 freq
claught-warkin - 2 freq
clossed - 2 freq
clustered - 1 freq
clousta - 3 freq
clagged - 1 freq
cloistered - 1 freq
clestrain - 1 freq
collectioin - 1 freq
collectin' - 2 freq
collectives - 1 freq
collectively - 4 freq
co-locate - 1 freq
clessed - 1 freq
cloisters' - 1 freq
clostridium - 1 freq
cleuk-tipt - 1 freq
class-drookit - 1 freq
'collected - 1 freq
claiked - 1 freq
clegg-iddergaits - 1 freq
claacht - 1 freq
cleik't - 1 freq
clekkit - 1 freq
clickety - 1 freq
'collectit - 1 freq
coal-cuttin - 1 freq
colloguit - 1 freq
callisto - 2 freq
collecktive - 1 freq
collekit - 1 freq
closeted - 1 freq
clister - 1 freq
cleukit - 2 freq
clauchit - 1 freq
celsitud - 1 freq
collecting - 4 freq
clockt - 1 freq
collectan - 2 freq
clogged-up - 1 freq
collecit - 2 freq
chalk-stour - 1 freq
cleshed - 1 freq
clushet - 12 freq
cliched - 1 freq
collocations - 2 freq
cloased - 1 freq
classed - 4 freq
clock-tower - 1 freq
collieston - 1 freq
cleg-tipping - 1 freq
clusterbourach - 1 freq
closetaehame - 1 freq
clichéd - 1 freq
clstevenson - 1 freq
MetaPhone code - KLSTR
glister - 8 freq
glaister - 2 freq
cloister - 2 freq
cluster - 5 freq
clister - 1 freq
glistery - 1 freq
Manually curated - CLUSTER
Time to execute Levenshtein function - 0.194160 milliseconds
The Levenshtein distance is the number of single-character replacements, insertions or deletions needed to transform one word into another. It's useful for detecting typos and alternative spellings.
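The definition above can be sketched in a few lines of Python. This is a minimal illustration using the standard dynamic-programming approach, not the site's own code; the function name `levenshtein` is chosen here for convenience.

```python
def levenshtein(a: str, b: str) -> int:
    """Count the single-character edits needed to turn a into b."""
    # Keep the shorter word in the inner loop so each row stays small.
    if len(a) < len(b):
        a, b = b, a
    # previous[j] is the distance between the empty prefix of a and b[:j].
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]  # turning a[:i] into "" takes i deletions
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(
                previous[j] + 1,         # delete char_a
                current[j - 1] + 1,      # insert char_b
                previous[j - 1] + cost,  # replace, or keep if equal
            ))
        previous = current
    return previous[-1]


print(levenshtein("cluster", "bluster"))  # 1
print(levenshtein("cluster", "ulster"))   # 2
```

Both values agree with the distances shown in the Levenshtein list for cluster.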
Time to execute Double Levenshtein function - 0.383198 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
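A rough sketch of this two-pass idea follows. It is not the site's implementation: the names `strip_vowels` and `double_levenshtein` are made up for illustration, and treating y as a vowel is an assumption, made only because it reproduces the distance of 3 listed for blustery.

```python
def levenshtein(a: str, b: str) -> int:
    """Standard edit distance: replacements, insertions and deletions."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


# Assumption: y counts as a vowel (see the note above the code).
VOWELS = set("aeiouy")


def strip_vowels(word: str) -> str:
    """Drop the vowels, leaving the consonant skeleton of the word."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the vowel-less words."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("cluster", "clister"))   # 1
print(double_levenshtein("cluster", "blustery"))  # 3
```

A vowel change such as cluster to clister costs 1, because the consonant skeletons are identical, while a consonant change such as cluster to bluster is counted in both passes and costs 2.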
Time to execute SoundEx function - 0.028150 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
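To show how such a code is built, here is a minimal sketch of the classic American Soundex rules. It is an illustration and may differ in edge cases from the function the site actually calls; ignoring everything except the letters a to z is an assumption of this sketch.

```python
def soundex(word: str) -> str:
    """Return a four-character Soundex code: one letter plus three digits."""
    groups = (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
              ("l", "4"), ("mn", "5"), ("r", "6"))
    codes = {ch: digit for letters, digit in groups for ch in letters}

    # Assumption: only the letters a-z are considered; anything else is dropped.
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""

    result = letters[0].upper()
    previous = codes.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters with the same code
        digit = codes.get(ch, "")  # vowels and y have no code
        if digit and digit != previous:
            result += digit
        previous = digit  # a vowel resets this, so a repeated code counts again
    return (result + "000")[:4]  # pad or truncate to four characters


print(soundex("cluster"))  # C423
print(soundex("closed"))   # C423
```

Because the code is cut off after three digits, cluster and closed end up with the same code, which is why the SoundEx list is so much longer than the others.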
Time to execute MetaPhone function - 0.041500 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.000876 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
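A lookup of this kind can be sketched as a plain dictionary. The entries below are hypothetical and chosen for illustration only, since the corpus's actual table is not shown here; `LEXICON` and `lookup_lemma` are made-up names.

```python
from typing import Optional

# Hypothetical entries for illustration; not the corpus's real lexicon.
LEXICON = {
    "cluster": "CLUSTER",
    "clusters": "CLUSTER",
    "clustered": "CLUSTER",
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the curated lemma for a word, or None if it is not covered."""
    return LEXICON.get(word.strip().lower())


print(lookup_lemma("Clusters"))  # CLUSTER
print(lookup_lemma("ablow"))     # None
```

A dictionary lookup involves no string comparison across the corpus, which fits with this method having the shortest execution time of the five.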