100K static datamapplot
UMAP's two knobs
Sweep, don't set.
Same pipeline · different modality
Swap the encoder.
100,000 films · labelled
The labelling layer got smarter.
The beautiful
Catalogs become constellations.

All models are wrong,

but some are useful.

— George E. P. Box, 1976

Clustering is a pipeline,

not a button.

And every stage has an LLM-era upgrade you might not be using yet.

Built with

Open source — mostly.

Hugging Face
PyTorch
scikit-learn
Anthropic Claude
pandas
NumPy
JupyterLab
TMDB
UMAP
HDBSCAN
BERTopic
datamapplot
Plotly
Matplotlib
Manim
EVoC
ODSC East 2026

Thank you.

Seth Levine · Director of AI Innovation, Contentsquare

Repo QR code
github.com/splevine/clustering-good-bad-beautiful
All notebooks, data scripts, and visualizations. Colab-ready.