Sonic Library Map

Note: This project was built with Claude Code. Code, documentation, and commit messages were AI-generated with human direction and review.

Maps your Spotify library as an interactive 2D scatter plot of Essentia + Discogs-EffNet audio embeddings. It also groups tracks with HDBSCAN and draws the resulting clusters on the plot.

Audio for each track is sourced from YouTube Music, analyzed with Essentia's Discogs-EffNet model (1280-dimensional learned embeddings that capture musical similarity), and projected to 2D via UMAP.
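To make "musical similarity" concrete, here is a minimal sketch of comparing embedding vectors directly. It assumes cosine similarity as the comparison, which is one common choice for learned embeddings and is not confirmed as the metric this project passes to UMAP. The names cosine_similarity and nearest_tracks are hypothetical, and the vectors in the usage note are toy 2-dim stand-ins for the 1280-dim embeddings.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def nearest_tracks(query_id, embeddings, k=3):
    """Return the ids of the k tracks most similar to query_id, most similar first.

    `embeddings` maps a track id to its embedding vector (hypothetical layout).
    """
    query = embeddings[query_id]
    scored = [
        (cosine_similarity(query, vector), track_id)
        for track_id, vector in embeddings.items()
        if track_id != query_id
    ]
    scored.sort(reverse=True)  # highest similarity first
    return [track_id for _, track_id in scored[:k]]
```

For example, with embeddings = {"a": [1.0, 0.0], "b": [0.9, 0.1], "c": [0.0, 1.0]}, calling nearest_tracks("a", embeddings, k=2) ranks "b" ahead of "c". UMAP aims to keep such near neighbours close together in the 2D plot.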

Try an interactive demo at https://josyoo.com/work/spotify-umap/.

Stack

Layer	Technology
Frontend	Next.js 16, React 19, TypeScript, Tailwind CSS 4
Visualization	D3.js on Canvas (not SVG — performs well at 5000+ points)
Auth	Spotify OAuth 2.0 + `iron-session` encrypted cookies
Database	SQLite via `better-sqlite3` (WAL mode, 24h cache TTL)
Rate limiting	`p-queue` (concurrency 10, 25 req/sec)
Audio analysis	Essentia + Discogs-EffNet TF model (1280-dim learned musical embeddings)
Audio sourcing	ytmusicapi (search) + `yt-dlp` (download)
ML sidecar	FastAPI + `umap-learn` + `hdbscan` + Essentia (Python 3.11)
Deployment	Docker Compose on Oracle Cloud VPS behind Cloudflare tunnel
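The database row above (WAL mode, 24h cache TTL) can be illustrated with Python's standard sqlite3 module. This is a sketch of the idea, not the project's better-sqlite3 code: the table name cache, its columns, and the functions open_cache, put, and get are all assumptions made for the example.

```python
import sqlite3
import time

TTL_SECONDS = 24 * 60 * 60  # 24h cache TTL, as listed in the stack table


def open_cache(path):
    """Open the cache database and request write-ahead logging."""
    conn = sqlite3.connect(path)
    conn.execute("PRAGMA journal_mode=WAL")
    # Hypothetical schema: one text payload per cache key.
    conn.execute(
        "CREATE TABLE IF NOT EXISTS cache ("
        "key TEXT PRIMARY KEY, payload TEXT NOT NULL, fetched_at REAL NOT NULL)"
    )
    return conn


def put(conn, key, payload, now=None):
    """Store or overwrite a payload, stamping it with the fetch time."""
    now = time.time() if now is None else now
    conn.execute(
        "INSERT OR REPLACE INTO cache (key, payload, fetched_at) VALUES (?, ?, ?)",
        (key, payload, now),
    )
    conn.commit()


def get(conn, key, now=None):
    """Return the cached payload, or None if it is missing or older than the TTL."""
    now = time.time() if now is None else now
    row = conn.execute(
        "SELECT payload, fetched_at FROM cache WHERE key = ?", (key,)
    ).fetchone()
    if row is None or now - row[1] > TTL_SECONDS:
        return None
    return row[0]
```

Passing now explicitly keeps the expiry logic easy to test; a caller that gets None back would refetch from Spotify and call put again.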

Setup

Prerequisites

Node.js 20+
conda / mamba / micromamba (the sidecar env is defined in umap-service/environment.yml; essentia-tensorflow is available only from conda-forge and as a pre-release PyPI wheel, so a pip-only install is brittle)
A Spotify Developer App with your email added under User Management
YouTube Music browser auth headers (for audio sourcing — see ytmusicapi setup)

1. Configure environment

cp .env.example .env.local

Fill in SPOTIFY_CLIENT_ID, SPOTIFY_CLIENT_SECRET, and generate a random SESSION_SECRET (32+ chars). Set the redirect URI in your Spotify app to:

http://127.0.0.1:3000/api/auth/callback

Use 127.0.0.1, not localhost — Spotify rejects HTTP localhost redirect URIs.
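If you want a quick way to produce the SESSION_SECRET and sanity-check the redirect URI, the following sketch uses only the Python standard library. The helper names generate_session_secret and check_redirect_uri are hypothetical and not part of this repository.

```python
import secrets
from urllib.parse import urlparse


def generate_session_secret(num_bytes=32):
    """Return a URL-safe random string; 32 random bytes encode to 43 characters."""
    return secrets.token_urlsafe(num_bytes)


def check_redirect_uri(uri):
    """Raise ValueError for an http://localhost redirect URI."""
    parsed = urlparse(uri)
    if parsed.scheme == "http" and parsed.hostname == "localhost":
        raise ValueError("use 127.0.0.1 instead of localhost for HTTP redirect URIs")
```

Paste the output of generate_session_secret() into .env.local as SESSION_SECRET, and run check_redirect_uri("http://127.0.0.1:3000/api/auth/callback") to confirm the URI passes.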

2. Run in development

Create the sidecar env once:

cd umap-service
conda env create -f environment.yml  # or: mamba / micromamba

Then run both services:

# terminal 1 — Python sidecar
cd umap-service
conda activate spotify-project
uvicorn main:app --host 127.0.0.1 --port 8000

# terminal 2 — Next.js app
cd next-app
npm install
npm run dev

Open http://127.0.0.1:3000 (not localhost — cookies are domain-scoped).

3. Run with Docker Compose

docker compose up --build

This starts both the Next.js app (port 3000) and the Python UMAP sidecar (port 8000, internal only). The web service waits for the UMAP health check before starting.
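Compose handles the wait for you, and how the compose file configures it is not shown here. When running the two services by hand, the same "wait until healthy" idea can be sketched as a polling loop. The sidecar does expose /health (see Key directories), but treating any HTTP 200 as healthy is an assumption, and http_ok and wait_until are hypothetical helper names.

```python
import time
import urllib.request


def http_ok(url, request_timeout=2.0):
    """Return True if a GET to `url` answers with HTTP 200, False on any network error."""
    try:
        with urllib.request.urlopen(url, timeout=request_timeout) as response:
            return response.status == 200
    except OSError:  # covers URLError, HTTPError, refused connections, timeouts
        return False


def wait_until(check, timeout=60.0, interval=1.0):
    """Call `check()` repeatedly until it returns True or `timeout` seconds pass."""
    deadline = time.monotonic() + timeout
    while True:
        if check():
            return True
        if time.monotonic() >= deadline:
            return False
        time.sleep(interval)
```

A launcher script could call wait_until(lambda: http_ok("http://127.0.0.1:8000/health")) before starting the Next.js app.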

Architecture

User  ->  Spotify OAuth login
      ->  Fetch saved tracks + playlists + artist genres  (paginated, rate-limited)
      ->  Cache in SQLite
      ->  Python sidecar: search YouTube Music  ->  download audio  ->  Essentia feature extraction
      ->  Cache features + YouTube link in SQLite (audio retained during dev, discarded in prod)
      ->  UMAP on audio features  ->  2D coordinates
      ->  D3 renders interactive scatter plot with playlist boundaries
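The "paginated, rate-limited" fetch step pairs with the p-queue limits in the stack table (concurrency 10, 25 req/sec). The sketch below shows the same two limits in Python with asyncio. It is an illustration, not the project's TypeScript code: RateLimiter, fetch_all, and fetch_page are hypothetical names, offset/limit paging with a page size of 50 is assumed, and request starts are spaced evenly (one every 40 ms), which is stricter than a windowed cap that allows bursts.

```python
import asyncio
import time


class RateLimiter:
    """Allow at most `concurrency` calls in flight and `per_second` call starts per second."""

    def __init__(self, concurrency=10, per_second=25.0):
        self._semaphore = asyncio.Semaphore(concurrency)
        self._lock = asyncio.Lock()
        self._min_gap = 1.0 / per_second
        self._next_start = 0.0  # monotonic time at which the next call may start

    async def run(self, make_call):
        """Await `make_call()` once both limits allow it, and return its result."""
        async with self._semaphore:
            async with self._lock:
                now = time.monotonic()
                wait = self._next_start - now
                # Reserve the next start slot, one gap after this call's slot.
                self._next_start = max(now, self._next_start) + self._min_gap
            if wait > 0:
                await asyncio.sleep(wait)
            return await make_call()


async def fetch_all(fetch_page, total, page_size=50, limiter=None):
    """Fetch every page through the limiter and return the items in page order.

    `fetch_page(offset, limit)` is a caller-supplied coroutine function that
    returns a list of items for that page.
    """
    limiter = limiter or RateLimiter()
    offsets = range(0, total, page_size)
    pages = await asyncio.gather(
        *(limiter.run(lambda o=o: fetch_page(o, page_size)) for o in offsets)
    )
    return [item for page in pages for item in page]
```

asyncio.gather returns results in the order the requests were submitted, so the combined list keeps the library's original ordering even though pages complete out of order.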

Key directories

next-app/src/
  app/
    api/auth/{login,callback,logout,refresh}/  -- OAuth routes
    api/library/                               -- SSE streaming data fetch
    dashboard/                                 -- scatter plot page
  components/
    ScatterPlot.tsx    -- D3 Canvas renderer (zoom, pan, quadtree hit-test)
    SongTooltip.tsx    -- hover card with album art
    PlaylistLegend.tsx -- color-coded playlist toggles
    LibraryLoader.tsx  -- SSE progress bar
  lib/
    spotify.ts  -- paginated Spotify API wrapper
    auth.ts     -- iron-session config
    db.ts       -- SQLite cache layer
    types.ts    -- shared TypeScript interfaces

umap-service/
  main.py            -- FastAPI: /umap, /cluster, /features, /health
  tf_extract.py      -- Discogs-EffNet TF embedding extraction (1280-dim)
  feature_extract.py -- raw spectral feature extraction (41-dim)
  audio_source.py    -- ytmusicapi search + yt-dlp download + SQLite cache

Static demo export

The Next app ships a helper that snapshots the library + UMAP coords + cluster labels + raw features into JSON files for a read-only static deployment:

cd next-app
# sidecar must be running at http://127.0.0.1:8000 for cluster labels
node scripts/export-demo.mjs [outDir]

Default outDir is ../../site/public/demo/spotify — adjust as needed. Output bundle is roughly 900 KB for a 900-track library.

Known limitations

Spotify audio features unavailable: Spotify deprecated the /audio-features endpoint for new apps in November 2024, and preview_url returns null for all tracks. Audio features are instead extracted via YouTube Music (search with ytmusicapi → download with yt-dlp → analyze with Essentia). Audio files are retained during development (in umap-service/data/audio/) so that changing the extraction method does not require re-downloading; a production deployment should delete them after processing.
YouTube Music browser auth expires: The ytmusicapi browser auth cookies need periodic re-authentication.
TF embeddings require re-extraction: Existing tracks with only raw spectral features need re-download for Discogs-EffNet TF embedding extraction (YouTube links are cached, so search is skipped).
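The last point relies on cached YouTube links letting re-extraction skip the search step. A minimal sketch of that lookup follows. The function resolve_youtube_link, the dict-like link_cache, and the search callable are hypothetical stand-ins; the project's actual cache lives in SQLite via audio_source.py.

```python
def resolve_youtube_link(track_key, link_cache, search):
    """Return (link, searched): the cached link if present, else the search result.

    `link_cache` is any dict-like store; `search` is a callable that takes the
    track key and returns a link or None.
    """
    cached = link_cache.get(track_key)
    if cached is not None:
        return cached, False  # cache hit: no search request made
    link = search(track_key)
    if link is not None:
        link_cache[track_key] = link  # remember it for later re-extraction
    return link, True
```

The second element of the result makes it easy to count how many searches a re-extraction run actually performed.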

License

GPL-3.0
