Repo for work pertaining to TSNE, Conj, and English Embeddings
Branch: master
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
doc
renderings
resources
src/conj_2016
test/conj_2016
.gitignore
LICENSE
README.md
_config.yml
circle.yml
project.clj

README.md

conj-2016

See the talk here!

This repo was used to prepare the talk given by Alex Mann at Cognitect's 2016 Conj Conference. It includes a standard implementation of tSNE, examples of data rendered this way, a novel implementation of interop between Clojure and Python, a number of datasets which can be rendered into Clojure objects, and some examples of generatives testing.

Credit

I want to start by citing the sources that helped me get this far. This list is by no means exhaustive as there are many blogs and whitepapers I consumed where the information remains and the name has fled.

tSNE Groundwork

SENNA

Datasets

I lifted datasets from the following places:

Expected Use

Tests

lein test

nREPL

I got sick of starting a headless repl, so the following will start a session at port 54321.

lein nrepl

Rendering SVGs

There are examples of SVG rendering presented in the core namespace in the comments below. The gist is though, to run data through tSNE, then pipe it into spit-svg. Pretty straightforward!

Example Renderings

Collobert and Weston word embeddings of the english language's top 3000 words

Alt text

The MNIST dataset

Alt text