aht/README.md

Hi there 👋

😄 I'm a software engineer & data scientist. I built innovative tools to democratize data science & machine learning.

💬 Ask me about DL/ML/dataviz & distributed systems

🌱 I’m looking to collaborate on SWE/ML/data for social good

🌱 I’m currently learning about real-time data architecture, AI/tech ethics, EV/battery, sustainability

💬 How to reach me: @climate_dad

😄 Pronouns: he/him

🔭 At Panasonic, I built an automated deep learning (AutoML) system for time-series & IoT data, which can train/tune DL models on ~1B data points. It helps data scientists train/tune LSTM, ResNet, Self-Normalizing Network & Mixture Density Network models on data from S3/Parquet with zero sweat :)

🔭 At Arimo (a top-ranked data science startup per FastCompany), I built an "Alexa for big data analytics" system that answers questions & visualizes large datasets using Apache Spark, NLP, statistical graphics best practices & d3.js: https://youtu.be/3RQDQApgz-4?t=225 (demo at ~3:45).

🤾 Played with distributed deep learning https://github.com/adatao/tensorspark before Horovod, Ray or tf.distribute came around. I was not a principal instigator of this project, but I provided support & optimization.

🤾 I contributed to Golang in the early days, like 10 years ago :). The contribution is an efficient sieve of Eratosthenes using CSP channels, which Rob Pike wanted to keep as a demo/test use case in the main source code as it's quite an interesting concurrent system: https://github.com/aht/gosieve

🤾 Some Python hacking back in the day, when I had lots of free time: fork-exec and pipe with I/O redirection; a lazily-evaluated, parallelizable Python pipeline; agents and functions that modify Python sequences in-place

🤾 I wrote a toy LISP interpreter that supports prefix, postfix & infix operators, just to annoy LISP people: https://github.com/aht/olisp

🤾 An esoteric programming gem: a self-hosting Fractran interpreter in 84 fractions. This was one of those things for which the space available is really too small to explain everything...

🌱 I gave a talk on "Visualization as Data and Data as Visualization" at Strata Hadoop World 2016. As the father of a little girl, I dove into a data-driven story about women leaving coding since the 80s and imagined a world where data & visualizations are easily sharable & infinitely collaborative.

🌱 I gave a talk on "Concurrent programming with Go" circa 2011

🌱 I gave a talk on "whatis git", the stupid content tracker, circa 2012

Pinned

  1. adatao/tensorspark Public archive

    TensorFlow on Spark

    Python 298 101

  2. stream.py Public

    Lazily-evaluated, parallelizable Python pipeline.

    Python 95 11

  3. gosieve Public

    An efficient sieve of Eratosthenes using CSP channels in Go

    Go 29 3

  4. whatisgit Public

    a presentation about git, the stupid content tracker, for the novice and the semi-experienced git user

    JavaScript 11 4

  5. barcamp2011-gonuts Public

    Barcamp 2011 Saigon presentation about golang, with sample code

    JavaScript 1

  6. extproc Public

    fork-exec and pipe with I/O redirection -- make Python a sane alternative to non-trivial shell scripts

    Python 14 1