kosmitive
Pinned

  1. burrolib Public

    Burrolib is a library for multi-agent Markov games, aimed at researchers. It considers Markov games from an economic perspective. The modular agent design allows different agent implementations f…

    Python 4

  2. bootstrapped-dqn Public

    An implementation of bootstrapped DQN (https://arxiv.org/abs/1602.04621). It was created during my bachelor thesis at TU Darmstadt, and you can find the thesis at http://www.ias.tu-darmstadt.de/uploads/…

    Python 1

  3. sticky-hdp-slds-hmm Public

    An implementation of a hierarchical Dirichlet process (HDP) combined with a switching linear dynamical system (SLDS), following https://arxiv.org/abs/1003.3829. It is a rather complex model and thus com…

    Python 1

  4. abstract_rl Public

    A modular Python implementation of various policy gradient algorithms for use in control problems on experimental Quanser robots. This repository includes implementations of Maximum A Posteriori Po…

    Python 2