This repository contains the data for Miniproject 10: Meta-Learning with Reptile in the Cognitive Robotics lecture taught by Prof. Dr. Helge Ritter (AG Neuroinformatik, Universität Bielefeld). The miniproject explores Reptile, a variant of the Model-Agnostic Meta-Learning (MAML) method. For more information on Reptile and its scalability as a meta-learning algorithm, refer to the OpenAI web article "Reptile: A Scalable Meta-Learning Algorithm". Additional insights on first-order meta-learning algorithms can be found in the paper "On First-Order Meta-Learning Algorithms" by A. Nichol, J. Achiam, and J. Schulman.
- read the web article and the accompanying paper, and inspect the code (take details from the original article where missing)
- reproduce the learning of the example function (parameters as in the original paper, i.e. a 1-64-64-1 network, 32 gradient steps, amplitude and phase ranges as above); a minimal training sketch follows this list
- consider the cases of
  - (i) plain learning from a random weight initialization and
  - (ii) learning after weight optimization according to the Reptile algorithm of the example implementation (a fine-tuning comparison of both cases follows this list)
- visualize in each case the results before and after training and compare the outcomes of (i) and (ii) above (a plotting sketch follows this list)
- finally, replace the sine function by the forward kinematics of a two-link robot arm with segment lengths l1 and l2, joint angles (θ1, θ2), and end-effector coordinates (x, y) (and no phase):

  x = l1·cos(θ1) + l2·cos(θ1 + θ2),   y = l1·sin(θ1) + l2·sin(θ1 + θ2)

  (this requires using a network that can transform a pair of joint angles (θ1, θ2) into a pair of end-effector coordinates (x, y); e.g., experiment with a 2-64-64-2-shaped network and again use 10 randomly sampled training points; a forward-kinematics sketch follows this list)
- repeat the above experiments for this case, now visualizing the error as an error surface above the (θ1, θ2) joint-angle space (e.g., visualizing the error as color; an error-surface sketch follows this list)
- create an interactive report about your exploration of the Reptile algorithm
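
A minimal sketch of the sine-regression setup and the Reptile meta-training loop, assuming PyTorch. The network shape (1-64-64-1), the 10 training points per task, and the 32 evaluation gradient steps come from the task description; the amplitude/phase ranges, step sizes, iteration count, and the linear annealing of the outer step size are assumptions filled in from the original article and are marked as such in the comments.

```python
# Minimal Reptile sketch for the sine-regression example (PyTorch).
import numpy as np
import torch
import torch.nn as nn

rng = np.random.RandomState(0)

def sample_sine_task():
    """Sample one regression task y = A * sin(x + b) (ranges assumed from the original article)."""
    amplitude = rng.uniform(0.1, 5.0)
    phase = rng.uniform(0.0, np.pi)
    return lambda x: amplitude * np.sin(x + phase)

def make_model():
    """1-64-64-1 MLP with tanh activations."""
    return nn.Sequential(
        nn.Linear(1, 64), nn.Tanh(),
        nn.Linear(64, 64), nn.Tanh(),
        nn.Linear(64, 1),
    )

def train_on_task(model, f, x, inner_steps, inner_lr):
    """Plain full-batch SGD regression on one task's sampled points."""
    opt = torch.optim.SGD(model.parameters(), lr=inner_lr)
    x_t = torch.tensor(x, dtype=torch.float32).unsqueeze(1)
    y_t = torch.tensor(f(x), dtype=torch.float32).unsqueeze(1)
    loss = None
    for _ in range(inner_steps):
        opt.zero_grad()
        loss = ((model(x_t) - y_t) ** 2).mean()
        loss.backward()
        opt.step()
    return loss.item() if loss is not None else None

def reptile_train(meta_iters=30000, k_points=10, inner_steps=5,
                  inner_lr=0.02, outer_lr0=0.1):
    """Reptile outer loop: adapt a copy to one sampled task, then move the initialization toward it."""
    meta_model = make_model()
    for it in range(meta_iters):
        f = sample_sine_task()
        x = rng.uniform(-5.0, 5.0, size=k_points)
        task_model = make_model()
        task_model.load_state_dict(meta_model.state_dict())
        train_on_task(task_model, f, x, inner_steps, inner_lr)
        # Reptile update: phi <- phi + eps * (W_task - phi), with eps linearly annealed (assumed).
        outer_lr = outer_lr0 * (1.0 - it / meta_iters)
        with torch.no_grad():
            for p_meta, p_task in zip(meta_model.parameters(), task_model.parameters()):
                p_meta += outer_lr * (p_task - p_meta)
    return meta_model
```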
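
A sketch of the comparison between (i) plain learning from a random initialization and (ii) learning from the Reptile-optimized initialization, assuming the definitions (`make_model`, `sample_sine_task`, `train_on_task`, `reptile_train`) from the sketch above. Both models are fine-tuned for 32 gradient steps on the same 10 sampled points of a held-out task.

```python
# Fine-tune on a held-out task for 32 gradient steps from both initializations.
import copy
import numpy as np

meta_model = reptile_train()        # (ii) Reptile-optimized initialization (slow: 30k iterations)
random_model = make_model()         # (i) plain random initialization

f_new = sample_sine_task()                          # held-out task
x_train = np.random.uniform(-5.0, 5.0, size=10)     # 10 randomly sampled training points

finetuned_random = copy.deepcopy(random_model)
finetuned_reptile = copy.deepcopy(meta_model)
loss_random = train_on_task(finetuned_random, f_new, x_train, inner_steps=32, inner_lr=0.02)
loss_reptile = train_on_task(finetuned_reptile, f_new, x_train, inner_steps=32, inner_lr=0.02)
print(f"final training loss, random init (i):   {loss_random:.4f}")
print(f"final training loss, Reptile init (ii): {loss_reptile:.4f}")
```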
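
A plotting sketch for the before/after visualization, assuming the models and the held-out task `f_new` from the previous sketch; it overlays the target sine, the predictions before fine-tuning, and the predictions after 32 gradient steps for both initializations.

```python
# Overlay target, pre-fine-tuning, and post-fine-tuning predictions for both cases.
import numpy as np
import torch
import matplotlib.pyplot as plt

x_plot = np.linspace(-5.0, 5.0, 200)
x_plot_t = torch.tensor(x_plot, dtype=torch.float32).unsqueeze(1)

def predict(model):
    with torch.no_grad():
        return model(x_plot_t).squeeze(1).numpy()

fig, axes = plt.subplots(1, 2, figsize=(10, 4), sharey=True)
cases = [
    (axes[0], random_model, finetuned_random, "(i) random initialization"),
    (axes[1], meta_model, finetuned_reptile, "(ii) Reptile initialization"),
]
for ax, before, after, title in cases:
    ax.plot(x_plot, f_new(x_plot), "k--", label="target")
    ax.plot(x_plot, predict(before), label="before fine-tuning")
    ax.plot(x_plot, predict(after), label="after 32 steps")
    ax.scatter(x_train, f_new(x_train), marker="x", color="red", label="training points")
    ax.set_title(title)
    ax.legend()
plt.tight_layout()
plt.show()
```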
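
A sketch of the two-link forward-kinematics target function and the 2-64-64-2 network. The segment-length values are not reproduced here, so they are kept as parameters; sampling them per task (uniformly from [0.5, 1.5]) is only one assumed way to obtain a task family for meta-learning and is not prescribed by the task description.

```python
# Two-link forward kinematics as the target function, and a 2-64-64-2 network.
import numpy as np
import torch.nn as nn

rng_arm = np.random.RandomState(1)

def forward_kinematics(theta, l1, l2):
    """theta: (N, 2) joint angles -> (N, 2) end-effector coordinates (x, y)."""
    t1, t2 = theta[:, 0], theta[:, 1]
    x = l1 * np.cos(t1) + l2 * np.cos(t1 + t2)
    y = l1 * np.sin(t1) + l2 * np.sin(t1 + t2)
    return np.stack([x, y], axis=1)

def sample_arm_task():
    """One task = one arm geometry; per-task segment lengths are an assumption of this sketch."""
    l1, l2 = rng_arm.uniform(0.5, 1.5, size=2)
    return lambda theta: forward_kinematics(theta, l1, l2)

def make_arm_model():
    """2-64-64-2 MLP, analogous to the 1-64-64-1 sine network."""
    return nn.Sequential(
        nn.Linear(2, 64), nn.Tanh(),
        nn.Linear(64, 64), nn.Tanh(),
        nn.Linear(64, 2),
    )

# 10 randomly sampled training points in joint-angle space (assumed range [-pi, pi]^2).
task = sample_arm_task()
theta_train = rng_arm.uniform(-np.pi, np.pi, size=(10, 2))
xy_train = task(theta_train)
```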
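
A sketch of the error-surface visualization over the (θ1, θ2) joint-angle space, assuming `forward_kinematics` and `make_arm_model` from the sketch above; the Euclidean end-effector error of a (fine-tuned) network is evaluated on a dense joint-angle grid and shown as color. The segment lengths used here are placeholders.

```python
# Error surface over the (theta1, theta2) joint-angle space, shown as color.
import numpy as np
import torch
import matplotlib.pyplot as plt

l1, l2 = 1.0, 1.0                       # placeholder segment lengths (assumption)
arm_model = make_arm_model()            # in practice: use the fine-tuned network here

t1, t2 = np.meshgrid(np.linspace(-np.pi, np.pi, 100),
                     np.linspace(-np.pi, np.pi, 100))
theta_grid = np.stack([t1.ravel(), t2.ravel()], axis=1)

with torch.no_grad():
    pred = arm_model(torch.tensor(theta_grid, dtype=torch.float32)).numpy()
true = forward_kinematics(theta_grid, l1, l2)
err = np.linalg.norm(pred - true, axis=1).reshape(t1.shape)

plt.figure(figsize=(5, 4))
plt.pcolormesh(t1, t2, err, shading="auto", cmap="viridis")
plt.colorbar(label="end-effector error")
plt.xlabel(r"$\theta_1$")
plt.ylabel(r"$\theta_2$")
plt.title("Error surface over joint-angle space")
plt.tight_layout()
plt.show()
```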