CedarStreetGarage/python-keras-neural-network-inverse-kinematics
Neural Network Kinematics

This project seeks to train a neural network to perform inverse kinematics for rigid body link chains. This is by no means a new idea; trained solutions to relatively complex nonlinear equations have been sought for decades, and inverse kinematics is a discipline where, in many cases, both iterative and closed-form solutions exist. More generally, this is a study of minimizing error when using neural networks to learn arbitrary nonlinear mappings.

Background

There are three fairly common ways to perform inverse kinematics for rigid link robots:

Closed Form Solutions These are explicit solutions to the inverse kinematics. They evaluate very quickly and are very accurate, but may be impossible to derive for sufficiently complex robotic systems. Moreover, additional logic may be required because there may be more than one solution, so choosing the solution appropriate to the context is important. The Jacobian is also available in closed form, so there is explicit knowledge of the manipulability and the associated joint angular velocity situations that are important for practical robots. The bottom line is that the types of robots closed-form solutions apply to are limited, but the precision and controllability are high. These solutions are seen in precision industrial and manufacturing robots.
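As a concrete illustration of a closed-form solution (and of the multiple-solution issue it raises), here is a minimal sketch for a planar 2R arm rather than the 3R robot used later in this project; the function names `ik_2r` and `fk_2r` and the link lengths are assumptions for the example, not code from this repository:

```python
import math

def ik_2r(x, y, l1, l2, elbow_up=True):
    """Closed-form inverse kinematics for a planar 2R arm.

    Returns (theta1, theta2) placing the end effector at (x, y).
    The elbow_up flag selects between the two valid solutions,
    illustrating why context logic is needed.
    """
    r2 = x * x + y * y
    # Law of cosines for the elbow angle; clamp for numerical safety.
    c2 = (r2 - l1 * l1 - l2 * l2) / (2.0 * l1 * l2)
    c2 = max(-1.0, min(1.0, c2))
    theta2 = math.acos(c2)
    if not elbow_up:
        theta2 = -theta2
    theta1 = math.atan2(y, x) - math.atan2(l2 * math.sin(theta2),
                                           l1 + l2 * math.cos(theta2))
    return theta1, theta2

def fk_2r(t1, t2, l1, l2):
    """Forward kinematics, used here to verify the inverse solution."""
    x = l1 * math.cos(t1) + l2 * math.cos(t1 + t2)
    y = l1 * math.sin(t1) + l2 * math.sin(t1 + t2)
    return x, y
```

Both the elbow-up and elbow-down branches map back to the same end-effector position through the forward kinematics, which is exactly the ambiguity a practical solver must resolve.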

Iterative Solutions In this case the forward kinematics are well known, but the inverse kinematics are more challenging to solve for explicitly. The Jacobian is often still known, so manipulability and practical concerns about joint angular velocity can still be assessed. The problem is that it is often unknown how long the solution will take to compute, as that depends on the specific equations being solved, where on the manifold the solution lies, the initial guess, and so on. While this method offers precision, it isn't always appropriate, since there is no guarantee on the time to a solution, or that the solver will find one at all (e.g. due to problem conditioning). This type of solution is more commonly seen in unusually complex rigid body robots where there is no constraint on the time required to solve for the joint angles.
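A common iterative scheme is damped least-squares iteration on the Jacobian; a minimal sketch, again using a planar 2R arm as a stand-in (the names `fk`, `jacobian`, and `ik_iterative` are illustrative, not from this repository):

```python
import numpy as np

def fk(theta, l1=1.0, l2=1.0):
    """Forward kinematics of a planar 2R arm."""
    t1, t2 = theta
    return np.array([l1 * np.cos(t1) + l2 * np.cos(t1 + t2),
                     l1 * np.sin(t1) + l2 * np.sin(t1 + t2)])

def jacobian(theta, l1=1.0, l2=1.0):
    """Analytic Jacobian of fk with respect to the joint angles."""
    t1, t2 = theta
    return np.array([[-l1 * np.sin(t1) - l2 * np.sin(t1 + t2), -l2 * np.sin(t1 + t2)],
                     [ l1 * np.cos(t1) + l2 * np.cos(t1 + t2),  l2 * np.cos(t1 + t2)]])

def ik_iterative(target, theta0, damping=1e-3, tol=1e-8, max_iter=200):
    """Damped least-squares iteration toward a Cartesian target.

    The iteration count depends on the initial guess and the
    conditioning of J, which is exactly the drawback noted above.
    """
    theta = np.asarray(theta0, dtype=float)
    for _ in range(max_iter):
        err = target - fk(theta)
        if np.dot(err, err) < tol:
            break
        J = jacobian(theta)
        # Damping keeps the update bounded near singularities.
        theta = theta + J.T @ np.linalg.solve(J @ J.T + damping * np.eye(2), err)
    return theta
```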

Neural Network Solutions In this case a neural network learns the inverse kinematics from forward kinematic training data. The Jacobian can be known, but is not integral to the solution. The idea is that this form of solution offers more flexibility (the ability to resolve multiple possible solutions by training from the forward kinematics) with a consistent evaluation time, at the expense of precision. Since the Jacobian itself is not part of the training, it can still be used for determining manipulability and joint angular velocity, though this is not central to the technique. Because this type of solution does not offer high accuracy, it would not be found in industrial and manufacturing robots. However, for classes of robots that do not require precision, and for which manipulability may not be of paramount importance, this type of solution is quite flexible in that only the forward kinematics needs to be known. Since this is of little relevance to most industrial problems, it might not be considered a common solution.

Goals

The general goal of this project is somewhat academic, in that it is an avenue for me to learn about the practical implementation challenges of all of these techniques. There is also an aspect of it related to creating my own library for kinematics. In the context of the stated objective of using neural networks as inverse kinematics solvers, the goal is to understand which network structures work best for inferring the inverse kinematics from forward kinematic training, particularly near singularities.

Existing Work

There are a number of works focusing on neural networks for robotic kinematics and inverse kinematics, both using generic multiple layer perceptron networks as well as radial basis function networks:

There are also some peripherally useful articles having to do with planar and 3-RRR robot configurations:

There are also several books pertaining to control theory that cover neural network solutions.

In this study I am using a 3R robot. The rigid body mechanics are well studied, and there are focused articles relating to space division on the basis of the 3R singularity set that are relevant to understanding the style of manifold we intend to learn:

Another interesting and relevant investigation is the work of Rolnick and Tegmark on natural function expression, and augmented by Lin on why deep learning works as well as it does:

The reason for the inclusion of this should be clear -- we are attempting to learn a manifold, hence the representations that are learnable for the manifold are of crucial importance.

Link Description Methodology

The robot kinematics studied here are for a 3R robot. I chose it for simplicity: although I have an industrial 6R that I could use, the 3R is handy given the abundance of inexpensive servo-driven 3R robotics models available, making it simple for anyone to independently verify the results on physical hardware. 3R robots admit a slightly richer set of singularities than robots like the Stanford arm and Cartesian (gantry) robots, and their forward and inverse kinematics have well known analytic solutions.

In this work I am using the Denavit-Hartenberg convention for computing the composite homogeneous transformation. For training, I am strictly using the forward kinematics to produce a set of features (Cartesian position of the end effector) and labels (joint angles). Why use the forward kinematic mapping rather than the inverse kinematics, which would give more consistent sampling of the desired Cartesian end effector space? Because the ultimate goal is to be able to actuate a robot without knowing how to find a closed form inverse mapping, and to understand how to create and train a model that provides acceptable accuracy.
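The feature/label generation described above can be sketched as follows, using a planar 2R arm as a simplified stand-in for the 3R chain (the function `make_dataset` is illustrative; the project's actual generator lives in src/generator.py):

```python
import numpy as np

def make_dataset(n, l1=1.0, l2=1.0, seed=0):
    """Sample joint angles uniformly and run the forward kinematics.

    Features are end-effector positions; labels are the joint angles
    that produced them. Note the sampling is uniform in joint space,
    not in Cartesian space, as discussed above.
    """
    rng = np.random.default_rng(seed)
    labels = rng.uniform(-np.pi, np.pi, size=(n, 2))   # (theta1, theta2)
    t1, t2 = labels[:, 0], labels[:, 1]
    features = np.stack([l1 * np.cos(t1) + l2 * np.cos(t1 + t2),
                         l1 * np.sin(t1) + l2 * np.sin(t1 + t2)], axis=1)
    return features, labels
```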

The general notion of the Denavit-Hartenberg convention used in this code is expressed in this link diagram:

Denavit-Hartenberg Parameters

After establishing a link chain based on the Denavit-Hartenberg parameters, a composite homogeneous transformation is produced. Summary information about the transformation and the Jacobian is provided. The forward transformation is then evaluated to train the network.
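As a concrete illustration of the composite transformation, here is a minimal numeric Denavit-Hartenberg sketch (the project itself does this symbolically with sympy; the helpers `dh` and `composite` are assumptions for the example):

```python
import numpy as np

def dh(theta, d, a, alpha):
    """Single-link Denavit-Hartenberg homogeneous transformation:
    Rz(theta) @ Tz(d) @ Tx(a) @ Rx(alpha)."""
    ct, st = np.cos(theta), np.sin(theta)
    ca, sa = np.cos(alpha), np.sin(alpha)
    return np.array([[ct, -st * ca,  st * sa, a * ct],
                     [st,  ct * ca, -ct * sa, a * st],
                     [ 0,       sa,       ca,      d],
                     [ 0,        0,        0,      1]])

def composite(params):
    """Chain the per-link transforms: T = A1 @ A2 @ ... @ An."""
    T = np.eye(4)
    for theta, d, a, alpha in params:
        T = T @ dh(theta, d, a, alpha)
    return T
```

For a planar chain (all d and alpha zero) this reduces to the familiar 2R forward kinematics, which makes it easy to sanity-check the composite transform.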

Model Methodology

Code

First, make sure all your libs are up to date:

pip install -r requirements.txt

There are a number of commands that can be invoked through the main.py program in the root of the project. These include:

  • --test Demonstrates the homogeneous transformation matrix for a two link planar robot (testing for the chain)

  • --train Trains the network based on the model and parameters for the generator

  • --infer Runs an inference pass using the model previously generated

  • --table Produces Jacobian and determinant values

  • --ik Computes all symbolic inverse solutions and verifies them through forward transform using sympy

The main objective is for the trained model to perform favorably on the inference test. The other commands are mainly for testing and for validating the Denavit-Hartenberg part of the problem.

The model can be found in src/model.py, the generator in src/generator.py, and the specifics of training the network in src/train.py. It is all fairly self explanatory and easy to find. The one thing it is not is performant, since it is based on sympy evaluation.

Software Results

Physical Robot Results

Future Plans

The original idea, from a practical perspective, was to be able to train inverse kinematics using a robot with unknown forward or inverse kinematics, though with consistent controllability, and image data from uncalibrated cameras. Since this involves a lot of unknowns, the idea of learning the complex mapping is very appealing. So long as there is a consistent ability to actuate joints, the network ought to be able to learn the inverse kinematics. Similarly, the image data from uncalibrated cameras should be able to produce the desired effect within some reasonable tolerance.

In this model, the cameras would feed a convnet. The convnet might have a six-channel input, three for each of two cameras, since the features being detected between the cameras are similar. Or it could simply take the concatenation of multiple images through a typical three-channel input; how easily one wants to transfer learning from one network to another drives some of the decisions on that aspect of the problem. The top end would be similar to the network depicted in this example project: a regression network, either a fully connected network using an activation supporting universal approximation, or a single radial basis function layer with a dense terminal layer. The convnet portion of the network could be trained separately, or even adapted directly from existing trained networks. Once the features are trained in the convnet portion, the weights can be frozen and the regression top end trained. So this is appealing: there is an aspect of training for the specific setup of the cameras, and an aspect of training for unknown mappings and nonlinearities.

About

Training networks to compute forward and inverse kinematics - more generally, nonlinear function estimation with potential singularities
