This work presents a control simulation of a mechanical system using a model-based reinforcement learning (RL) algorithm. It is meant only for data exploration, experimentation, visualization, and understanding the general approach behind MPC and RL.
- Integration of Model Predictive Control (MPC) and RL to control a nonlinear MIMO system
- Utilization of neural networks for online prediction of the local dynamics
Requirements:
- Python 3.8
- TensorFlow 2.3.0
Suppose we have three identical point masses m = 0.5 kg with positions x1, x2, and x3. Each damper has a damping constant d = 0.25 N·s/m. The springs are nonlinear, with the force law

F(Δx) = k·Δx + kp·Δx³,

with k = 217 N/m and kp = 63.5 N/m³.
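As a rough illustration, here is a minimal simulation sketch of such a three-mass chain. The assumed topology (wall, m1, m2, m3 connected in series by identical nonlinear springs and dampers), the sample time dt, and the assignment of u1, dist, and u3 to the three masses are illustration assumptions, not necessarily the exact setup used in this work.

```python
import numpy as np

# Parameters from the problem statement
m, d, k, kp = 0.5, 0.25, 217.0, 63.5   # kg, N*s/m, N/m, N/m^3

def spring_force(dx):
    """Nonlinear spring force F(dx) = k*dx + kp*dx**3."""
    return k * dx + kp * dx**3

def dynamics(state, u):
    """State derivative for an assumed serial chain wall - m1 - m2 - m3.
    state = [x1, x2, x3, v1, v2, v3]; u = [u1, dist, u3] (assumed assignment:
    u1 acts on m1, the disturbance on m2, u3 on m3)."""
    x, v = state[:3], state[3:]
    dx = np.array([x[0], x[1] - x[0], x[2] - x[1]])   # spring elongations
    dv = np.array([v[0], v[1] - v[0], v[2] - v[1]])   # relative velocities
    f = spring_force(dx) + d * dv                     # spring + damper forces
    a = np.array([(-f[0] + f[1] + u[0]) / m,
                  (-f[1] + f[2] + u[1]) / m,
                  (-f[2]         + u[2]) / m])
    return np.concatenate([v, a])

def step(state, u, dt=0.01):
    """One explicit-Euler integration step; dt is an assumed sample time."""
    return state + dt * dynamics(state, u)
```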
This setup is deliberately artificial. The main challenge is that the system is nonlinear and has to be controlled at a very high sampling rate from the start.
The neural network has to learn the dynamics mapping from control inputs, current positions, and past positions to the next positions. For the exploration phase of RL, the system is actuated with random inputs for 40 seconds.
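A minimal sketch of what such a dynamics model and the random-excitation data collection could look like, assuming a small feed-forward architecture with the input layout [past positions, current positions, inputs] and reusing the hypothetical `step` helper from the sketch above; layer sizes, force range, and training settings are placeholders, not the exact values used here.

```python
import numpy as np
import tensorflow as tf

def build_dynamics_model(n_in=9, n_out=3, hidden=64):
    """Feed-forward net mapping [past positions, current positions, inputs]
    to the next positions (architecture and sizes are assumptions)."""
    model = tf.keras.Sequential([
        tf.keras.layers.Dense(hidden, activation="tanh", input_shape=(n_in,)),
        tf.keras.layers.Dense(hidden, activation="tanh"),
        tf.keras.layers.Dense(n_out),
    ])
    model.compile(optimizer="adam", loss="mse")
    return model

# Exploration: actuate the plant with random forces for ~40 s
dt, T = 0.01, 40.0
state = np.zeros(6)
prev_pos = state[:3].copy()
X, Y = [], []
for _ in range(int(T / dt)):
    u = np.random.uniform(-1.0, 1.0, size=3)   # assumed force range
    nxt = step(state, u, dt)                   # `step` from the sketch above
    X.append(np.concatenate([prev_pos, state[:3], u]))
    Y.append(nxt[:3])
    prev_pos, state = state[:3].copy(), nxt

model = build_dynamics_model()
model.fit(np.array(X), np.array(Y), epochs=20, batch_size=64, verbose=0)
```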
The control task is to steer the masses m1 and m3 to their desired reference positions x1_ref and x3_ref. An artificial neural network is used to predict their future positions x1 and x3 based on the forces u1, dist, and u3.
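One common way to turn such a learned model into a controller is random-shooting MPC (as in Nagabandi et al.): sample many candidate input sequences, roll them out through the network, and apply only the first input of the cheapest sequence. A hedged sketch, reusing the hypothetical `model` from above; horizon, candidate count, and force range are assumptions.

```python
import numpy as np

def mpc_action(model, past_pos, cur_pos, x1_ref, x3_ref,
               horizon=10, n_candidates=500):
    """Random-shooting MPC sketch: roll out sampled input sequences through
    the learned model and pick the sequence with the lowest tracking cost
    on x1 and x3."""
    U = np.random.uniform(-1.0, 1.0, size=(n_candidates, horizon, 3))
    cost = np.zeros(n_candidates)
    prev = np.tile(past_pos, (n_candidates, 1))
    cur = np.tile(cur_pos, (n_candidates, 1))
    for t in range(horizon):
        inp = np.concatenate([prev, cur, U[:, t, :]], axis=1)
        nxt = model.predict(inp, batch_size=n_candidates)  # predicted positions
        cost += (nxt[:, 0] - x1_ref) ** 2 + (nxt[:, 2] - x3_ref) ** 2
        prev, cur = cur, nxt
    return U[np.argmin(cost), 0, :]   # receding horizon: apply only the first input
```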
There are two phases in RL: exploration and exploitation. To validate the online training, I ran the following simulation for 40.6 seconds.
With a properly designed reward function and neural network, the algorithm can control the system to its reference positions within 100 time steps.
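For concreteness, a plausible quadratic tracking reward and an online-retraining step might look like the sketch below; the effort weight `w_u`, the `refit_every` interval, and the helper names are illustrative assumptions, not the exact settings used here.

```python
import numpy as np

def reward(pos, u, x1_ref, x3_ref, w_u=1e-3):
    """Quadratic tracking reward on x1 and x3 with a small control-effort penalty."""
    return -((pos[0] - x1_ref) ** 2 + (pos[2] - x3_ref) ** 2) - w_u * np.sum(u ** 2)

def online_update(model, X, Y, new_x, new_y, refit_every=10):
    """Append the newly observed transition and periodically refit the dynamics model."""
    X.append(new_x)
    Y.append(new_y)
    if len(X) % refit_every == 0:
        model.fit(np.array(X), np.array(Y), epochs=2, batch_size=64, verbose=0)
```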
The control algorithm currently takes about 4-5 seconds per time step because the computation is not optimized: only the neural network training runs on the GPU, not the rest of the computation (a batched-rollout sketch follows the list below). Possible future improvements:
- Integration of an input convex neural network to learn the whole system dynamics
- Explore advanced RL algorithms
- Faster implementation of the algorithm using computing frameworks (e.g. OpenCL with C++)
- Experimental implementation on a real MIMO system
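Regarding the speed issue mentioned above, one direction short of a C++/OpenCL port would be to keep the whole MPC rollout on the GPU by evaluating all candidate sequences in a single compiled TensorFlow graph. A sketch under the same assumptions as the earlier snippets (shapes and helper names are hypothetical):

```python
import tensorflow as tf

@tf.function
def rollout_cost(model, prev, cur, U, x1_ref, x3_ref):
    """Evaluate all candidate input sequences in one compiled, batched graph call.
    prev, cur: (N, 3) position tensors; U: (N, H, 3) candidate input sequences."""
    cost = 0.0
    for t in range(U.shape[1]):                    # loop is unrolled at trace time
        inp = tf.concat([prev, cur, U[:, t, :]], axis=1)
        nxt = model(inp, training=False)           # batched network evaluation
        cost = cost + (nxt[:, 0] - x1_ref) ** 2 + (nxt[:, 2] - x3_ref) ** 2
        prev, cur = cur, nxt
    return cost                                    # shape (N,): one cost per candidate
```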
References:
- Hagan, M.T.; Demuth, H.B.; De Jesus, O.: An introduction to the use of neural networks in control systems. In: Int. J. Robust Nonlinear Control, Vol. 12, John Wiley and Sons Ltd, 2002, pp. 959-985
- Nagabandi, A.; Kahn, G.; Fearing, R.S.; Levine, S.: Neural Network Dynamics for Model-Based Deep Reinforcement Learning with Model-Free Fine-Tuning. In: CoRR abs/1708.02596, 2017
- Chen, Y.; Shi, Y.; Zhang, B.: Optimal Control via Neural Networks: A Convex Approach. In: arXiv: Optimization and Control, 2019
- Sutton, R.S.; Barto, A.G.: Reinforcement Learning: An Introduction. Cambridge, MA, USA: A Bradford Book, 2018. ISBN 0262039249
- Wong, W.; Chee, E.; Li, J.; Wang, X.: Recurrent Neural Network-Based Model Predictive Control for Continuous Pharmaceutical Manufacturing. In: Mathematics 6, 2018, 242
- Rawlings, J.; Mayne, D.Q.; Diehl, M.: Model Predictive Control: Theory, Computation, and Design, 2nd Edition. Santa Barbara, California: Nob Hill Publishing, LLC, 2019