
Scientific Papers Suggestion Thread #11

Closed
Ghost---Shadow opened this issue Apr 23, 2017 · 16 comments


@Ghost---Shadow

Messing around and trying random stuff can only get us so far. If we want a more reliable network, we should survey the existing literature.

I think this paper (https://arxiv.org/abs/1608.02192) is a good start. Here is a video about it: https://www.youtube.com/watch?v=QkqNzrsaxYc

@zxzhijia

I totally agree with @Ghost---Shadow. We need to study the literature and pick or invent a low-computational-cost method. The paper I'm referring to is http://deepdriving.cs.princeton.edu/paper.pdf

@Sentdex
Owner

Sentdex commented Apr 24, 2017

While I think it's foolish not to read up-to-date research on the best methods, I also think it's unwise to assume you can't possibly make progress without someone else showing you how. Most of what we're doing here is still totally in the trial-and-error stage; no one really has the answer. If you find the answer for this project, you've got yourself a billion-dollar business.

I'm not sure if it's possible, but renaming this to "scientific papers" or something that more clearly calls on people to share academic research related to this topic might help.

@Ghost---Shadow changed the title from "Referring a scientific paper" to "Scientific Papers Suggestion Thread" on Apr 24, 2017
@Ghost---Shadow
Author

Done. You should link this in the readme.md.

@Sentdex
Owner

Sentdex commented May 10, 2017

Added a link in the readme here. Thanks!

Segmentation: https://arxiv.org/abs/1611.08323
GitHub: https://github.com/TobyPDE/FRRN

@Ghost---Shadow
Author

Ghost---Shadow commented May 11, 2017

Since many people were running out of GPU memory, I would recommend SqueezeNet. My finals are going on, so if nobody implements it by the end of the month, I will.

EDIT: Done
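
For anyone curious what makes SqueezeNet small: its building block is the "fire module", a 1x1 "squeeze" convolution that cuts the channel count, followed by parallel 1x1 and 3x3 "expand" convolutions whose outputs are concatenated. A minimal Keras sketch (filter counts follow the paper's fire2 module; the function name is mine):

```python
# Minimal sketch of a SqueezeNet fire module in Keras.
# Filter counts follow the paper's fire2 module; names are illustrative.
from tensorflow.keras import layers

def fire_module(x, squeeze_filters=16, expand_filters=64):
    # Squeeze: 1x1 conv shrinks the channel count to save parameters.
    s = layers.Conv2D(squeeze_filters, 1, activation='relu')(x)
    # Expand: parallel 1x1 and 3x3 convs, concatenated along channels.
    e1 = layers.Conv2D(expand_filters, 1, activation='relu', padding='same')(s)
    e3 = layers.Conv2D(expand_filters, 3, activation='relu', padding='same')(s)
    return layers.Concatenate(axis=-1)([e1, e3])
```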

@skotone

skotone commented May 23, 2017

Learning how to play Atari games from pixels:
https://www.nature.com/nature/journal/v518/n7540/full/nature14236.html

@orenouard

Some interesting info on the separate buffers GTA V renders, with depth and stencil being of particular interest, I think: http://www.adriancourreges.com/blog/2015/11/02/gta-v-graphics-study/. Directly "snooping" on the individual buffers could perhaps be achieved with something like GLIntercept (https://www.opengl.org/sdk/tools/GLIntercept/), though I don't know if it would be fast enough.

@mantoone

NVIDIA's self-driving car model:
https://arxiv.org/pdf/1604.07316v1.pdf
https://devblogs.nvidia.com/parallelforall/deep-learning-self-driving-cars/#disqus_thread
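
For context, the network in that paper (later dubbed PilotNet) maps a 66x200 YUV camera frame directly to a steering command through five convolutional and three fully connected layers. A rough Keras sketch of the architecture as described in the paper; the normalization constant, optimizer, and loss are simplifying assumptions on my part:

```python
# Rough Keras sketch of NVIDIA's end-to-end steering network
# (https://arxiv.org/abs/1604.07316); training details simplified.
from tensorflow.keras import layers, models

model = models.Sequential([
    layers.Input(shape=(66, 200, 3)),          # YUV camera frame
    layers.Lambda(lambda x: x / 127.5 - 1.0),  # assumed normalization
    layers.Conv2D(24, 5, strides=2, activation='relu'),
    layers.Conv2D(36, 5, strides=2, activation='relu'),
    layers.Conv2D(48, 5, strides=2, activation='relu'),
    layers.Conv2D(64, 3, activation='relu'),
    layers.Conv2D(64, 3, activation='relu'),
    layers.Flatten(),
    layers.Dense(100, activation='relu'),
    layers.Dense(50, activation='relu'),
    layers.Dense(10, activation='relu'),
    layers.Dense(1),                           # steering command
])
model.compile(optimizer='adam', loss='mse')    # assumed training setup
```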

@skotone

skotone commented Jun 16, 2017

Reinforcement Learning: An Introduction, Sutton and Barto, book available online:
http://incompleteideas.net/sutton/book/ebook/the-book.html

A good introduction to policy gradient methods:
http://www.scholarpedia.org/article/Policy_gradient_methods

A very good survey of actor-critic reinforcement learning:
http://ieeexplore.ieee.org/document/6392457/

Deterministic policy gradient algorithms:
http://proceedings.mlr.press/v32/silver14.pdf
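
As a concrete taste of what those policy-gradient references describe, here is a minimal REINFORCE-style update for a softmax policy over a table of action preferences; everything here (table layout, function name, hyperparameters) is illustrative rather than taken from any of the papers:

```python
# Minimal REINFORCE update for a tabular softmax policy.
# theta[state, action] holds action preferences; illustrative only.
import numpy as np

def softmax(x):
    z = np.exp(x - x.max())
    return z / z.sum()

def reinforce_update(theta, episode, alpha=0.01, gamma=0.99):
    """episode: list of (state, action, reward) tuples from one rollout."""
    G = 0.0
    # Walk the episode backwards, accumulating discounted returns.
    for state, action, reward in reversed(episode):
        G = reward + gamma * G
        probs = softmax(theta[state])
        # grad log pi(a|s) for a softmax policy: one-hot(a) - pi(.|s)
        grad_log_pi = -probs
        grad_log_pi[action] += 1.0
        theta[state] += alpha * G * grad_log_pi
    return theta
```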

@npetrenko

npetrenko commented Jul 22, 2017

@skotone recommended policy gradient methods. The problem with them is that they are mostly "on-policy", meaning the training data has to be sampled under the current network weights before those weights can be updated (though with an algorithm like A3C, for example, you can easily train on batches of several frames). On-policy methods are very inefficient in terms of the amount of simulation needed to train the network, and that should be your biggest concern, because running GTA V is costly by itself.

I would stick to off-policy methods, which can learn an optimal policy from ANY game records (even human ones). With them, you could build a big buffer of frames coming from your GTA V agent and train on that alone. Your primary interest should be Q-learning and its more advanced TD relatives. Also, Q-learning can sometimes have stability problems; use double Q-learning if you run into them.
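
To make the double Q-learning idea concrete, here is a minimal tabular sketch (after van Hasselt, 2010); the function name and hyperparameters are illustrative, not from any project code:

```python
# Minimal tabular double Q-learning update (van Hasselt, 2010).
# Two tables are kept; one selects the greedy next action, the other
# evaluates it, which damps plain Q-learning's overestimation bias.
import numpy as np

def double_q_update(Q_a, Q_b, s, a, r, s_next, alpha=0.1, gamma=0.99):
    if np.random.rand() < 0.5:
        Q_a, Q_b = Q_b, Q_a  # swap roles so both tables get trained
    a_star = np.argmax(Q_a[s_next])            # Q_a picks the action...
    target = r + gamma * Q_b[s_next, a_star]   # ...Q_b evaluates it
    Q_a[s, a] += alpha * (target - Q_a[s, a])  # in-place table update
```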

Unfortunately, I have neither a Windows nor a Mac computer (nor GTA V). If I did, I would take part in your efforts.

@npetrenko

Here's a simple double Q-learning implementation: https://github.com/npetrenko/oligopolyrl/blob/master/oligopolyrl.ipynb

@ajRiverav

Here is a very nice survey on Deep Reinforcement Learning (from 2017 by Li). Section 5 has a Robotics example that may be useful.

https://arxiv.org/pdf/1701.07274.pdf

@Ghost---Shadow
Author

Cooperative Inverse Reinforcement Learning. I think this form of RL would be best suited for this particular case.

https://www.youtube.com/watch?v=9nktr1MgS-A

https://arxiv.org/abs/1606.03137

@racinmat

Using semantically segmented data from GTA V: here they trained a classifier on GTA V data alone, and it outperforms a classifier trained on the KITTI dataset at detecting real-world cars in photos:
https://arxiv.org/abs/1610.01983
https://github.com/umautobots/driving-in-the-matrix

@leo2105

leo2105 commented Nov 10, 2017

This paper from the NVIDIA team could help (also linked by @mantoone above): https://arxiv.org/pdf/1604.07316v1.pdf

@xhuvom

xhuvom commented Jan 17, 2018

You could try the DeepMind Control Suite, which is also in Python but works only with the MuJoCo physics engine.
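
A minimal usage sketch, assuming the dm_control package with a working MuJoCo installation; the domain and task names are just examples:

```python
# Minimal dm_control loop; assumes MuJoCo is installed and licensed.
import numpy as np
from dm_control import suite

env = suite.load(domain_name="cartpole", task_name="swingup")
spec = env.action_spec()

time_step = env.reset()
while not time_step.last():
    # Random actions within the spec's bounds, just to drive the loop.
    action = np.random.uniform(spec.minimum, spec.maximum, size=spec.shape)
    time_step = env.step(action)
print(time_step.reward)
```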
