- conda create -n rl python=3.6
- conda activate rl
- pip install --upgrade pip
- pip install -r requirements.txt
- Install PyTorch
- Install baselines
- git clone https://github.com/openai/gym.git
- cd gym
- pip install -e '.[all]'
- Ignore the MuJoCo errors
- pip freeze > requirements.txt
-
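After the steps above, a quick check that the packages are visible from the `rl` environment can save time later. The following is a minimal sketch, assuming the import names `torch`, `gym`, and `baselines`; the helper `missing_modules` is a hypothetical name, not part of any of these packages.

```python
import importlib.util
import sys


def missing_modules(names):
    """Return the top-level module names that cannot be found in this environment."""
    # find_spec() locates a top-level module without importing it.
    return [name for name in names if importlib.util.find_spec(name) is None]


print("Python", sys.version.split()[0])
# Assumed import names for the packages installed above.
missing = missing_modules(["torch", "gym", "baselines"])
print("missing:", ", ".join(missing) if missing else "none")
```

Run it with the `rl` environment activated; any name listed as missing points to an install step that did not complete.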
Install Mosquitto
- brew install mosquitto
- Linux: https://blog.neonkid.xyz/127
-
Run the Mosquitto service
- /usr/local/sbin/mosquitto -c /usr/local/etc/mosquitto/mosquitto.conf
-
Subscribe to messages
- mosquitto_sub -h [host] -p [port] -t [topic]
- mosquitto_sub -h 127.0.0.1 -p 1883 -t "topic"
-
Publish a message
- mosquitto_pub -h [host] -p [port] -t [topic] -m [message]
- mosquitto_pub -h 127.0.0.1 -p 1883 -t "topic" -m "test message"
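The subscribe and publish commands above can also be assembled from Python, which helps when the host, port, or topic come from configuration. This is a minimal sketch that only builds the argument lists with the same `-h`, `-p`, `-t`, and `-m` flags shown above; the function names are hypothetical, and nothing is sent until the list is passed to `subprocess.run`.

```python
import shlex


def mosquitto_sub_cmd(host, port, topic):
    """Build the argument list for mosquitto_sub."""
    return ["mosquitto_sub", "-h", host, "-p", str(port), "-t", topic]


def mosquitto_pub_cmd(host, port, topic, message):
    """Build the argument list for mosquitto_pub."""
    return ["mosquitto_pub", "-h", host, "-p", str(port), "-t", topic, "-m", message]


def as_shell(args):
    """Render an argument list as a copy-pasteable shell command."""
    return " ".join(shlex.quote(arg) for arg in args)


pub = mosquitto_pub_cmd("127.0.0.1", 1883, "topic", "test message")
print(as_shell(pub))
```

Once the broker is running, `subprocess.run(pub, check=True)` would execute the publish command; passing a list avoids shell quoting problems in the message text.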
- Run main.py
- git rm -r --cached .
- git add .
- git commit -m "Apply .gitignore"
- pip freeze > requirements.txt
- brew install gcc@6
- mkdir ~/.mujoco
- Download from https://github.com/openai/mujoco-py
- mv Downloads/mujoco200_macos ~/.mujoco/mujoco200
- cp Downloads/mjkey.txt ~/.mujoco/
- pip install -U 'mujoco-py<2.1,>=2.0'
- Reference
-
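Before installing mujoco-py, it may help to confirm that the files moved in the steps above ended up where expected. The following is a minimal sketch, assuming the layout `~/.mujoco/mujoco200` and `~/.mujoco/mjkey.txt` used above; `missing_mujoco_paths` is a hypothetical helper name.

```python
import os


def missing_mujoco_paths(root=None):
    """Return the expected MuJoCo paths that do not exist under `root`."""
    if root is None:
        root = os.path.join(os.path.expanduser("~"), ".mujoco")
    expected = [
        os.path.join(root, "mujoco200"),  # extracted MuJoCo 2.0 directory
        os.path.join(root, "mjkey.txt"),  # license key file
    ]
    return [path for path in expected if not os.path.exists(path)]


print(missing_mujoco_paths() or "MuJoCo files look in place")
```

An empty result only means the paths exist; it does not validate the license key or the contents of the directory.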
- Setup on Linux
- sudo apt-get update
- sudo apt install nfs-kernel-server
- sudo chown nobody:nogroup /home/{account_name}/git
- sudo chmod 777 /home/{account_name}/git
- sudo vi /etc/exports
- /home/{account_name}/git 192.168.0.10(rw,sync,no_subtree_check)
- 192.168.0.10 is the Mac's IP address
- sudo exportfs -a
- sudo systemctl restart nfs-kernel-server
- sudo ufw allow from 192.168.0.0/24 to any port nfs
-
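The `/etc/exports` entry and the `ufw` rule above have to agree: the client IP in the export line must fall inside the subnet that the firewall allows. This is a minimal sketch of that check using the example addresses above; the function names are hypothetical.

```python
import ipaddress


def exports_line(path, client_ip, options=("rw", "sync", "no_subtree_check")):
    """Format one /etc/exports entry for a single client IP."""
    ipaddress.ip_address(client_ip)  # raises ValueError for a malformed address
    return "{} {}({})".format(path, client_ip, ",".join(options))


def client_allowed(client_ip, subnet):
    """Return True if client_ip lies inside the subnet allowed by the firewall."""
    return ipaddress.ip_address(client_ip) in ipaddress.ip_network(subnet)


print(exports_line("/home/{account_name}/git", "192.168.0.10"))
print(client_allowed("192.168.0.10", "192.168.0.0/24"))
```

If `client_allowed` returns False for the Mac's address, the mount from the Mac is likely to be blocked by the firewall even though the export itself is correct.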
- Setup on Mac
- mkdir ~/linux_nfs_git
- sudo mount -t nfs -o resvport,rw,nfc 192.168.0.43:/home/{account_name}/git ~/linux_nfs_git
- 192.168.0.43 is the Linux machine's IP address
- python -c 'import torch; print(torch.rand(2,3).cuda())'
- nvidia-smi
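The `python -c` one-liner above raises an error when CUDA is not usable, which can be hard to tell apart from a missing PyTorch install. A slightly more explicit check might look like the sketch below; `cuda_status` is a hypothetical helper, and it only relies on `torch.cuda.is_available()` and `torch.cuda.get_device_name()`.

```python
def cuda_status():
    """Describe whether PyTorch is installed and whether it can see a CUDA device."""
    try:
        import torch
    except ImportError:
        return "torch is not installed"
    if not torch.cuda.is_available():
        return "torch is installed, but CUDA is not available"
    return "CUDA is available: " + torch.cuda.get_device_name(0)


print(cuda_status())
```

If this reports that CUDA is not available while `nvidia-smi` lists a GPU, the installed PyTorch build and the driver or CUDA version are worth comparing.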
- Reference
- https://www.roboti.us/index.html
- Download mjpro150 or mujoco200 from the site above and extract it into ~/.mujoco/
- Move the MuJoCo license key, mjkey.txt, into ~/.mujoco/
- For the rest of the installation, see the site below
- https://github.com/openai/mujoco-py#install-mujoco
mjpro150 from installation step 11 must already be installed
- https://github.com/openai/mujoco-worldgen
- cd ~/.mujoco
- git clone https://github.com/openai/mujoco-worldgen.git
- xcode-select --install
- open /Library/Developer/CommandLineTools/Packages/macOS_SDK_headers_for_macOS_10.14.pkg
- pip install -r mujoco-worldgen/requirements.txt
- Reference
- https://github.com/openai/multi-agent-emergence-environments
- Install baselines by following the site below
- https://github.com/openai/baselines
- brew install cmake openmpi
- cd baselines
- pip install -e .
- https://medium.com/aureliantactics/ppo-hyperparameters-and-ranges-6fc2d29bccbe
- https://arxiv.org/pdf/1709.06009.pdf
- https://medium.com/@jonathan_hui/rl-proximal-policy-optimization-ppo-explained-77f014ec3f12
- https://en.wikipedia.org/wiki/MM_algorithm
- https://drive.google.com/file/d/0BxXI_RttTZAhMVhsNk5VSXU0U3c/view