gym-cache

OpenAI based Gym environments for training RL caching agent

install it: pip install gym-cache

import it like this:


import gym

gym.make('gym_cache:Cache-v0')

observation space has following variables:

six tokens (integers)
file size [kB]
how full is the cache at that moment

There are two discrete action environments (Cache-v0 and Cache-large-v0) and one continuous action environment (Cache-continuous-v0).

Data extractions and preprocessing

This is a two step procedure:

extract raw data data/extract_data.py - change PQ, date range
process raw data data/process_data.py - tokenizes filenames, generates unique fileIDs, sorts by access time.

Processed data should be copied to the directory where actor runs. It is a parque file (.pa) with one dataframe:

index - access time (sorted)
six tokens derived from the filename ('1', '2', '3', '4', '5', '6')
filesize ('kB')
unique file identifier ('fID')

Rewards

always negative and correspond to cost to get the file - if it was cached it will be smaller (Currently it gives positive for good guess and negative for mistakes. cost = 1 * filesize)
files are cached irrespectively from what action actor performed for the file
cleanup: environment memorizes actions. on cleanup it first deletes files judged not to be needed again (action 0 in discrete environments or smaller values in continues environment). If multiple files have the same action value, LRU one is removed first.

Possible technical implementation in XCache server

There are additional containers in the pod.
- environment container
  - recieves gstream pfc, and disk info
  - recalculates new state, reward, tokenizes recieved gstream info.
  - memorizes last state, actors actions for each file
  - triggers cleanup at lower HWM then xcache itself. Loops through memorized paths and removes ones least probably needed.
- redis db - used by environment container to store actor responses
- actor container

Miscalenious

To change environments:

clone github repository
make changes
install locally:
python -m pip install --user -e . or python setup.py bdist_wheel python -m pip install dist\gym_cache-1.0.3-py3-none-any.whl
to upload to pypi repository create %USER%.pypirc file first. python setup.py bdist_wheel python -m twine upload dist*

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
.vscode		.vscode
data		data
gym_cache		gym_cache
results		results
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
TODO.md		TODO.md
check_conformity.py		check_conformity.py
requirements.txt		requirements.txt
setup.py		setup.py
unit.py		unit.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

gym-cache

Data extractions and preprocessing

Rewards

Possible technical implementation in XCache server

Miscalenious

create %USER%.pypirc file first.

About

Releases

Packages

Languages

License

ivukotic/gym-cache

Folders and files

Latest commit

History

Repository files navigation

gym-cache

Data extractions and preprocessing

Rewards

Possible technical implementation in XCache server

Miscalenious

create %USER%.pypirc file first.

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages