Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub / evgenii-nikishin / rl_with_resets

JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"

JSON API: https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/evgenii-nikishin%2Frl_with_resets
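
The repository path in the URL above percent-encodes the slash between owner and name (`%2F`). A minimal sketch of building such a URL, assuming the `hosts/<host>/repositories/<owner>%2F<name>` pattern seen in this link carries over to other repositories; the helper name `repo_api_url` is hypothetical and not part of any Ecosyste.ms client:

```python
from urllib.parse import quote

# Base path copied from the JSON API link above.
API_BASE = "https://repos.ecosyste.ms/api/v1"


def repo_api_url(host: str, full_name: str) -> str:
    """Build a repository URL in the shape of the link above (hypothetical helper)."""
    # safe="" makes quote() encode "/" as %2F instead of leaving it as a path separator.
    encoded_host = quote(host, safe="")
    encoded_name = quote(full_name, safe="")
    return f"{API_BASE}/hosts/{encoded_host}/repositories/{encoded_name}"


url = repo_api_url("GitHub", "evgenii-nikishin/rl_with_resets")
print(url)
```

The resulting string can be passed to any HTTP client; the response format is not shown on this page, so it is not assumed here.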

Stars: 85
Forks: 6
Open Issues: 0

License: MIT
Language: Python
Repo Size: 1.1 MB
Dependencies: 249

Created: about 2 years ago
Updated: 10 months ago
Last pushed: about 2 years ago
Last synced: 10 months ago

Topics: atari, atari2600, deep-learning, deep-reinforcement-learning, deepmind-control-suite, dmc, drq, flax, gym, jax, machine-learning, overfitting, reinforcement-learning, sac, soft-actor-critic, spr

Dependencies

continuous_control_requirements.txt pypi

Babel ==2.10.1
Cython ==0.29.28
Jinja2 ==3.1.2
Markdown ==3.3.7
MarkupSafe ==2.1.1
Pillow ==9.1.0
PyOpenGL ==3.1.6
PySocks ==1.7.1
PyYAML ==6.0
Pygments ==2.12.0
Send2Trash ==1.8.0
Werkzeug ==2.1.2
absl-py ==0.12.0
anyio ==3.5.0
argon2-cffi ==21.3.0
argon2-cffi-bindings ==21.2.0
asttokens ==2.0.5
attrs ==21.4.0
backcall ==0.2.0
beautifulsoup4 ==4.11.1
bleach ==5.0.0
cachetools ==4.2.4
certifi ==2021.10.8
cffi ==1.15.0
charset-normalizer ==2.0.12
chex ==0.1.3
cloudpickle ==2.0.0
contextlib2 ==21.6.0
cycler ==0.11.0
debugpy ==1.6.0
decorator ==5.1.1
defusedxml ==0.7.1
distrax ==0.0.2
dm-control ==1.0.2
dm-env ==1.5
dm-tree ==0.1.7
entrypoints ==0.4
executing ==0.8.3
fasteners ==0.17.3
fastjsonschema ==2.15.3
filelock ==3.6.0
flatbuffers ==2.0
flax ==0.4.2
fonttools ==4.33.3
future ==0.18.2
gast ==0.5.3
gdown ==3.12.2
glfw ==2.5.3
google-auth ==1.35.0
google-auth-oauthlib ==0.4.6
grpcio ==1.46.0
gym ==0.20.0
idna ==3.3
imageio ==2.9.0
imageio-ffmpeg ==0.4.3
importlib-metadata ==4.11.3
importlib-resources ==5.7.1
ipykernel ==6.13.0
ipython ==8.3.0
ipython-genutils ==0.2.0
jax ==0.3.10
jaxlib ==0.3.7
jedi ==0.18.1
joblib ==1.1.0
json5 ==0.9.7
jsonschema ==4.5.1
jupyter-client ==7.3.0
jupyter-contrib-core ==0.3.3
jupyter-contrib-nbextensions ==0.5.1
jupyter-core ==4.10.0
jupyter-highlight-selected-word ==0.2.0
jupyter-latex-envs ==1.4.6
jupyter-nbextensions-configurator ==0.4.1
jupyter-server ==1.17.0
jupyterlab ==3.1.14
jupyterlab-pygments ==0.2.2
jupyterlab-server ==2.13.0
kiwisolver ==1.4.2
labmaze ==1.0.5
lxml ==4.8.0
matplotlib ==3.5.2
matplotlib-inline ==0.1.3
mistune ==0.8.4
ml-collections ==0.1.0
msgpack ==1.0.3
mujoco ==2.1.5
mujoco-py ==2.0.2.13
nbclassic ==0.3.7
nbclient ==0.6.2
nbconvert ==6.5.0
nbformat ==5.4.0
nest-asyncio ==1.5.5
notebook ==6.4.11
notebook-shim ==0.1.0
numpy ==1.21.0
oauthlib ==3.2.0
opt-einsum ==3.3.0
optax ==0.0.9
packaging ==21.3
pandocfilters ==1.5.0
parso ==0.8.3
pexpect ==4.8.0
pickleshare ==0.7.5
prometheus-client ==0.14.1
prompt-toolkit ==3.0.29
protobuf ==3.20.1
psutil ==5.9.0
ptyprocess ==0.7.0
pure-eval ==0.2.2
pyasn1 ==0.4.8
pyasn1-modules ==0.2.8
pycparser ==2.21
pyparsing ==2.4.7
pyrsistent ==0.18.1
python-dateutil ==2.8.2
pytz ==2022.1
pyzmq ==22.3.0
requests ==2.27.1
requests-oauthlib ==1.3.1
rsa ==4.8
scikit-learn ==1.0
scipy ==1.7.1
six ==1.16.0
sniffio ==1.2.0
soupsieve ==2.3.2.post1
stack-data ==0.2.0
tensorboard ==2.6.0
tensorboard-data-server ==0.6.1
tensorboard-plugin-wit ==1.8.1
tensorboardX ==2.2
terminado ==0.13.3
tfp-nightly ==0.17.0.dev20220507
threadpoolctl ==3.1.0
tinycss2 ==1.1.1
toolz ==0.11.2
tornado ==6.1
tqdm ==4.60.0
traitlets ==5.1.1
typing_extensions ==4.2.0
urllib3 ==1.26.9
wcwidth ==0.2.5
webencodings ==0.5.1
websocket-client ==1.3.2
zipp ==3.8.0

discrete_control_requirements.txt pypi

AutoROM ==0.4.2
AutoROM.accept-rom-license ==0.4.2
GitPython ==3.1.27
Keras-Preprocessing ==1.1.2
Markdown ==3.3.7
Pillow ==9.1.0
PyYAML ==6.0
Pygments ==2.12.0
Werkzeug ==2.1.2
absl-py ==1.0.0
ale-py ==0.7.5
astunparse ==1.6.3
atari-py ==0.2.9
backcall ==0.2.0
cached-property ==1.5.2
cachetools ==5.1.0
certifi ==2021.10.8
charset-normalizer ==2.0.12
chex ==0.1.3
clang ==5.0
click ==8.1.3
cloudpickle ==2.0.0
cycler ==0.11.0
decorator ==5.1.1
dm-haiku ==0.0.6
dm-tree ==0.1.7
docker-pycreds ==0.4.0
dopamine-rl ==4.0.2
flatbuffers ==2.0
flax ==0.4.2
future ==0.18.2
gast ==0.4.0
gin-config ==0.5.0
gitdb ==4.0.9
google-auth ==2.6.6
google-auth-oauthlib ==0.4.6
google-pasta ==0.2.0
grpcio ==1.46.1
gym ==0.23.1
gym-notices ==0.0.6
h5py ==3.1.0
idna ==3.3
importlib-metadata ==4.11.3
importlib-resources ==5.7.1
ipdb ==0.13.9
ipython ==7.33.0
jax ==0.3.13
jaxlib ==0.3.10
jedi ==0.18.1
jmp ==0.0.2
keras ==2.6.0
kiwisolver ==1.4.2
matplotlib ==3.4.2
matplotlib-inline ==0.1.3
msgpack ==1.0.3
numpy ==1.21.6
oauthlib ==3.2.0
opencv-python ==4.5.5.64
opt-einsum ==3.3.0
optax ==0.1.2
pandas ==1.3.5
parso ==0.8.3
pathtools ==0.1.2
pexpect ==4.8.0
pickleshare ==0.7.5
promise ==2.3
prompt-toolkit ==3.0.29
protobuf ==3.20.1
psutil ==5.9.0
ptyprocess ==0.7.0
pyasn1 ==0.4.8
pyasn1-modules ==0.2.8
pygame ==2.1.2
pyglet ==1.4.11
pyparsing ==3.0.9
python-dateutil ==2.8.2
pytz ==2022.1
requests ==2.27.1
requests-oauthlib ==1.3.1
rsa ==4.8
scipy ==1.7.3
sentry-sdk ==1.5.12
setproctitle ==1.2.3
shortuuid ==1.0.9
six ==1.16.0
smmap ==5.0.0
tabulate ==0.8.9
tensorboard ==2.9.0
tensorboard-data-server ==0.6.1
tensorboard-plugin-wit ==1.8.1
tensorflow ==2.6.0
tensorflow-estimator ==2.6.0
tensorflow-probability ==0.16.0
termcolor ==1.1.0
tf-slim ==1.1.0
toml ==0.10.2
toolz ==0.11.2
tqdm ==4.64.0
traitlets ==5.2.1.post0
typing-extensions ==4.2.0
urllib3 ==1.26.9
wandb ==0.12.16
wcwidth ==0.2.5
wrapt ==1.12.1
zipp ==3.8.0
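
The two manifests above pin some shared packages to different versions (for example `jax`, `jaxlib`, `gym`, `numpy`, and `optax`), which suggests the continuous-control and discrete-control setups are meant to live in separate environments. A rough sketch of how such differences could be found, assuming each entry has the `name ==version` shape shown above; the function name `parse_pins` is hypothetical, and the sample strings below are short excerpts of the lists, not the full manifests:

```python
import re


def parse_pins(text: str) -> dict[str, str]:
    """Parse 'name ==version' lines into {normalized_name: version} (hypothetical helper)."""
    pins = {}
    for line in text.splitlines():
        if "==" not in line:
            continue  # skip blank lines and anything that is not an exact pin
        name, version = line.split("==", 1)
        # Normalize names so that e.g. typing_extensions and typing-extensions match.
        key = re.sub(r"[-_.]+", "-", name.strip()).lower()
        pins[key] = version.strip()
    return pins


# Excerpts copied from the two lists above.
continuous = parse_pins("""
flax ==0.4.2
gym ==0.20.0
jax ==0.3.10
jaxlib ==0.3.7
numpy ==1.21.0
optax ==0.0.9
typing_extensions ==4.2.0
""")

discrete = parse_pins("""
flax ==0.4.2
gym ==0.23.1
jax ==0.3.13
jaxlib ==0.3.10
numpy ==1.21.6
optax ==0.1.2
typing-extensions ==4.2.0
""")

# Packages present in both excerpts whose pinned versions disagree.
differing = {
    name: (continuous[name], discrete[name])
    for name in sorted(continuous.keys() & discrete.keys())
    if continuous[name] != discrete[name]
}

for name, (cont_version, disc_version) in differing.items():
    print(f"{name}: continuous={cont_version} discrete={disc_version}")
```

Packages pinned identically in both files, such as `flax ==0.4.2`, do not appear in `differing`.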