Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub / evgenii-nikishin / rl_with_resets
JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"
JSON API: https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/evgenii-nikishin%2Frl_with_resets
Stars: 85
Forks: 6
Open Issues: 0
License: mit
Language: Python
Repo Size: 1.1 MB
Dependencies:
249
Created: about 2 years ago
Updated: 10 months ago
Last pushed: about 2 years ago
Last synced: 10 months ago
Topics: atari, atari2600, deep-learning, deep-reinforcement-learning, deepmind-control-suite, dmc, drq, flax, gym, jax, machine-learning, overfitting, reinforcement-learning, sac, soft-actor-critic, spr
Files
Dependencies
- Babel ==2.10.1
- Cython ==0.29.28
- Jinja2 ==3.1.2
- Markdown ==3.3.7
- MarkupSafe ==2.1.1
- Pillow ==9.1.0
- PyOpenGL ==3.1.6
- PySocks ==1.7.1
- PyYAML ==6.0
- Pygments ==2.12.0
- Send2Trash ==1.8.0
- Werkzeug ==2.1.2
- absl-py ==0.12.0
- anyio ==3.5.0
- argon2-cffi ==21.3.0
- argon2-cffi-bindings ==21.2.0
- asttokens ==2.0.5
- attrs ==21.4.0
- backcall ==0.2.0
- beautifulsoup4 ==4.11.1
- bleach ==5.0.0
- cachetools ==4.2.4
- certifi ==2021.10.8
- cffi ==1.15.0
- charset-normalizer ==2.0.12
- chex ==0.1.3
- cloudpickle ==2.0.0
- contextlib2 ==21.6.0
- cycler ==0.11.0
- debugpy ==1.6.0
- decorator ==5.1.1
- defusedxml ==0.7.1
- distrax ==0.0.2
- dm-control ==1.0.2
- dm-env ==1.5
- dm-tree ==0.1.7
- entrypoints ==0.4
- executing ==0.8.3
- fasteners ==0.17.3
- fastjsonschema ==2.15.3
- filelock ==3.6.0
- flatbuffers ==2.0
- flax ==0.4.2
- fonttools ==4.33.3
- future ==0.18.2
- gast ==0.5.3
- gdown ==3.12.2
- glfw ==2.5.3
- google-auth ==1.35.0
- google-auth-oauthlib ==0.4.6
- grpcio ==1.46.0
- gym ==0.20.0
- idna ==3.3
- imageio ==2.9.0
- imageio-ffmpeg ==0.4.3
- importlib-metadata ==4.11.3
- importlib-resources ==5.7.1
- ipykernel ==6.13.0
- ipython ==8.3.0
- ipython-genutils ==0.2.0
- jax ==0.3.10
- jaxlib ==0.3.7
- jedi ==0.18.1
- joblib ==1.1.0
- json5 ==0.9.7
- jsonschema ==4.5.1
- jupyter-client ==7.3.0
- jupyter-contrib-core ==0.3.3
- jupyter-contrib-nbextensions ==0.5.1
- jupyter-core ==4.10.0
- jupyter-highlight-selected-word ==0.2.0
- jupyter-latex-envs ==1.4.6
- jupyter-nbextensions-configurator ==0.4.1
- jupyter-server ==1.17.0
- jupyterlab ==3.1.14
- jupyterlab-pygments ==0.2.2
- jupyterlab-server ==2.13.0
- kiwisolver ==1.4.2
- labmaze ==1.0.5
- lxml ==4.8.0
- matplotlib ==3.5.2
- matplotlib-inline ==0.1.3
- mistune ==0.8.4
- ml-collections ==0.1.0
- msgpack ==1.0.3
- mujoco ==2.1.5
- mujoco-py ==2.0.2.13
- nbclassic ==0.3.7
- nbclient ==0.6.2
- nbconvert ==6.5.0
- nbformat ==5.4.0
- nest-asyncio ==1.5.5
- notebook ==6.4.11
- notebook-shim ==0.1.0
- numpy ==1.21.0
- oauthlib ==3.2.0
- opt-einsum ==3.3.0
- optax ==0.0.9
- packaging ==21.3
- pandocfilters ==1.5.0
- parso ==0.8.3
- pexpect ==4.8.0
- pickleshare ==0.7.5
- prometheus-client ==0.14.1
- prompt-toolkit ==3.0.29
- protobuf ==3.20.1
- psutil ==5.9.0
- ptyprocess ==0.7.0
- pure-eval ==0.2.2
- pyasn1 ==0.4.8
- pyasn1-modules ==0.2.8
- pycparser ==2.21
- pyparsing ==2.4.7
- pyrsistent ==0.18.1
- python-dateutil ==2.8.2
- pytz ==2022.1
- pyzmq ==22.3.0
- requests ==2.27.1
- requests-oauthlib ==1.3.1
- rsa ==4.8
- scikit-learn ==1.0
- scipy ==1.7.1
- six ==1.16.0
- sniffio ==1.2.0
- soupsieve ==2.3.2.post1
- stack-data ==0.2.0
- tensorboard ==2.6.0
- tensorboard-data-server ==0.6.1
- tensorboard-plugin-wit ==1.8.1
- tensorboardX ==2.2
- terminado ==0.13.3
- tfp-nightly ==0.17.0.dev20220507
- threadpoolctl ==3.1.0
- tinycss2 ==1.1.1
- toolz ==0.11.2
- tornado ==6.1
- tqdm ==4.60.0
- traitlets ==5.1.1
- typing_extensions ==4.2.0
- urllib3 ==1.26.9
- wcwidth ==0.2.5
- webencodings ==0.5.1
- websocket-client ==1.3.2
- zipp ==3.8.0
- AutoROM ==0.4.2
- AutoROM.accept-rom-license ==0.4.2
- GitPython ==3.1.27
- Keras-Preprocessing ==1.1.2
- Markdown ==3.3.7
- Pillow ==9.1.0
- PyYAML ==6.0
- Pygments ==2.12.0
- Werkzeug ==2.1.2
- absl-py ==1.0.0
- ale-py ==0.7.5
- astunparse ==1.6.3
- atari-py ==0.2.9
- backcall ==0.2.0
- cached-property ==1.5.2
- cachetools ==5.1.0
- certifi ==2021.10.8
- charset-normalizer ==2.0.12
- chex ==0.1.3
- clang ==5.0
- click ==8.1.3
- cloudpickle ==2.0.0
- cycler ==0.11.0
- decorator ==5.1.1
- dm-haiku ==0.0.6
- dm-tree ==0.1.7
- docker-pycreds ==0.4.0
- dopamine-rl ==4.0.2
- flatbuffers ==2.0
- flax ==0.4.2
- future ==0.18.2
- gast ==0.4.0
- gin-config ==0.5.0
- gitdb ==4.0.9
- google-auth ==2.6.6
- google-auth-oauthlib ==0.4.6
- google-pasta ==0.2.0
- grpcio ==1.46.1
- gym ==0.23.1
- gym-notices ==0.0.6
- h5py ==3.1.0
- idna ==3.3
- importlib-metadata ==4.11.3
- importlib-resources ==5.7.1
- ipdb ==0.13.9
- ipython ==7.33.0
- jax ==0.3.13
- jaxlib ==0.3.10
- jedi ==0.18.1
- jmp ==0.0.2
- keras ==2.6.0
- kiwisolver ==1.4.2
- matplotlib ==3.4.2
- matplotlib-inline ==0.1.3
- msgpack ==1.0.3
- numpy ==1.21.6
- oauthlib ==3.2.0
- opencv-python ==4.5.5.64
- opt-einsum ==3.3.0
- optax ==0.1.2
- pandas ==1.3.5
- parso ==0.8.3
- pathtools ==0.1.2
- pexpect ==4.8.0
- pickleshare ==0.7.5
- promise ==2.3
- prompt-toolkit ==3.0.29
- protobuf ==3.20.1
- psutil ==5.9.0
- ptyprocess ==0.7.0
- pyasn1 ==0.4.8
- pyasn1-modules ==0.2.8
- pygame ==2.1.2
- pyglet ==1.4.11
- pyparsing ==3.0.9
- python-dateutil ==2.8.2
- pytz ==2022.1
- requests ==2.27.1
- requests-oauthlib ==1.3.1
- rsa ==4.8
- scipy ==1.7.3
- sentry-sdk ==1.5.12
- setproctitle ==1.2.3
- shortuuid ==1.0.9
- six ==1.16.0
- smmap ==5.0.0
- tabulate ==0.8.9
- tensorboard ==2.9.0
- tensorboard-data-server ==0.6.1
- tensorboard-plugin-wit ==1.8.1
- tensorflow ==2.6.0
- tensorflow-estimator ==2.6.0
- tensorflow-probability ==0.16.0
- termcolor ==1.1.0
- tf-slim ==1.1.0
- toml ==0.10.2
- toolz ==0.11.2
- tqdm ==4.64.0
- traitlets ==5.2.1.post0
- typing-extensions ==4.2.0
- urllib3 ==1.26.9
- wandb ==0.12.16
- wcwidth ==0.2.5
- wrapt ==1.12.1
- zipp ==3.8.0