An open API service providing repository metadata for many open source software ecosystems.

Topic: "cross-entropy-method"

alwaysbyx/Optimization-and-Search

Implementation and visualization (some demos) of search and optimization algorithms.

Language: Python - Size: 79.1 MB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 15 - Forks: 2

dhananjaisharma10/Model-based-Reinforcement-Learning

Model-based reinforcement learning using CEM, MPC and PETS

Language: Python - Size: 58.6 KB - Last synced at: almost 2 years ago - Pushed at: over 5 years ago - Stars: 13 - Forks: 0

americocunhajr/CE-ABC

CE-ABC is a code for simulating epidemic outbreaks with mechanistic models through a cross-entropy approximate Bayesian framework.

Language: MATLAB - Size: 68.1 MB - Last synced at: 2 days ago - Pushed at: 21 days ago - Stars: 7 - Forks: 1

jinning-li/cem-torch

Cross Entropy Method (CEM) implemented in PyTorch, supporting a batch dimension and receding-horizon-style optimization.

Language: Python - Size: 9.77 KB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 0

ngctnnnn/Simulation_experiments_for_optimizing_objective_function

Simulation experiments for optimizing objective functions with Differential Evolution, Evolution Strategies, and the Cross Entropy Method (2 versions)

Language: Jupyter Notebook - Size: 416 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 2

Prakhar-FF13/Reinforcement-Learning-With-Python

Reinforcement Learning Notebooks

Language: Python - Size: 115 KB - Last synced at: about 1 month ago - Pushed at: about 6 years ago - Stars: 3 - Forks: 1

vkurenkov/cem-tetris

Solving Tetris using Cross-Entropy Method

Language: Haskell - Size: 8.81 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 3 - Forks: 0

americocunhajr/CEopt

CEopt is a Matlab routine for non-convex optimization using the Cross-Entropy method and augmented Lagrangian formulation.

Language: MATLAB - Size: 12.1 MB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 2 - Forks: 0

Phrungck/reinforcement-learning-models

Simple implementation and comparison of three reinforcement learning models.

Language: Jupyter Notebook - Size: 1.42 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

nslyubaykin/mbrl_multitasking

Model-Based RL Multi-Tasking with ReLAx

Language: Jupyter Notebook - Size: 11.1 MB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

americocunhajr/FraCTune

FraCTune is a Matlab package for tuning fractional-order controllers with the Cross-Entropy method and augmented Lagrangian formulation.

Language: MATLAB - Size: 32.5 MB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 1 - Forks: 0

americocunhajr/CROSS-OPT

CROSS-OPT is a Matlab package for optimizing truss structures with the Cross-Entropy method and augmented Lagrangian formulation.

Language: MATLAB - Size: 20.2 MB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 1 - Forks: 0

americocunhajr/SpringpotTune

SpringpotTune is a Matlab package designed for fitting variable-order springpot models using the Cross-Entropy method.

Language: MATLAB - Size: 2.22 MB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 1 - Forks: 0

qureshinomaan/dmp_primitives_pytorch

Tools for using motion primitives like Dynamic Motion Primitives or Differentiable Linear Dynamic Systems in PyTorch.

Language: Python - Size: 8.79 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

bmaxdk/OpenAI-Gym-MountainCar-v0-CrossEntropy

Training with the Cross-Entropy Method, a policy-based method, on OpenAI Gym's MountainCarContinuous environment

Language: Jupyter Notebook - Size: 48.8 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

msuvorov7/base_deep_learning

Implementation of base DL tasks

Language: Jupyter Notebook - Size: 7.44 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 0 - Forks: 0

deepbiolab/drl

Implementation of deep reinforcement learning

Language: Jupyter Notebook - Size: 30.7 MB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

TienNHM/CartPole-CrossEntropyMethod

CartPole-CrossEntropyMethod

Language: Jupyter Notebook - Size: 356 KB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

LDRyan0/2D-Cross-Entropy-Optimisation

Two-dimensional optimisation algorithm using the Cross Entropy Method. Data is iteratively fitted to a Beta distribution in the algorithm.

Language: Jupyter Notebook - Size: 9.2 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

yashsriram/wall-e

A neural-network controller for a differential-drive agent to reach a goal.

Language: Rust - Size: 526 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

nslyubaykin/relax_cem_example

Example CEM implementation with ReLAx

Language: Jupyter Notebook - Size: 10.4 MB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

architsakhadeo/Automated-Hyperparam-Tuning

Automated tuning of hyperparameters using Cross Entropy Method for optimization (CEM).

Language: Python - Size: 2.96 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

antonio-f/CrossEntropy-Method

Cross-Entropy method example on OpenAI Gym's MountainCarContinuous environment. Code is from Udacity's "Deep Reinforcement Learning Nanodegree Program"

Language: Jupyter Notebook - Size: 43 KB - Last synced at: 26 days ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

JayLohokare/gradient-ascent-stochastic-policy-learning

Gradient ascent on the OpenAI CartPole environment

Language: Jupyter Notebook - Size: 7.81 KB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 0 - Forks: 0

chrisfosterelli/intro-to-rl-talk

Workshop code for the talk on Introduction to Reinforcement Learning: https://fosterelli.co/file/talk/introduction-to-reinforcement-learning.pdf

Language: Python - Size: 6.84 KB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 0 - Forks: 0

Related Topics
reinforcement-learning (9), deep-reinforcement-learning (4), nonconvex-optimization (4), pytorch (3), cem (3), machine-learning (3), python (2), q-learning (2), sarsa (2), cross-entropy (2), openai-gym (2), optimization-methods (2), actor-critic (2), augmented-lagrangian (2), optimization (2), policy-iteration (2), value-iteration (2), fractional-calculus (2), reinforcement-learning-algorithms (2), model-based-reinforcement-learning (2), continuous-control (2), policy-based-method (2), rl (2), policy-gradient (2), model-predictive-control (2), policy-evaluation (1), structural-mechanics (1), monte-carlo-methods (1), size-optimization (1), simulation-based-optimization (1), shape-optimization (1), modern-control (1), fractional-controller (1), control-systems (1), ppo (1), optimization-tools (1), optimization-technique (1), optimization-library (1), optimization-framework (1), optimization-algorithms (1), prioritized-dqn (1), optimization-algorithm (1), matlab-toolbox (1), reinforce (1), temporal-difference (1), tile-coding (1), multi-task-rl (1), multi-task-reinforcement-learning (1), value-based-methods (1), monte-carlo (1), markov-decision-processes (1), deep-q-learning (1), deep-learning (1), python3 (1), openai (1), gym-environment (1), temporal-differencing-learning (1), advantage-actor-critic (1), alphazero (1), cartpole (1), deep-deterministic-policy-gradient (1), word2vec (1), ssd300 (1), dqn (1), quartznet (1), qlearning (1), pixelcnn (1), dueling-ddqn (1), dqn-agents (1), crf-pos-tagging (1), hill-climbing (1), mountaincar-coninuous (1), gym (1), structural-optimization (1), mc-control (1), functional-programming (1), codeworld (1), cem-tetris (1), hyperparameter-tuning (1), frozenlake-v0 (1), cliffwalking (1), mechanistic-models (1), epidemiology (1), epidemic-simulations (1), computational-epidemiology (1), computational-biology (1), compartmental-models (1), bayesian-inference (1), approximate-bayesian-computation (1), robotics (1), planning (1), motion-prior (1), motion-primitives (1), linear-dynamical-systems (1), dynamic-motion-primitives (1), cross-entropy-method-pytorch (1), trajectory-sampling (1), probabilistic-ensemble (1), model-based-rl (1), relu (1)