Build software better, together

VachanVY / Reinforcement-Learning

PyTorch implementations of algorithms from "Reinforcement Learning: An Introduction by Sutton and Barto", along with various RL research papers.

reinforcement-learning deep-reinforcement-learning pytorch artificial-intelligence dqn policy-gradient deep-deterministic-policy-gradient ddpg-algorithm proximal-policy-optimization actor-critic-algorithm dqn-pytorch rl-book sutton-barto-book policy-gradient-with-baseline actor-critic-pytorch soft-actor-critic-continuous ppo-algorithm reinforcement-learning-an-introduction

Updated Aug 14, 2025
Python

kochlisGit / TraderNet-CRv2

Star

TraderNet-CRv2 - Combining Deep Reinforcement Learning with Technical Analysis and Trend Monitoring on Cryptocurrency Markets

Updated Oct 2, 2023
Jupyter Notebook

mohammadzainabbas / Reinforcement-Learning-CS

Star

💡 Grasp - Pick-and-place with a robotic hand 👨🏻‍💻

python reinforcement-learning physics-engine pytorch mamba sac gym-environment ppo model-free-rl ppo-agent brax ppo-algorithm

Updated Dec 15, 2023
Jupyter Notebook

negarhonarvar / DeepReinforcementLearning

Star

A Complete Collection of Deep RL Famous Algorithms implemented in Gymnasium most Popular environments

dqn boltzmann-exploration sarsa lunar-lander cartpole-v1 d3qn swimmer softmax-exploration drl-algorithms ppo-algorithm gymnasium-environment

Updated Apr 13, 2025
Python

amin-sharifi-github / quant-rl-trading-agent

Star

End-to-end RL trading framework with PPO agent, self-attention neural network, custom Gym environment, and advanced backtesting.

reinforcement-learning ai algotrading reinforcement-learning-algorithms trading-algorithms quantitative-finance attention-mechanism quantitative-trading backtesting trading-systems gym-environment reinforcement-learning-agent financial-machine-learning quantitative-research market-simulation stable-baselines3 ppo-algorithm

Updated Aug 6, 2025
Python

RhizomaticRobin / DexiGrab-Robot-Hand

Star

LibreGrabbe 16-DOF Robot Hand

open-source machine-learning opensource robot robotics simulation pytorch robots isaac open-source-hardware ppo robothand ros2-humble ppo-algorithm isaac-lab

Updated Apr 1, 2025
Roff

Sujyeet / SPEED-Intelligent-Racing-Agents

Star

Comparative research platform for Deep Reinforcement Learning and heuristic controllers in autonomous racing. Benchmarks DRL (PPO) agents against deterministic baselines in Unity MiniKart, with full reproducibility, human-like evaluation, and performance logs.

benchmarking machine-learning unity3d deep-reinforcement-learning reproducibility game-ai autonomous-agents mlagents autonomous-racing ppo-algorithm

Updated Sep 4, 2025
C#

omerjakoby / MARIO-RL-PPO

Star

This repository implements a Proximal Policy Optimization (PPO) agent that learns to play Super Mario Bros using TensorFlow/Keras and OpenAI Gym. Features CNNs for vision, Actor-Critic architecture, and parallel environments. Train your own Mario master or run a pre-trained one!

machine-learning tensorflow keras openai-gym cnn actor-critic mario-game proximal-policy-optimization ppo reinforcement-learning-agent ppo-algorithm

Updated Jun 1, 2025
PureBasic

unaizaahmedk / Balancing-Inverted-Pendulum-using-RL

Star

Reinforcement learning–based controller for balancing an inverted pendulum using Proximal Policy Optimization (PPO). Supports configurable mass, length, and gravity settings (Earth, lunar, microgravity) with automated training logs, reward visualization, and performance analysis.

reinforcement-learning openai-gym reinforcement-learning-algorithms inverted-pendulum ppo-algorithm

Updated Aug 13, 2025
Python

zxy-tech / ppo-for-S-P-500-trading-strategy

Star

This is a project for PPO S&P 500 trading

time-series-forecasting stockprediction stocktrader ppo-algorithm

Updated Mar 10, 2025
Python

Lakshvashishtha / DOOM-RL-agent

Star

python reinforcement-learning ppo-algorithm

Updated May 15, 2023
Jupyter Notebook

nikhilgrad / Reinforcement-Learning-Model-for-Super-Mario

Star

An RL based model using PPO algorithm leveraging OpenAI Gym environment to play the popular Super Mario game.

reinforcement-learning openai-gym pytorch ppo-algorithm

Updated Feb 2, 2025
Jupyter Notebook

ashioyajotham / weather_forecasting_lora

Star

How close can LoRA get to full fine-tuning (FullFT) in terms of learning speed, performance, and compute tradeoffs? And under what conditions?

finetuning weather-forecasting thinking-machines rlhf ppo-algorithm mistral-7b lora-fine-tuning

Updated Oct 12, 2025
Python

green-hat-001 / NASA-Space-Apps-Commercialising-LEO-by-OptimAI

Star

2D orbital rocket sim with PPO in PyTorch. Models thrust, drag, gravity, fuel; agent learns efficient ascent. Includes telemetry & visualization

ai python3 rocketry ppo-algorithm

Updated Oct 4, 2025
Python

mafaldaaires / Reinforcement-Learning

Star

Stable Baselines3

gymnasium a2c-algorithm car-racing-environment ppo-algorithm

Updated Dec 26, 2023
Python

sayeang / AWS-DeepRacer-Autonomous-Racing-Model

Star

Developed-an-AWS-DeepRacer-model-using-Python-&-the-PPO-algorithm,-leveraging-TensorFlow-to-train-&-fine-tune-a-deep-reinforcement-learning-model.-Designed-a-custom-reward-function-&-optimized-hyperparameters-to-improve-policy-learning-&-navigation-performance.-Utilized-AWS-infrastructure-for-scalable-training-&-deployment.

python training aws machine-learning scalable deep-learning deployment tensorflow rl hyperparameter-tuning aws-infrastructure deepracer aws-deepracer ppo-algorithm

Updated Apr 26, 2025

anjaliy11 / ReinforcementLearning

Star

This repository explores Reinforcement Learning (RL) through hands-on implementations of key algorithms and environments. It demonstrates how agents learn by interacting with environments, optimizing rewards, and adapting to tasks ranging from Atari games to autonomous driving and custom simulations.

python machine-learning reinforcement-learning tensorboard tensorboard-visualizations a2c-algorithm ppo-algorithm gymnasium-environment