r7vme / learning-to-drive-in-a-day Goto Github PK
View Code? Open in Web Editor NEWImplementation of reinforcement learning approach to make a car learn to drive
License: MIT License
Implementation of reinforcement learning approach to make a car learn to drive
License: MIT License
Hi @r7vme,
I have a quick question about controller.py
. Can you please say what is the goal of this script? I am a bit confuse about the overall approach. I understand that the Gym environment has been modified such that the input is the latent representation (coming from VAE) of each image (and not the raw image), but the rest of you approach is still unclear. Pluse, It seems you save the raw images in a buffer, but why, when you have a latent representation. Would you please clarify a bit?
Hi,
Great work.
A quick question- How did you train VAE? I mean which data-set has been used to train VAE?
Hello,
Thanks for this nice project (and kudos for using stable-baselines ;) ).
I tried to run your code but I have some issues:
With docker:
./run-in-docker.sh
starting DonkeyGym env
Missing DONKEY_SIM_PORT environment var. Using defaults
donkey subprocess started
binding to ('0.0.0.0', 9090)
waiting to load..
No protocol specified
No protocol specified
Error initializing Gtk+
No protocol specified
waiting to load..
I have intel graphics but the problem is still here even when removing device=/dev/dri:/dev/dri
In a conda env:
Everything starts properly except that I got this error message in the terminal:
Expecting property name enclosed in double quotes: line 1 column 67 (char 66) failed to read json {"msg_type":"telemetry","steering_angle":0,"throttle":0 ...
Trying the scripts from the sdsandbox repo, I ran into the same issue (see https://github.com/tawnkramer/sdsandbox/issues/11), so I assume the problem comes from my Unity version that may be too recent.
What version are you using?
System Info:
pip -r requirements.txt
Right now policy learning process is vague and does not improve over time (i.e. good policy can be learned in 3 episodes, but after 10 episodes policy can degrade completely).
So after spending time adjusting optimal buffer size and optimization steps, i see that it's pretty random. I assume that prioritized experience buffer for DDPG will help imrove situation.
In short, it will make sure
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.