Comments (3)
Hi @fisherxue, we had a similar discussion in this issue. Please look at the entire discussion and try the steps listed there. If you're still running into issues, let me know.
from senseact.
What I've done: saved the random state in a file, then loaded it using pickle. I do this before the env is loaded. I then set the tensorflow random seed and the python random seed after sess.enter()
I am also passing the random state I load from file into the environment.
I'm fairly sure my hardware is not the bottleneck.
I am able to get relatively consistent results when I run two simulations at the same time. However, when I run one after the other, I get vastly different results. Any advice?
This is what I have:
# use fixed random state
with open('random.obj', 'rb') as f:
rand_state = pickle.load(f)
np.random.set_state(rand_state)
tf_set_seeds(np.random.randint(1, 2**31 - 1))
#Create Asynchronous Simulation of InvertedDoublePendulum-v2 mujoco environment.
env = DoubleInvertedPendulumEnv(agent_dt=0.005,
sensor_dt=[0.01, 0.0033333],
is_render=False,
random_state=rand_state
)
# Start environment processes
env.start()
# Create baselines ppo policy function
sess = U.single_threaded_session()
sess.__enter__()
seed = np.random.randint(1, 2**31 - 1)
tf.set_random_seed(seed)
random.seed(seed)
Thanks!
from senseact.
@gauthamvasan
I'm getting this warning when running on one machine:
WARNING:root:Agent has over-run its allocated dt, it has been 0.008300065994262695 since the last observation, 0.003300065994262695 more than allowed
However, on the other machine I'm running it on, I only get that warning at the start of each iteration.
I'm still failing to get tight repeatability curves on double pendulum.
Any tips?
Thanks!
from senseact.
Related Issues (20)
- Soft-Q learning implementation HOT 2
- UR Reacher on CB2 HOT 13
- Change from RT Interface to RTDE to support future e-series versions - UR Reacher
- UR Reacher on UR e-series HOT 9
- DDPG + HER to replace TRPO HOT 7
- How do I implement this? HOT 1
- Warning: incomplete packet from UR HOT 12
- Where does the experiment data get saved to? HOT 5
- the meaning of Hyperparameters
- Refactor dxl_reacher and dxl_tracker to allow user to specify port and baud. HOT 1
- Update dxl_reacher and dxl_tracker to use new baselines function call format
- dxl_basic_functions.py: rewrite to allow configurable port, id, baud, etc.
- Simulating UR in ROS HOT 10
- UR 3.5.4.10845 ur5_reacher.py Error
- Remove custom PID controller from DXL envs
- Aggregate shared code in DXL and Create2 envs
- Extract communicators for Create environments
- Add check for latency for all serial connections
- Train on specific points? Discrete action_space [with distinct points]!
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from senseact.