Code Monkey home page Code Monkey logo

kair_algorithms_draft's People

Contributors

curt-park avatar geonhee-lee avatar kdh0429 avatar mch5048 avatar mrsyee avatar ohwi avatar seungjaeryanlee avatar whikwon avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar

kair_algorithms_draft's Issues

Support for time-step management during training

현재 episode 별로 학습 진행이 되고 param save도 그렇게 되는데, time-step 별로 적용하는게 필요해보입니다.

  • TD3 actor update 시에 진행 time-step에 대한 조건 존재.
  • 대부분의 논문들이 time-step(x축)을 기준으로 성능 비교

How to run sacfd with open_manipulator in Gazebo?

Dear All:

I have been trying to run sacfd in Gazebo environment. To do this I have to firstly copy a sacfd (e.g. scripts/config/agent/lunarlander_continuous_v2/sacfd.py) into scripts/config/agent/open_manipulator_reacher_v0/sacfd.py

However, when I run the sacfd with the command "python run_open_manipulator_reacher_v0.py --algo sacfd --off-render", I got the following error:
Traceback (most recent call last):
File "/home/yz/research/robotics/yumi_ws/src/kair_algorithms_draft/scripts/run_open_manipulator_reacher_v0.py", line 78, in
main()
File "/home/yz/research/robotics/yumi_ws/src/kair_algorithms_draft/scripts/run_open_manipulator_reacher_v0.py", line 74, in main
agent.train()
File "/home/yz/research/robotics/yumi_ws/src/kair_algorithms_draft/scripts/algorithms/sac/agent.py", line 372, in train
loss = self.update_model(experiences)
File "/home/yz/research/robotics/yumi_ws/src/kair_algorithms_draft/scripts/algorithms/fd/sac_agent.py", line 88, in update_model
new_actions, log_prob, pre_tanh_value, mu, std = self.actor(states)
File "/usr/local/lib/python2.7/dist-packages/torch/nn/modules/module.py", line 532, in call
result = self.forward(*input, **kwargs)
File "/home/yz/research/robotics/yumi_ws/src/kair_algorithms_draft/scripts/algorithms/common/networks/mlp.py", line 186, in forward
mu, _, std = super(TanhGaussianDistParams, self).get_dist_params(x)
File "/home/yz/research/robotics/yumi_ws/src/kair_algorithms_draft/scripts/algorithms/common/networks/mlp.py", line 152, in get_dist_params
hidden = super(GaussianDist, self).get_last_activation(x)
File "/home/yz/research/robotics/yumi_ws/src/kair_algorithms_draft/scripts/algorithms/common/networks/mlp.py", line 79, in get_last_activation
x = self.hidden_activation(hidden_layer(x))
File "/usr/local/lib/python2.7/dist-packages/torch/nn/modules/module.py", line 532, in call
result = self.forward(*input, **kwargs)
File "/usr/local/lib/python2.7/dist-packages/torch/nn/modules/linear.py", line 87, in forward
return F.linear(input, self.weight, self.bias)
File "/usr/local/lib/python2.7/dist-packages/torch/nn/functional.py", line 1370, in linear
ret = torch.addmm(bias, input, weight.t())
RuntimeError: size mismatch, m1: [64 x 11], m2: [25 x 256] at /pytorch/aten/src/TH/generic/THTensorMath.cpp:136

I found the problem is the default data has dimension 11 but the observation space of the environment has the dimension 25. The default demo data is in "scripts/data/reacher_demo.pkl"

So I suspect the default demo data is wrong. I used the the script run_open_manipulator_demo.py to regenerate the demo data, but the format is in json and the agent training requires pkl data file?

Thank you!

ImportError: No module named pykdl_utils.kdl_kinematics

Dear All:

After running the command "rosrun kair_algorithms run_open_manipulator_reacher_v0.py --algo ddpgfd --off-render --log", I got the error:

Traceback (most recent call last):
File "/home/local/ha3/mi_ws/src/kair_algorithms_draft/scripts/run_open_manipulator_reacher_v0.py", line 16, in
from envs.open_manipulator.open_manipulator_reacher_env import OpenManipulatorReacherEnv
File "/home/local/ha3/mi_ws/src/kair_algorithms_draft/scripts/envs/init.py", line 1, in
from .open_manipulator import OpenManipulatorReacherEnv
File "/home/local/ha3/mi_ws/src/kair_algorithms_draft/scripts/envs/open_manipulator/init.py", line 1, in
from .open_manipulator_reacher_env import OpenManipulatorReacherEnv
File "/home/local/ha3/mi_ws/src/kair_algorithms_draft/scripts/envs/open_manipulator/open_manipulator_reacher_env.py", line 7, in
from ros_interface import (
File "/home/local/ha3/mi_ws/src/kair_algorithms_draft/scripts/envs/open_manipulator/ros_interface.py", line 16, in
from pykdl_utils.kdl_kinematics import KDLKinematics
ImportError: No module named pykdl_utils.kdl_kinematics

Thanks for helping!

Save replaymemory for resume training

resume training 시에 저장된 param을 사용하더라도 memory를 불러오지 않으면 동일한 조건에서 resume 할 수 없게 된다.

  • param 저장 시에 memory save 기능 추가

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.