Comments (4)
@waltersharpWEI Hi, thanks for the issue. The short answer is that Decision Transformer algorithms don't support any of evaluators because they're essentially different from Q-learning-based algorithms. Please let me close this issue since it's not a bug.
from d3rlpy.
Sorry, looking at your code again, you're not using evaluators technically. For inference, please check this documentation for the usage.
https://d3rlpy.readthedocs.io/en/v2.5.0/references/algos.html#decision-transformer
from d3rlpy.
# start training (save logs to LOGS_DIR)
dt.fit(
dataset,
n_steps=100,
n_steps_per_epoch=10,
eval_target_return=0,
# manually specify action-sampler
eval_action_sampler=d3rlpy.algos.IdentityTransformerActionSampler(),
)
actor = dt.as_stateful_wrapper(
target_return=0,
action_sampler=d3rlpy.algos.IdentityTransformerActionSampler(),
)
# interaction
observation, reward = env.reset(), 0.0
for i in range(100):
action = actor.predict(observation, reward)
observation, reward, done, truncated, _ = env.step(action)
if done or truncated:
break
print("Complete../")
Thanks for your reply, I modified the code in the above form.
I just have one question. Does the inference must be done using the .as_stateful_wrapper() function?
So does that mean the predict() is not supposed to be used in the inference?
from d3rlpy.
It's kind of right. This is because of the nature of stateful behavior of Decision Transformer, which requires carefully crafted inputs. I don't think users want to see this level of complication:
d3rlpy/d3rlpy/algos/transformer/base.py
Line 83 in bad85ac
from d3rlpy.
Related Issues (20)
- [Question]Just want to make sure that the "environment" metric collected by the logger of offline RL algorithms is the result used in papers HOT 2
- d3rlpy install d4rl HOT 4
- ValueError: too many values to unpack (expected 4) when using hopper-medium-v0 environment HOT 6
- [BUG] How to continue training from a save checkpoint HOT 2
- [QUESTION] len(observation_shape) == 1 HOT 5
- [BUG] saving and loading model with custom network gives KeyError: 'custom' HOT 3
- [QUESTION] Offline Learning via custom MDPDataset HOT 2
- gym version incompatibility HOT 2
- Differences in RTG computation between inference and training time HOT 2
- [QUESTION] Custom evaluators not working. HOT 4
- [QUESTION] How to implement a new algorithm? Any guidelines? HOT 1
- [QUESTION] Importing Stable Baselines3 (SB3) Models into d3rlpy HOT 4
- [QUESTION] Issue with Evaluating Decision Transformer Using Evaluators in d3rlpy HOT 1
- [BUG] FQE Loading .d3 errors HOT 1
- a question about d3rlpy 1.1.1 install bug HOT 2
- Issue when using d3rlpy.load_learnable() to load trained model HOT 3
- [REQUEST] Annealing schedule of hyperparameters HOT 1
- create own environment HOT 4
- [Compilation error] Cython.Compiler.Errors.CompileError when running hopper example HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from d3rlpy.