Comments (5)
This relates to an issue in Kuberay: ray-project/kuberay#2155
cc: @kevin85421
from ray.
It would be great if we have an API that we can call and get the latest checkpoint location for the previous iteration of the given run.
Do you mean a Ray Train API? It makes sense to me. cc @matthewdeng
from ray.
It would be great if we have an API that we can call and get the latest checkpoint location for the previous iteration of the given run.
Do you mean a Ray Train API? It makes sense to me. cc @matthewdeng
@matthewdeng @kevin85421 , yes. Ray Train API that helps to find the last successful checkpoint.
from ray.
@sathyanarays is this the API you'd be looking for?
from ray.
@matthewdeng , yes. This works. Thank you for the pointers.
from ray.
Related Issues (20)
- [ADAG] Detect if accelerated DAG execution would block based on DAG "capacity"
- CI test linux://rllib:learning_tests_multi_agent_cartpole_crashing_and_stalling_appo_old_api_stack is flaky
- CI test linux://rllib:learning_tests_multi_agent_cartpole_crashing_and_stalling_appo_old_api_stack is flaky
- Ray component: Core - pypi repo
- Release test long_running_impala.aws failed HOT 1
- CI test linux://rllib:learning_tests_multi_agent_cartpole_crashing_and_stalling_appo_old_api_stack is flaky
- Release test long_running_many_ppo.aws failed HOT 1
- Release test rllib_learning_tests_pong_dreamerv3_tf2.aws failed HOT 1
- Release test long_running_impala.aws failed HOT 1
- Release test dask_on_ray_100gb_sort.aws.py311 failed HOT 1
- Release test dask_on_ray_1tb_sort failed HOT 3
- Release test dask_on_ray_100gb_sort.aws failed HOT 1
- rllib/examples/action_masking.py not working on dreamerV3
- [Logs] Integrate Serve's logger with Core's structured logger
- FAIL serialization: cannot pickle '_jpype._JField' object HOT 4
- [ADAG] Better handling for RayDAGTaskError
- [Doc] Make Ask AI button light up when a user clicks on the search bar
- [Ray Assistant] [Docs] broken copy button
- Core๏ผ ray job is blocked by scheduling
- Ray multiprocessing.Pool: core_worker_process.cc:278: The core worker has already been shutdown. HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. ๐๐๐
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google โค๏ธ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from ray.