Comments (12)
Yes we should change to remove the actual model path from onnx.hub (was updated by #5267) instead of model_name
as @justinchuby suggested since the CI will easily break if it is running out of space.
However, 184 seems workable 2 weeks ago: https://github.com/onnx/onnx/actions/runs/8776485339/job/24080049585 and so there might be other issue. Still, we can help @ramkrishna2910 to add "test ONNX Model Zoo" label in his PR to test the fix in advance. I can help review PR as well. Thanks!
from onnx.
I can submit a fix for that.
Much appreciated! I think running the tests on all models is ok, given that the CI is triggered weekly. I find using tempfile (https://github.com/microsoft/onnxscript/blob/03b55e3cd2aeb5603b4d880a6beb02333af3974a/tools/ir/model_zoo_test/model_zoo_test.py#L28-L32) and multiprocessing https://github.com/microsoft/onnxscript/blob/03b55e3cd2aeb5603b4d880a6beb02333af3974a/tools/ir/model_zoo_test/model_zoo_test.py#L105 helpful.
from onnx.
Hi @justinchuby
Thanks for pointing this out. Let me take a look at this.
from onnx.
the lines
# remove the model to save space in CIs
if os.path.exists(model_name):
os.remove(model_name)
may not be functioning correctly because I don't believe model_name
is the actual path to the downloaded model
from onnx.
I am running the test locally and I dont see any issues so far after running through 20 models. I will let the test run all the way to see where it fails.
I dont see an exception captured on CI for this failure, which is surprising.
from onnx.
Could be disk out of space if the models are not properly cleared
from onnx.
Yeah thats likely. We are downloading 184 models.
from onnx.
Can be updated using this example: https://github.com/microsoft/onnxscript/pull/1489/files
from onnx.
I was able to run the entire test on my local machine (it took a while). The test completed without any failures but as you suspected the cache is not being deleted.
I can submit a fix for that. Also, do we want to run this test on all 184 models? I believe we can reduce the number of models in this flow to a few representative ones. Thoughts?
from onnx.
Forgot to add the label
from onnx.
Now runs fine. Thanks @ramkrishna2910 ! There are four failures:
--------------Time used: 0.31854987144470215 secs-------------
In all 184 models, 4 models failed, 25 models were skipped
ResNet-preproc failed because: Field 'type' of 'value_info' is required but missing.
VGG 16-bn failed because: /Users/runner/work/onnx/onnx/onnx/version_converter/adapters/transformers.h:35: operator(): Assertion `node->i(attr) == value` failed: Attribute spatial must have value 1
VGG 19-bn failed because: /Users/runner/work/onnx/onnx/onnx/version_converter/adapters/transformers.h:35: operator(): Assertion `node->i(attr) == value` failed: Attribute spatial must have value 1
SSD-MobilenetV1 failed because: [ShapeInferenceError] Inference error(s): (op_type:Loop, node name: generic_loop_Loop__48): [ShapeInferenceError] Inference error(s): (op_type:If, node name: Postprocessor/BatchMultiClassNonMaxSuppression/map/while/PadOrClipBoxList/cond_If__115): [ShapeInferenceError] Inference error(s): (op_type:Concat, node name: Postprocessor/BatchMultiClassNonMaxSuppression/map/while/PadOrClipBoxList/cond/concat): [ShapeInferenceError] All inputs to Concat must have same rank. Input 1 has rank 1 != 2
from onnx.
I will create a separate issue
from onnx.
Related Issues (20)
- s390x test failures - test_make_tensor_raw HOT 1
- Shape inference check fails with external data. HOT 4
- Shape Inference crash on Gemm
- onnx.checker crashes on STFT
- onnx.checker crashes on LayerNormalization
- Importing `onnx==1.16.1` causes a segmentation fault on MacOS 11 (Big Sur) HOT 7
- The model is converted to onnx format using dynamic batch precision collapse HOT 8
- Compatibility with numpy>=2.0
- version_converter can't convert model to opset21 HOT 1
- ONNX check SIGSEV when checking the attached model HOT 3
- Tensor and Integer Comparison Problem in ONNX Export HOT 4
- reporting a vulnerability of download_model function HOT 10
- How can "then_branch" and "else_branch" of "if-op" support input from out of subgraph? HOT 11
- ONNX checker does not validate C90 identifier compatibility. HOT 2
- onnx.reference: Cast to float8e4m3fnuz treats +/-inf wrong HOT 2
- Why Exporation into ONNX will cost way much higher RAM? HOT 1
- External Data Conversion is not saving most data in page aligned 4k sizes. Therefore, mmap support disabled for these initializers
- `numpy_helper.from_array` fails with `onnx._custom_element_types` `int4` and `uint4` HOT 3
- onnx.utils.extract_model failed to extract subgraph from whisper-tiny-decoder HOT 2
- pytroch2onnx CheckerError: Node .. input 0 is marked single but has an empty string in the graph
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from onnx.