Comments (5)
Thank you for bringing this up @SteveLauC. I had the same experience when I first starting working on DataFusion. You can work around memory limitations by running cargo test -- --test-threads=1
which will only run a single test at a time. It will be slower but consume substantially less memory. I think it would be a good idea to document this workaround since many new contributors won't have enough RAM to run all tests at max parallelism on their system.
The precise memory requirements to run cargo test
will vary over time and depend on your exact development set up. Running in wsl
on Windows for example is a bit more intensive as you need to reserve some memory for Windows. I personally upgraded from 32->64GB and that was plenty running natively on linux.
from arrow-datafusion.
I think the doctests are the main culprit, see #5347
Agree on documenting this as part of contributor/developer guide 👍
from arrow-datafusion.
cargo test -- --test-threads=1
Thanks for showing me this! This indeed makes the memory usage controllable:)
from arrow-datafusion.
Glad that worked for you! I opened #9402 to document this in the contributors guide.
There does not seem to be a way to configure Cargo to default to --test-threads 1
other than setting environment variables, so documentation may be our best bet for now... see rust-lang/cargo#8430
from arrow-datafusion.
I had my computer upgraded to 64GB of ram - the workaround mentioned above I used as well but the time to run the tests was using that was horrendous. Using linux under WSL2 in windows here.
from arrow-datafusion.
Related Issues (20)
- Add an example of how to use the SQL parser/unparser API HOT 2
- Support Substrait VirtualTables
- Discussion: make it easier for specify SQL --> function translation HOT 3
- Create a DataFusion blog HOT 2
- `analysis.rs` bounds check panic HOT 2
- AggregateUDF expression API design HOT 1
- Example for building an external index for parquet files
- `array_slice` can't correctly handle NULL parameters or some edge cases
- Add an example of how to convert LogicalPlan to/from SQL Strings
- Cast from string to date with "/" HOT 2
- error: this arithmetic operation will overflow (on i386)
- REmove workaround for `COUNT(*)` in subquery decorrelation code
- Make SQL strings generated from `Expr`s "prettier" HOT 11
- Dynamic schema for custom TableProvider HOT 3
- ScalarValue serialization does not handle nested dictionary values
- Using `Expr::field` panics HOT 3
- Improve signature of `get_field` is function
- UserDefinedLogicalNode::from_template does not return a Result<...> >
- UserDefindedLogicalNode::from_template does not return a Result<...>. HOT 3
- Row groups are read out of order or with completely different values HOT 4
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from arrow-datafusion.