Comments (3)
@Ilyushin Thanks for reporting the issue. Can you provide more details so we can reproduce it on our end?
- Did you use one of our Merlin containers, e.g., nvcr.io/nvidia/merlin/merlin-pytorch:22.11, or did you install it with conda or pip?
- What package versions do you see when you run
  python -c 'import merlin.core; print(merlin.core.__version__)'
  and
  python -c 'import merlin.dataloader; print(merlin.dataloader.__version__)'
- Is it possible to provide us with the dataset schema (train_ds.schema)?
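The version checks requested above could also be collected in one pass. The following is only a minimal sketch: report_versions is a hypothetical helper name, and adding cudf to the list is an assumption on my part, not something requested in the thread.

```python
import importlib


def report_versions(module_names):
    """Return a dict mapping each module name to its version or a status note.

    Hypothetical helper: it only reads the conventional __version__
    attribute, which a given package may or may not define.
    """
    versions = {}
    for name in module_names:
        try:
            module = importlib.import_module(name)
        except ImportError:
            # Covers ModuleNotFoundError as well (it subclasses ImportError).
            versions[name] = "not installed"
            continue
        except Exception as exc:
            # Some GPU libraries can fail at import time for other reasons
            # (for example, no usable driver); record the error type only.
            versions[name] = f"import failed: {type(exc).__name__}"
            continue
        versions[name] = str(getattr(module, "__version__", "unknown"))
    return versions


if __name__ == "__main__":
    # merlin.core and merlin.dataloader come from the thread; cudf is an
    # assumed addition to this list.
    for name, version in report_versions(
        ["merlin.core", "merlin.dataloader", "cudf"]
    ).items():
        print(f"{name}: {version}")
```

Pasting the printed lines into the issue should give the same information as running the two one-liners separately.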
from dataloader.
@edknv Thank you for helping.
- I have used nvcr.io/nvidia/pytorch:22.06-py3
- I tried to use 0.0.2 and 0.0.3
- I downloaded this dataset - https://www.kaggle.com/code/radek1/howto-full-dataset-as-parquet-csv-files
This seems to be due to the version of cudf in the nvcr.io/nvidia/pytorch:22.06-py3 container. In older versions of cudf (prior to 22.04), the keep_index parameter was not available in df.sample().
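One way to confirm whether an installed cudf exposes that parameter is to inspect the method signature. This is a rough sketch, not part of merlin-dataloader: supports_keyword is a hypothetical helper, and the commented usage assumes cudf imports cleanly in your environment.

```python
import inspect


def supports_keyword(func, keyword):
    """Return True if `func` declares a parameter named `keyword`
    that can be passed by keyword.

    Hypothetical helper: it does not treat a catch-all **kwargs as
    support, and it returns False when the signature cannot be read.
    """
    try:
        params = inspect.signature(func).parameters
    except (TypeError, ValueError):
        # Some callables (e.g. certain builtins) expose no signature.
        return False
    param = params.get(keyword)
    if param is None:
        return False
    return param.kind is not inspect.Parameter.POSITIONAL_ONLY


# Assumed usage inside the container (requires cudf to be importable):
#   import cudf
#   print(supports_keyword(cudf.DataFrame.sample, "keep_index"))
```

If the check prints False, the installed cudf likely predates the keep_index parameter described above.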
@Ilyushin Is upgrading your container an option (e.g., to nvcr.io/nvidia/pytorch:22.07-py3, or even the latest 22.12-py3, rather than 22.06)? Please also note that nvcr.io/nvidia/merlin/merlin-pytorch comes with merlin-dataloader pre-installed, so you don't have to install it separately.