Simplify Your Visual Data Ops. Find and visualize issues with your computer vision datasets such as duplicates, anomalies, data leakage, mislabels and others.
Hi,
Thanks for the amazing work. You have many predefined datasets in your library. My use case is that I have a dataset which is not listed in your repo. Also, I don't want to upload the dataset to your servers, because it is very large and there are privacy concerns.
Can you show me how to analyze the dataset locally?
As a user, I want to be able to get the descriptions, stats and exploration link without having to download the entire dataset first. This follows the torch convention.
Pandas warnings are raised when calling explore. This could probably be fixed by using .loc or .copy in the call. The warnings could also be silenced through the warnings module.
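For reference, here is a minimal sketch of the three options, assuming the warning is the usual pandas copy/view warning triggered by assigning to a filtered frame. The DataFrame and its column names (`filename`, `issue_type`, `rank`, `flagged`) are made up for illustration and are not taken from the library's code.

```python
import warnings

import pandas as pd

# Hypothetical issue table; the columns are illustrative, not the library's schema.
df = pd.DataFrame(
    {
        "filename": ["a.jpg", "b.jpg", "c.jpg"],
        "issue_type": ["duplicate", "leakage", "duplicate"],
    }
)

# Option 1: take an explicit copy of the filtered rows before adding a column,
# so the assignment targets an independent frame.
duplicates = df[df["issue_type"] == "duplicate"].copy()
duplicates["rank"] = range(len(duplicates))

# Option 2: write to the original frame through a single .loc call
# instead of chained indexing.
df["flagged"] = False
df.loc[df["issue_type"] == "duplicate", "flagged"] = True

# Option 3: silence warnings only around the call that emits them.
# The filter is restored when the with-block exits.
with warnings.catch_warnings():
    warnings.simplefilter("ignore")
    leakage = df[df["issue_type"] == "leakage"]
```

Options 1 and 2 remove the cause of the warning, so they are probably preferable to option 3, which only hides it.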
explore only shows duplicate and leakage issues for the pets dataset. Possibly only the top of the issue list is accounted for?
Document this: .explore loads 50 images by default. Specify num_images as an argument to see more.