zeno-ml / zeno Goto Github PK
View Code? Open in Web Editor NEWAI Data Management & Evaluation Platform
Home Page: https://zenoml.com
License: MIT License
AI Data Management & Evaluation Platform
Home Page: https://zenoml.com
License: MIT License
Primarily:
Data processing/Model running with caching:
DataFrame operations:
Users should be shown some automated "insights" about their slices/reports when they first land on the reports tab. This can include new failures, regressions, anomalies, etc. in tabular and graphical form.
Some file formats you want to show less, others more.
For example, metric function might not have a mutation, not need the extra variables. Can check the signature and call the function with the right arguments.
For IMU data, sensor stuff
create a view for tabular data.
Maybe consider making a default table view for Zeno generally that shows the values for all columns? idk.
Users should be able to create folders to organize slices into.
show version and help
The least generalizable piece of Zeno is the instance view.
It would be great to make this modular, where people can install e.g. zeno-pointcloud
and zeno then has an option to take in pointcloud data and visualize it without changing the zeno code itself.
e.g. if I want to create a slice for every category in a metadata column shouldn't have to do it one by one.
the Python API breaks down substantially when trying to use a multimodal model. Need to re-design and update the API to directly support multimodal models.
Slices don't sync being put in folder until clicked.
For example, what if I want to find all images with dogs, and then filter for the dogs that are "red". The first slicing function has to indicate the bounding boxes the second slicing function should focus on.
Initial running of Zeno is super slow, even before running any computation. We should figure out why.
Have to manually run npm run build
after every change when iterating, should set up an npm run dev
Since we re-render the whole sample view each time we change the table, preferences are re-written, such as showing the mask or just groundtruth for segmentation.
We should find a way to preserve these settings across re-renders of the sample view.
Push the new view packages to pypi and make available for new users
Currently you can only do single joins for filters, e.g. (A AND B OR C AND D)
We need to be able to make more complex filters such as (A AND C) OR !(D)
Generally keeping track of issues that will start to come up as we scale to larger datasets:
Can't load in all the data at once, e.g. 100,000 images into memory. Have to batch it for both preprocessing and prediction. ALSO remove from slice API, everything should be metadata.
Instantiate Zeno, pass in functions, and run it - will open new page with UI.
This will require lots of thinking about how to structure the API - do we pass functions in? try to read the notebook itself?
Some metadata columns may be dates or text. We should have a cell visualization for them that provides some overview and filtering.
Support function types given the previous output
Including
Either in exploration or analysis tab, let people create grid view that crosses slices like a confusion matrix to see metrics at different intersections.
Users should be able to select a group of instances and create a new slice. We should also support some views that help people expand their slices such as nearest neighbors and embeddings.
Users should be able to define tests given the mean and standard deviation of the model average instead of hard thresholds.
two main bugs:
Some metrics should be run on every slice, provide an way to stipulate that
Create a thorough walkthrough for getting started with CIFAR explaining different concepts.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.