Topic: data-curation Goto Github
Some thing interesting about data-curation
Some thing interesting about data-curation
data-curation,TokenEditor is a web application for manual annotation (or manual review of automatic annotations) of text. Albeit primarily aimed at reviewing PoS tags and lemmas, it is fully customizable, to support any annotation levels.
Organization: acdh-oeaw
data-curation,Digitální archiv AMČR
Organization: arup-cas
Home Page: https://digiarchiv.aiscr.cz/
data-curation,Archeologická mapa České republiky
Organization: arup-cas
Home Page: https://amcr-info.aiscr.cz/
data-curation,Python package to make URL extraction, generalization, validation, and filtration easy.
User: bluestero
Home Page: https://pypi.org/project/urlgenie/
data-curation,A web service for semi-automated conversion of raw imaging data to BIDS
Organization: brainlife
Home Page: https://brainlife.io/ezbids
data-curation,Demo showing how the Trustworthy Language Model add reliability to LLM outputs and improves RAG, agents, and data enrichment worfklows. can be used to improve fine-tuning of LLMs, accuracy of LLM outputs, and smart routing for RAG and agents.
User: cgnorthcutt
Home Page: https://help.cleanlab.ai/tutorials/tlm/
data-curation,The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
Organization: cleanlab
Home Page: https://cleanlab.ai
data-curation,Client interface for all things Cleanlab Studio
Organization: cleanlab
Home Page: https://help.cleanlab.ai/
data-curation,COVID19 Case Report Form Analysis - data and collection forms.
User: cmrn-rhi
data-curation,A curated, but incomplete, list of data-centric AI resources.
User: daochenzha
data-curation,Data Curation, Winter 2021
User: deannalash
Home Page: https://doi.org/10.5281/zenodo.1088086
data-curation,🧼🔎 A holistic self-supervised data cleaning strategy to detect irrelevant samples, near duplicates and label errors.
Organization: digital-dermatology
Home Page: https://arxiv.org/abs/2305.17048
data-curation,A Doctor for your data
Organization: docta-ai
data-curation,Metamapper is a data discovery and documentation platform for improving how teams understand and interact with their data.
Organization: getmetamapper
Home Page: https://www.metamapper.io
data-curation,Codes I wrote for the paper : "Global determinants of freshwater and marine fish genetic diversity" Nature Communications, 2020
User: grelot
Home Page: https://gitlab.mbb.univ-montp2.fr/reservebenefit/worldmap_fish_genetic_diversity
data-curation,Gene Curator is an open-source platform for managing and curating genetic data. It facilitates gene data analysis, entry, and reporting, serving genetics researchers with tools for efficient data handling.
Organization: halbritter-lab
data-curation,Entropy-targeted active learning for bias mitigation in materials data.
User: henrium
Home Page: https://doi.org/10.1063/5.0138913
data-curation,Package that builds a JSON inventory/manifest from public primary or derived datasets
Organization: hubmapconsortium
Home Page: https://portal.hubmapconsortium.org
data-curation,Processing code for Scientific Data Descriptor paper
Organization: icpsr
Home Page: https://doi.org/10.1038/s41597-024-03303-2
data-curation,One of the biggest barriers to widespread machine learning adoption is the difficulty in collecting a 'good' dataset. There is an overall consensus that a 'good' dataset is a big dataset, but we believe that we can do better. As such the VennData project was created to develop tools to guide in the collection, curation, augmentation and validation of data.
Organization: iqtlabs
data-curation,Code and data for "Target-oriented Proactive Dialogue Systems with Personalization: Problem Formulation and Dataset Curation" (EMNLP 2023)
User: iwangjian
data-curation,HISDAC-ES: Creating historical settlement data for Spain (1900-2020) based on cadastral building footprint data
User: johannesuhl
data-curation,data filter routines using numpy
User: khx0
data-curation,Learn2Clean: Optimizing the Sequence of Tasks for Data Preparation and Cleaning
User: laureberti
data-curation,This is a task for Hamoye stage E Internship on Data curation, with focus on Amazon Book website.
User: manuelkila
data-curation,For this human-centered data science project, I analyzed some data on the Gender characteristics of Superheroes and Villains to determine the ratio of female characters that appear in comic books compared to their male counterparts using Matplotlib.
User: mars-aria
data-curation,AqSolDB: A curated aqueous solubility dataset contains 9.982 unique compounds.
User: mcsorkun
Home Page: https://www.amdlab.nl/database/AqSolDB/
data-curation,Curated list of known efforts in collecting and/or curating of chemical/materials data
Organization: neo-chem
data-curation,Scalable data pre processing and curation toolkit for LLMs
Organization: nvidia
data-curation,Curation of BIDS (CuBIDS): A sanity-preserving software package for processing BIDS datasets.
Organization: pennlinc
Home Page: https://cubids.readthedocs.io/
data-curation,Web Scraping & Text Data Collecting and Curating for Maithili Language. Also Language Modeling for collected data.
User: pr-desai2226
data-curation,Some analysis on public datasets [WIP]
User: raulrc
data-curation,Curated list of open source tooling for data-centric AI on unstructured data.
Organization: renumics
Home Page: https://renumics.com
data-curation,A library for detecting problematic data segments in structured and unstructured data with few lines of code.
Organization: renumics
data-curation,Interactively explore unstructured datasets from your dataframe.
Organization: renumics
Home Page: https://renumics.com
data-curation,Analysis of Tweets Dataset using concepts like Data Curation and Data Processing.
User: sahithikodali1
data-curation,TranSMART Arborist: Graphical tool for reshaping your data for the tranSMART data warehouse.
Organization: thehyve
data-curation,tranSMART Arborist ETL toolkit
Organization: thehyve
Home Page: https://pypi.org/project/tmtk/
data-curation,Rebalancing chemical reaction
User: tieulongphan
data-curation,Python tool using the Figshare API for data curation
Organization: ual-re
Home Page: https://ldcoolp-figshare.readthedocs.io
data-curation,Code for data linkage (curation of research database).
Organization: uhbristoldatascience
data-curation,Data Cleaning and Data Profiling Library for Python
Organization: vida-nyu
data-curation,fastdup is a powerful free tool designed to rapidly extract valuable insights from your image & video datasets. Assisting you to increase your dataset images & labels quality and reduce your data operations costs at an unparalleled scale.
Organization: visual-layer
data-curation,The open-source tool for building high-quality datasets and computer vision models
Organization: voxel51
Home Page: https://fiftyone.ai
data-curation,Track model training experiments with MLflow and FiftyOne!
Organization: voxel51
data-curation,Lesson guide and textbook for "Data as a Science" course.
Organization: whythawk
data-curation,
Organization: wolframresearch
data-curation,A tool for downloading from public image boards (which allow scraping) / preview your images & tags / edit your images & tags. Additional tabs for downloading other desired code repositories as well as S.O.T.A. diffusion and auto-tag/caption models for your purposes. Custom datasets can be added!
User: x-ck-x
data-curation,Graph-based NLP framework leveraging a curated database and an intuitive CLI for advanced, context-rich language understanding.
User: yago-mendoza
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.