Comments (5)
I'm writing a tutorial on the RowProcessor at the moment; it should land in a public branch sometime early next week.
In general, yes, we do plan to build out the docs, focusing initially on the areas people are flagging. Next up after the RowProcessor tutorial is one on third-party model loading using ONNXExternalModel and XGBoostExternalModel. We'll add TensorFlow to that tutorial once Tribuo has been migrated over to the recent tensorflow-java 0.2.0 release, as that will change how Tribuo's TF interface works.
from tribuo.
Yep, but to process CSVs where you don't want to use all the columns you need to use RowProcessor rather than the simple CSVLoader. If you instantiate a CSVDataSource you can pass it a RowProcessor instance, which controls how columnar data is parsed. The RowProcessor accepts a fieldProcessorMap, which gives the mapping between feature fields and the processor used to convert the String in that field into a list of Tribuo features, and a ResponseProcessor, which you can use to pick the response column. The RowProcessor is very flexible, so there are lots of other parameters, but that should be sufficient to get you going.
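To make the idea above concrete, here is a rough sketch in plain Java of a map from field name to field processor plus a chosen response column. It deliberately does not use Tribuo: the class RowProcessorSketch, the FieldProc interface, the NUMERIC and CATEGORICAL processors, the "field@value" feature naming, and the column names are all hypothetical stand-ins, so treat it as an illustration of the concept rather than Tribuo's API.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.TreeMap;

public class RowProcessorSketch {

    /** Hypothetical stand-in: turns the raw String in one field into named feature values. */
    interface FieldProc {
        Map<String, Double> process(String fieldName, String rawValue);
    }

    /** Parses the field as a number, producing one feature named after the field. */
    static final FieldProc NUMERIC =
            (name, raw) -> Map.of(name, Double.parseDouble(raw));

    /** Treats the field as categorical, producing a one-hot feature "field@value". */
    static final FieldProc CATEGORICAL =
            (name, raw) -> Map.of(name + "@" + raw, 1.0);

    /** One processed row: the extracted features plus the value of the response column. */
    static final class Example {
        final Map<String, Double> features;
        final String response;

        Example(Map<String, Double> features, String response) {
            this.features = features;
            this.response = response;
        }
    }

    /**
     * Processes a single row. Only fields listed in fieldProcessorMap become
     * features; every other column (apart from the response) is ignored.
     */
    static Example processRow(Map<String, String> row,
                              Map<String, FieldProc> fieldProcessorMap,
                              String responseField) {
        Map<String, Double> features = new TreeMap<>();
        for (Map.Entry<String, FieldProc> entry : fieldProcessorMap.entrySet()) {
            String raw = row.get(entry.getKey());
            if (raw != null && !raw.isEmpty()) {
                features.putAll(entry.getValue().process(entry.getKey(), raw));
            }
        }
        return new Example(features, row.get(responseField));
    }

    public static void main(String[] args) {
        // A row with four columns; "id" is never mapped, so it is dropped.
        Map<String, String> row = Map.of(
                "id", "17", "height", "1.5", "colour", "red", "species", "setosa");

        Map<String, FieldProc> fieldProcessorMap = new LinkedHashMap<>();
        fieldProcessorMap.put("height", NUMERIC);
        fieldProcessorMap.put("colour", CATEGORICAL);

        Example example = processRow(row, fieldProcessorMap, "species");
        System.out.println(example.features + " -> " + example.response);
    }
}
```

Columns are skipped simply by leaving them out of the map, and non-double values are handled by choosing a different processor for that field. The real Tribuo classes have more options than this, so check the javadoc for the actual constructors.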
from tribuo.
Great, this is very clear. I appreciate it.
from tribuo.
I'll add docs to the RowProcessor constructors in the docs pass this week.
from tribuo.
Hi @Craigacp - I'm in the same boat, trying to get started with Tribuo and some basic CSV files with non-double Feature values. I've been working my way through the library but it would be great to have some code samples using RowProcessor etc. - had a quick look in the docs and tests but couldn't see anything obvious. Are there plans to build out the docs or have I just missed something?
from tribuo.