Comments (4)
This error can be raised under two conditions:
- your dataset has missing values and/or categorical features
- you are using an old version of the LightGBM library (at some point they slightly changed the model file format, and this change broke the reader. I fixed it by complying with the new format, but the old one no longer works).
If you are in the second case, updating LightGBM to the latest version should be enough. If you are in the first case, rankeval currently does not support missing values or categorical features, and I'm not sure when these features will be added. Both features are framework-dependent, while rankeval should be agnostic to the framework adopted.
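To tell which of the two cases applies, a quick check along these lines may help. This is only a minimal sketch and not part of rankeval: `count_missing_per_feature` is a hypothetical helper, and it assumes the features are available as a dense list of rows in which missing entries are encoded as NaN.

```python
import math


def count_missing_per_feature(rows):
    """Map each feature index to its number of NaN entries (columns without NaN are omitted)."""
    missing = {}
    for row in rows:
        for j, value in enumerate(row):
            # NaN is the only float that fails math.isnan's inverse; ints are skipped.
            if isinstance(value, float) and math.isnan(value):
                missing[j] = missing.get(j, 0) + 1
    return missing


# Toy data, purely illustrative: 3 documents, 3 features.
X = [
    [0.5, float("nan"), 3.0],
    [1.2, 0.7, float("nan")],
    [0.1, float("nan"), 2.0],
]
print(count_missing_per_feature(X))
```

An empty result suggests the dataset itself is not the cause, in which case the installed library version (readable from `lightgbm.__version__`) is the next thing to look at.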
from rankeval.
If I fill the missing values with '-999', will it work?
Also, regarding categorical variables: LightGBM supports them, so why doesn't rankeval support categorical variables?
I'm using lightgbm 2.3.0. I'm in the first case and have missing values and categorical data. As @anuragreddygv323 asks, can I transform my input to numerical and still have a reasonable comparison?
Machine learning algorithms, especially those based on decision trees, handle missing values differently from ordinary values. So, to answer your question: if you modify the dataset by replacing the missing values, you also need to refit the model accordingly. However, doing so could negatively affect the performance of the final model. Transforming categorical data into numerical features is, on the other hand, not possible, since categories are discrete and unordered, while traditional features are continuous and impose an ordering.
Regarding categorical variables, rankeval still does not support them because the standard learning-to-rank datasets (WEB30K, Istella, Yahoo) do not contain this kind of variable. But we could start thinking about introducing this feature in rankeval sooner or later.
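The point about refitting can be sketched as follows. This is an illustration under stated assumptions, not a rankeval or LightGBM API: `fill_missing` and `SENTINEL` are hypothetical names, the data is assumed to be a dense list of rows with NaN marking missing entries, and the `-999` value is simply the one proposed in the question above.

```python
import math

SENTINEL = -999.0  # illustrative value taken from the question, not a recommendation


def fill_missing(rows, sentinel=SENTINEL):
    """Return a copy of rows with every NaN replaced by the sentinel."""
    return [
        [sentinel if isinstance(v, float) and math.isnan(v) else v for v in row]
        for row in rows
    ]


# Toy data, purely illustrative.
X_train = [[0.5, float("nan")], [1.2, 0.7]]
X_test = [[float("nan"), 0.3]]

# Apply the same replacement to every split.
X_train_filled = fill_missing(X_train)
X_test_filled = fill_missing(X_test)

# The model must then be retrained on X_train_filled: a model fitted on the
# original data learned its own handling of NaN, which the sentinel bypasses.
```

Scoring an already trained model on the filled data would compare it on inputs it never saw during training, so the resulting numbers would not be a fair comparison.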