josef-pkt / misc Goto Github PK
View Code? Open in Web Editor NEWjust a place to store things
just a place to store things
Why do we estimate? What are we using the results for?
Questions while pondering model diagnostics and goodness or lack of fit measures
e.g. Poisson and all LEF estimate the expected value consistently even if the rest is misspecified. But Poisson is not a good model for the distribution of overdispersed data.
When do we care about distributional fit diagnostic tests and measures?
I wrote quite a lot of examples and recipes over time that are "misplaced" somewhere.
Collect and check what's more widely useful and hasn't been included in statsmodels yet.
I sneaked some questions into the statsmodels wiki
https://github.com/statsmodels/statsmodels/wiki/SMEP:-Causality-and-Treatment-Effects
todo: read "Mostly Harmless" and ... and ...
(Imbens and Wooldridge)
Xhiahong Chen handbook article
Towards more generic estimator frameworks
All the above are extremum estimators. GEE is usually formalized as estimating equations. Besides this we can also have estimators that directly specify the estimating equations, with maybe no consistent objective function for a minimum. Examples are in robust estimation with estimating equations for mean and scale specified separately. Maybe feasible GLS would also be in this category as estimation method without the extra MLE interpretation.
Two points: The estimators like OLS, GLM, GEE, RLM solve the estimating equations directly without going through an objective function, OLS, GLM have a (Q)MLE interpretation, RLM is an M-estimator. Even in cases like robust estimators, we could cast them in a exactly identified GMM framework, however similar to FGLS, and GLM/LEF, they exploit asymptotic independence or independence in expectation of the mean and variance terms.
What's the hierarchical tree?
What is common? What is specific?
How can we structure the code with class hierarchies, mixins and function for best code reuse and flexibility?
(Finetuning and special casing can always be done on the lowest level, so that's not directly relevant for the overall structure.)
When or how do we combine different estimators in one model? (e.g. MLE and QMLE, MLE and GMM/OLS, MLE and M-estimator)
example: cov_type
makes MLE into QMLE, OLS/WLS mixes MLE/QMLE and LS/GMM
Components differ whether we want
"Are you pondering what I'm pondering?"
"Deep Thoughts" by J.H. J.P.
some thought on choosing algorithms, algorithm libraries
What's the best approach for block structures, group handling and strata
examples:
kronecker product in system of equations,
balanced or unbalanced panel or cluster data
still pretty unused: direct interval calculation for 1-D, kdtree, ball point
examples:
local regression (lowess, KernelRegression)
matching algorithms: multiple imputation, propensity score matching
Suppose we only have "wrong" models, what's the "best" model and our "best" estimates?
And, how do you define "best"?
Motivation:
Gamma regression: Greene compares MLE versus GMM, where GMM uses overidentified moment conditions. Gamma regression is in GLM/LEF and uses mean and variance assumptions.
preliminaries
So, What's the point?
... ???
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.