athenabot / k8s-issues Goto Github PK
View Code? Open in Web Editor NEWLicense: MIT License
License: MIT License
Keyword matching is quite limited, as so much of determining issue ownership is contextual. For example:
My skills in ML are very limited, so I would appreciate help if anyone feels like experimenting! issue #5 may help.
fejta-bot automatically cycles issues through a lifecycle, which ends in closing. Users can manually reset the lifecycle, or set a hold (which stops the issue from degrading).
When users set a lifecycle to counter fejta-bot, they usually want to keep the issue open indefinitely. Athena should automatically put lifecycle/frozen on an issue if a user sets the lifecycle following a fejta-bot lifecycle action.
blocks of code
throw off keyword analysis, especially when there's huge chunks of kubectl or YAML.
String matching leads to false positives.
EG service account -> service
EG cloud provider specifics means the issue is more likely, a cloud provider one and less likely to be sig-network, sig-auth, etc.
Add anti-match fields to sigs, which subtract likeliness upon match.
Clean up the structure of classification code. Functions should have a clear purpose, and not decide "too much". A good flow might be:
Check issue is open -> Decide scores -> Translate scores to labels-> Remove bad/duplicate labels
I started a basic Google Firestore class for posting data, but willing to take other integrations if they're easy to manage.
The k8s-issue bot should include classification debug data, issue data, and what the final comment was. Enough info that we can later pull stats and make comparisons w/r/t effectiveness. It would also be nice to check up to the "previously seen issue", rather than blindly running against the backlog.
Just saw @athenabot comment on this issue: kubernetes/kubernetes#75263
While it's true that vSphere is a vmware product, I'm pretty sure the bug is Windows-related, and not vmware related. Maybe a general rule for the bot should be: "don't add sigs to anything that already has them"? It has the potential to be more annoying than useful if humans have already considered which SIG something belongs to and a lil bot comes along and disagrees :)
The bot keeps matching sig-node on issues that are a tenuous match, or are completely incorrect. Contributing factors:
When people categorize an issue, they usually do it correctly. Upwards of 80% of comments on issues with sigs are wrong.
The code already biases against commenting on issues with sigs, but that bias is not proving effective.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.