Comments (4)
Kilian I'd love to hear your thoughts. Just wanted to file this before I forget.
from swe-agent.
Isn't that very similar to the replay model though? Where would we get the commands that fix the issue?
If we had gpt4
solve an issue before, we can take the trajectory and use the replay model to replay everything as if the LM was doing it.
This is the primary means by which I'm testing swe-agent at the moment :) -- though I should add some more trajectories
from swe-agent.
ok let me talk to you about this next time we chat, i think we can make this DummyAgent a bit smarter and therefore it would maybe be better than the replay solution. anyways this is not urgent at all
from swe-agent.
ok I talked with Kilian and one of the tests almost does exactly this and at some point we'll update that test to verify environment outputs too
from swe-agent.
Related Issues (20)
- Failed to clean repository: /bin/bash: line 53: cd: SWE-agent__test-repo: No such file or directory HOT 4
- What is the effect of using gpt-4o? HOT 1
- Is container necessary to run SWE-agent? HOT 3
- Improve error message for missing `--traj_path` in `run_replay`
- Add log output for different installation steps to show activity
- Doc: Speedup recommendations: Persistent container
- run.py: Option to not print out all arguments
- `SWEEnv` refactoring tasks
- env log can be left disabled when initializing multiple `SWEEnv`s HOT 1
- Fix access to log in `SWEEnv`
- Doc: Include `sweagent/agent/README.md`
- Catch errors/warnings in test executions
- `--cache_task_images` vs persistent containers
- Do agent/eval docker containers really need the requirements?
- communicate_with_handling: Better logging
- Doc: Describe manually setting up container either for installation or for speedup
- Allow to use different port for backend HOT 2
- Use faster `conda clone` based method for all setup
- Fix release-dockerhub-release action (incorrect version tag)
- Linter should only block editing because of *new* errors, not pre-existing ones HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from swe-agent.