Overview
This is a demonstration of the roadmap to v1.0.0.
The core objective of this project is to deliver a meticulously designed and sufficiently flexible framework, accompanied by a set of tools to assist developers in rapidly implementing simulation scenarios where multiple LLM agents can interact or compete to fulfill specific needs or tasks with minimal code. Simultaneously, the project incorporates pre-built, diverse simulation scenarios to enable developers to directly test the specific performance of their LLM agents within corresponding contexts, and to compare with other agents implemented by their own or the community.
By the time of the v1.0.0 release, this project will encompass the following features:
- A highly abstract core framework with standardized protocols to creat scene projects.
- A web service that is stable enough to concurrently running multiple scenario simulation tasks.
- Develop scene projects as many as possible.
- Implement popular LLM reasoning methods as many as possible.
- Support popular LLM backends as many as possible.
- Support popular prompting frameworks as many as possible.
Table of Contents
- Core Framework Implementation
- Web Service Implementation
- Scene Projects Development
- LLM Reasoning Methods Implementations
- LLM Backends Supporting
- Prompting Frameworks Supporting
Core framework implementatioin
Implement a meticulously designed, highly abstract core framework where defines all the elements necessary for creating a scene project, providing standardized protocols that accurately identify all scene projects' components in accordance with the specified configurations.
(todo list here)
Web service implementatioin
Implement a stable, high-concurrency web service that offers a range of APIs that facilitate seamless interaction with leaf-playground-webui. It will operate in a containerized manner, concurrently executing multiple scenario simulation tasks.
(todo list here)
Scene projects development
Develop a multitude of scene projects that combine entertainment value and application value to meet various evaluation needs of community users. The results of simulation tasks generated by each scene project should effectively quantify the specific application skills and general abilities of LLM agents.
(todo list here)
LLM reasoning methods implementations
See #9 for more details.
LLM Backends supporting
Support a selection of mainstream LLM backends, and define communication protocols when necessary.
(todo list here)
Prompting frameworks supporting
See #9 for more details.