Code Monkey home page Code Monkey logo

chat-doraemon-backup's Introduction

chat-doraemon

项目概述

本项目旨在创建一个基于哆啦A梦形象的桌面萌宠,通过大模型及其他相关技术使其能够与用户进行互动。该萌宠将具备哆啦A梦的语气和个性,并能够通过“四次元口袋”提供丰富的功能。项目计划使用微调大语言模型、Agent系统、检索增强生成(RAG)技术、TTS、Agent微调来实现这一目标。前端呈现考虑使用Unity,短期内将优先使用Gradio或Streamlit展示相关成果。

项目架构

技术路线

数据

  • 对话数据:

    • 大山版台配陈美贞剧集提取

    • 根据“经典语录”生成问题

  • 个人信息数据:用于agent: "rag"道具

  • 经典道具数据:用于agent: 动漫道具

微调

  • 语气个性微调
  • agent微调

agent

  • 动漫道具:例如竹蜻蜓
  • “rag“道具:辅助llm回答,例如用户询问关于“哆啦美喜欢吃什么”的时候会进行相应检索,减少llm回答的错误率
  • 现代工具:例如文生图

TTS

  • 初步锁定GPT-SoVITS

前端

  • 短期内考虑gradio/streamlit
  • 长期考虑unity

快速开始

下载仓库

cd ~
git clone https://github.com/time1527/chat-doraemon.git
cd chat-doraemon/
git submodule update --init --recursive

安装环境

# 在intern-studio:
studio-conda -t doraemon -o pytorch-2.1.2
# 在本地:
# conda create -n doraemon python=3.10 -y
# conda install pytorch==2.1.2 torchvision==0.16.2 torchaudio==2.1.2 pytorch-cuda=11.8 -c pytorch -c nvidia -y
conda activate doraemon
bash setup.sh

运行web demo

# 在第一个终端
conda activate doraemon
bash api_server.sh
# 在第二个终端
conda activate doraemon
cd agent/lagent/examples
streamlit run react_lmdeploy_web_demo.py --server.address 127.0.0.1 --server.port 7860

进展

  • 数据:经典道具数据:2024/05/20
  • agent:动漫道具:2024/05/23
  • agent:现代工具github trend:2024/05/24
  • 数据:人物数据:2024/05/25
  • agent:rag工具:2024/05/27
  • agent:现代工具llm ranker:2024/05/27
  • agent:现代工具remove image background:2024/05/27
  • agent:现代工具文生图:2024/05/28
  • roleplay v0.1:数据:2024/06/02

项目成员

说明:排名不分先后

成员 贡献
SunInCrack agent:现代工具文生图
agent:现代工具翻译【WIP】
roleplay微调剧集数据:大山版台配陈美贞1980.434-456校对
其他校对
time1527 项目策划
agent:现代工具github trend/llm ranker/remove image background
roleplay微调剧集数据:大山版台配陈美贞1980.374-433校对
web demo v0
roleplay其他数据
其他校对
yolo.w agent:动漫道具
agent:rag道具
roleplay微调剧集数据:大山版台配陈美贞1980.457-491校对
actions后处理
web demo v0.1
其他校对

参考

数据

mp4文件提取对话:

https://github.com/LC1332/Chat-Haruhi-Suzumiya/tree/main/yuki_builder

相关人物和工具词条:

https://doraemon.fandom.com/zh/wiki/%E9%A6%96%E9%A0%81?utm_source=fandom-explore-zh

https://duola.huijiwiki.com/wiki/%E8%A7%92%E8%89%B2%E5%88%97%E8%A1%A8

https://baike.baidu.com/item/%E5%93%86%E5%95%A6A%E6%A2%A6/185384

agent

动漫道具:

https://github.com/LC1332/Chat-Haruhi-Suzumiya/blob/main/notebook/Doraemon_Agent.ipynb

agent实现:

https://github.com/InternLM/lagent

致谢

感谢书生浦语

chat-doraemon-backup's People

Contributors

time1527 avatar 626394316 avatar sunincrack avatar

Stargazers

 avatar

Watchers

 avatar

chat-doraemon-backup's Issues

多轮对话数据格式

想问一下'role'和‘content'作为数据集基础特征的话,是否比'role', 'speaker', 'content'和‘emotion'这样的形式更好处理

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.