Light

wenge-research / yayi Goto Github PK

View Code? Open in Web Editor NEW

3.2K 12.0 42.0 157 KB

雅意大模型：为客户打造安全可靠的专属大模型，基于大规模中英文多领域指令数据训练的 LlaMA 2 & BLOOM 系列模型，由中科闻歌算法团队研发。(Repo for YaYi Chinese LLMs based on LlaMA2 & BLOOM)

Home Page: https://www.wenge.com/yayi/index.html

License: Apache License 2.0

Python 100.00%

bloom chinese llama llama2 llm yayi chat lora

yayi's People

Contributors

Stargazers

Watchers

Forkers

jicro lucinao540 djt-hadoop solister00 qiuyu-zhao aimu921 xrishang dalaoyel jucaowei edentime bfcz1 chrisfang3 penguinx damon-wyg shudaixiongliu xysnqdd sunxiaoshou silentfatty snowwanggit wangpin000 project4linwang coder-hou jia1819 jc-wenge wjunzhang zxf864823150 soytn1ce renchauncy allensmile w6022511 ai-jie01 songkq itsharex baifengbai skyrookieyu jiexunniao turbochow vaginessa goneout aichiyouzi988 aiedward lexikuma

yayi's Issues

开源模型训练方式

请问yayi开源的模型，是使用全量参数训练，还是使用lora训练的？使用了多少训练资源？期待您的回答。

支持vllm?

如题，会支持vllm吗？会有更快的推理速度。
https://github.com/vllm-project/vllm/blob/main/vllm/model_executor/models/yi.py

我尝试参照此改写，发现有num_kv_heads 和 share_kv_heads_num 的差异，我尝试改写加载参数时会报错。

他官方的添加方式是这个，https://docs.vllm.ai/en/latest/models/adding_model.html

希望取得联系，并支持 InternLM

尊敬的 YaYi 应用开发者，我是 InternLM 社区开发者&志愿者尖米, 大佬开源的工作对我的启发很大，希望可以探讨使用 InternLM 实现 YaYi 的可能性和实现路径，我的微信是 mzm312，希望可以取得联系进行更深度的交流；

增加openai接口部署吧

参考阿里的：python -m vllm.entrypoints.openai.api_server --model Qwen/Qwen1.5-7B-Chat

网页端[新闻主题分类/主题分类]功能无法使用，无论发什么内容*（包括内置示例）都是涉政不让问

@wenge-research 请教下正确食用方法

可以提供更详细一些的信息吗？

你好，关于你们的模型yayi-13b-llama2，希望能够提供更多的信息。比如我现在能看到词表32005是没有动过的。你们的增量中文预训练数据大约是什么样的量级？有没有拿他跑过类似ceval和cmmlu这类的分数？谢谢！

效果如何，怎么一个提问都没有呢

关于开源数据集

网上说本次开源了部分高质量训练数据 500GB（约100Btoken），请问数据开源在了什么地方？谢谢~

关于C-Eval的测评

您好，感谢您开源YaYi相关模型，我想问一下您在C-Eval榜单上的模型，是您开源的版本么？如果是其他版本，能稍微给些大致信息么？因为我用您开源的YaYi-13B-LLaMa2的模型在C-Eval的测试集测评，效果和榜单差距过大。

yayi大模型没有那种demo部署的吗

yayi大模型没有那种像glm一样的demo部署吗？

请问yayi支持function call功能吗？

如果支持，该怎么使用该功能，如果不支持，可以通过prompt来支持吗？

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.