Comments (9)
同求,多轮对话形式,及对应的代码,谢谢
from cogvlm.
能否指点一下,谢谢
from cogvlm.
是微调吗?
from cogvlm.
是的,微调。
问1,答1,问2,答2
应该如何设计数据格式和训练模式。
from cogvlm.
多轮对话是采用这个吗 chat_old_history_to_prompt
如果数据标签为图文对:问1,答1,问2,答2
chat_old_history_to_prompt生成prompt=问1,答1,问2,预测结果与答2计算loss ?
这样的一条数据:问1,答1,问2,答2。要在网络里面训练几次?
第一次:训练 prompt=问1,第二次训练prompt=问1,答1,问2 ?
对于dataset.py有应该如何读取多轮对话数据标签:
from cogvlm.
能否详细解释一下,非常感谢
from cogvlm.
能否详细解释一下,非常感谢
from cogvlm.
能否详细解释一下,非常感谢
from cogvlm.
能否解答一下,非常感谢
from cogvlm.
Related Issues (20)
- 关于模型量化 HOT 14
- GPU selection / multi-GPU HOT 2
- Deploy HOT 2
- Chat using one image and three prompt HOT 1
- CogVLM是开放中文模型了吗,开源模型是否已经支持中文提问回答以及中文数据微调呢? HOT 1
- [CogVLM-chat-v1.1] LM weights are different with vicuna-7b-v1.5 HOT 3
- Running Gradio app locally results in inappropriate error: "NETWORK ERROR DUE TO HIGH TRAFFIC. PLEASE REGENERATE OR REFRESH THIS PAGE." HOT 1
- Using CogVLM as an API HOT 1
- Code of finetuning the cogagent on Mind2Web ? HOT 1
- Deploy CogVLM using Docker
- Could we replace the vicuna-7b directly with stronger llm? HOT 1
- 我想用同样的promt,在每次都清除上下文的情况下得到3种答案,为什么结果都是一样的 HOT 2
- Chat with PDF documentation instead of images
- CogAgent 视觉预训练模型 EVA2-CLIP-L HOT 1
- CogVLM源代码是否支持多轮对话训练 HOT 5
- 关于模型视觉定位原理
- 运行微调脚本报错缺少相关参数 HOT 2
- 如何构建CogAgent的微调数据集? HOT 1
- 两张3090微调CogVLM的可能性? HOT 1
- 加载cogvlm-chat-hf模型报错 Error while deserializing header: MetadataIncompleteBuffer
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from cogvlm.