Comments (5)
Thanks! We will inform you once our paper be out next week!
from glm-130b.
@younesbelkada Hi,
Thanks for your interest! Is it urgent for you add the citation? Our paper is in its final proofreading period, and will be released next week. At that time you can cite our paper : )
If you are immediately about to release some paper or technical report, you can cite this one (only temporarily):
@Article{zeng2022GLM,
title={GLM-130B: An Open Bilingual Pre-Trained Model},
author={Zeng, Aohan and Liu, Xiao and Du, Zhengxiao and Ding, Ming and Zheng, Qinkai and Lai, Hanyu and Wang, Zihan and Yang, Zhuoyi and Yu, Jifan and Zhang, Xiaohan and Zheng, Wendi and Xia, Xiao and Xu, Yifan and Tam, Weng Lam and Dong, Yuxiao and Ma, Zixuan and He, Jiaao and Sun, Zhenbo and Zhai, Jidong and Chen, Wengguang and Zeng, Guoyang and Han, Xu and Zhao, Weilin and Liu, Zhiyuan and Xue, Yufei and Wang, Shan and Shan, Jiecai and Jiang, Haohan and Guo, Zhengang and Zhang, Peng and Tang, Jie},
year={2022},
publisher={Technical report, Tsinghua KEG},
url={ https://keg.cs.tsinghua.edu.cn/glm-130b }
}
from glm-130b.
Thanks a lot for the quick answer 🔥 !
It is not urgent at all, I just wanted to properly cite your work and repo on a technical report, the citation mentioned above should be sufficient ;)
Thanks again!
from glm-130b.
@Xiao9905 hi, the paper is released ?
from glm-130b.
Thanks for your patient waiting! Our paper is now out. You may read it and cite in your later work via:
@article{zeng2022glm130b,
title={GLM-130B: An Open Bilingual Pre-trained Model},
author={Aohan Zeng and Xiao Liu and Zhengxiao Du and Zihan Wang and Hanyu Lai and Ming Ding and Zhuoyi Yang and Yifan Xu and Wendi Zheng and Xiao Xia and Weng Lam Tam and Zixuan Ma and Yufei Xue and Jidong Zhai and Wenguang Chen and P. Zhang and Yuxiao Dong and Jie Tang},
journal={arXiv:2210.02414},
year={2022}
}
from glm-130b.
Related Issues (20)
- 6 cards inference HOT 1
- [Question]GLM-130B模型有vocab文件吗? HOT 1
- GLM-130B 模型结构超参问题
- 关于docs/quantization.md中图片疑问
- 训练目标
- 关于FT inference benchmark数据的疑问
- 每个token耗时呈脉冲式变化
- GLM-130B文档中描述model weights,GPU内存需要260G,测试demo中实际测试总占用在240G左右,请问是什么原因
- 模型并行集群怎么搭建
- 请问GLM可以在输出内容时,同时输出引用内容的来源吗?
- 模型申请页面无法提交申请 HOT 1
- 基于130B有chat版本开源的计划吗?
- 申请邮件收到的模型下载链接都失效了 HOT 5
- FasterTransformer能否支持Glm6B呢
- glm2-130B will it be made? HOT 1
- 请问,课程链接在哪里? HOT 1
- RuntimeError: probability tensor contains either `inf`, `nan` or element < 0answers, answers_with_style, blanks = fill_blanks(raw_text, model, tokenizer, strategy)
- 8卡 fastertransformer 推理报错RuntimeError: [FT][ERROR] Assertion fail: /home/young.ruan/FasterTransformer/src/fastertransformer/th_op/glm/GlmOp.h:539
- 执行bash scripts/generate.sh --input-source interactive时出现的错误。大佬救救! HOT 1
- Clarification Request on GLM-130B Model Architecture and Licensing for Commercial Use
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from glm-130b.