junnyu / chinesebert_pytorch

14.0 14.0 1.0 113 KB

huggingface ChineseBert Tokenizer

License: MIT License

Python 100.00%

chinesebert_pytorch's Introduction

Hi there 👋

🔭 I’m currently working on pytorch2paddle.
🌱 I’m currently learning NLP.
💬 Ask me about pytorch2paddle.
📫 How to reach me:
- Email: [email protected]
- WeChat: yjcatlover

chinesebert_pytorch's People

Forkers

sanforfive

chinesebert_pytorch's Issues

How can I pass a list as input, e.g. [cls, 我, 是, 谁, sep, 我, 是, 谁, sep]?

How should input that is already a list be handled? For example, MRC tasks require inserting special tokens.
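
A minimal sketch of how such a list might be assembled by hand, assuming character-level tokens and the standard BERT `[CLS]`/`[SEP]` markers. `build_pair_tokens` and `build_segment_ids` are hypothetical helpers, not part of this repository, and converting the tokens to IDs would still require the tokenizer's vocabulary.

```python
from typing import List

CLS, SEP = "[CLS]", "[SEP]"


def build_pair_tokens(question: str, context: str) -> List[str]:
    """Split both texts into characters and wrap them with special tokens."""
    return [CLS, *question, SEP, *context, SEP]


def build_segment_ids(question: str, context: str) -> List[int]:
    """Segment 0 covers [CLS] + question + [SEP]; segment 1 covers context + [SEP]."""
    return [0] * (len(question) + 2) + [1] * (len(context) + 1)


tokens = build_pair_tokens("我是谁", "我是谁")
segment_ids = build_segment_ids("我是谁", "我是谁")
print(tokens)
print(segment_ids)
```

The segment IDs follow the usual BERT sentence-pair convention; whether this repository's tokenizer accepts pre-split input directly is not stated in the issue.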

Glyph embedding

Has the glyph embedding been simplified?

How can this be used for text classification?

Error when tokenizing in batches

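Hello, how can the code be modified to reproduce the models from the paper that drop the glyph or the pinyin?

In the paper, the pinyin or the glyph can be removed before the fusion embedding. How should the code be modified to do this?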
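
One way to picture such an ablation, as a sketch only: assume the fusion embedding is formed by concatenating the per-character embeddings and applying a linear projection, and simply leave one source out of the concatenation. The `fuse` helper, the toy dimensions, and the weight values below are hypothetical and are not taken from this repository.

```python
from typing import Dict, List, Sequence

Vector = List[float]


def fuse(embeddings: Dict[str, Vector], keep: Sequence[str], weight: List[Vector]) -> Vector:
    """Concatenate the selected embeddings and project them with `weight`.

    `weight` has one row per output dimension; each row must be as long as
    the concatenated input.
    """
    concat = [x for name in keep for x in embeddings[name]]
    if any(len(row) != len(concat) for row in weight):
        raise ValueError("weight rows must match the concatenated input size")
    return [sum(w * x for w, x in zip(row, concat)) for row in weight]


# Toy 2-dimensional embeddings for a single character (illustrative values).
embeddings = {
    "char": [1.0, 2.0],
    "glyph": [3.0, 4.0],
    "pinyin": [5.0, 6.0],
}

# Ablation: drop the glyph embedding, so the projection sees 4 inputs instead of 6.
weight_no_glyph = [
    [1.0, 0.0, 1.0, 0.0],
    [0.0, 1.0, 0.0, 1.0],
]
fused = fuse(embeddings, keep=("char", "pinyin"), weight=weight_no_glyph)
print(fused)
```

Under this assumption the projection layer has to be resized for the smaller input, so a pretrained fusion layer would not load unchanged.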

Are the model parameters the same as the official ones?

Is the loaded pytorch_model.bin the same as the official one?
Are the model parameters identical?
Thanks!

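In the tnew example, where is the code for the imported dataset?

Hello, and thank you very much for sharing the code. A question:
import datasets
from datasets import load_dataset, load_metric
What is the datasets code imported on lines 21 and 22?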

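How do I use BertTokenizer?

Hello! First of all, thank you very much for sharing your code.
Is there a big difference between BertTokenizer and BertTokenizerFast? If I want to use BertTokenizer, how should I construct it?
I know that a tokenizer is initialized with .from_pretrained(), so how do I build a tokenizer that uses BertTokenizer?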

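How is the glyph information handled?

In the original paper, the glyph information comes from rendering each character in three fonts, i.e. three 24x24 images. How are these images produced here?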
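
As a rough illustration of the representation described in the question, three 24x24 bitmaps for one character can be flattened into a single 1728-value vector. This is only a sketch: `flatten_glyphs` is a hypothetical helper, the bitmaps below are placeholder values rather than real font renderings, and the issue does not say how this repository obtains the images.

```python
from typing import List

Bitmap = List[List[float]]  # 24 rows x 24 columns of pixel intensities

SIZE = 24


def flatten_glyphs(bitmaps: List[Bitmap]) -> List[float]:
    """Flatten the per-font bitmaps of one character into a single vector."""
    for bitmap in bitmaps:
        if len(bitmap) != SIZE or any(len(row) != SIZE for row in bitmap):
            raise ValueError("each bitmap must be 24x24")
    return [pixel for bitmap in bitmaps for row in bitmap for pixel in row]


# Placeholder bitmaps standing in for renderings of one character in three fonts.
fonts = [[[float(i)] * SIZE for _ in range(SIZE)] for i in range(3)]
vector = flatten_glyphs(fonts)
print(len(vector))  # 3 * 24 * 24 = 1728
```

Producing real bitmaps would additionally need the font files and an image library to rasterize each character.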

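Hello, a question about using ChineseBERT

Hello, I would like to use ChineseBERT to generate character vectors for my own NER task, but in practice it is not as general-purpose as the BERT model. Could you explain?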

Can this load the large model?

How do I load the large model? Is it enough to modify config.json?
Which parts of config.json need to be changed?
