Code Monkey home page Code Monkey logo

tencent2020_rank1st's People

Contributors

guoday avatar guody5 avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

tencent2020_rank1st's Issues

package的版本

想请教下,你们当时运行这份代码使用的包的版本!感谢!

我这边在运行的时候,会因为包的版本报一些错误,例如gensim

预训练词向量频次过滤问题

您好,我看 src/w2v.py预训练中没有通过词频对出现的各种id进行过滤,直接min_count=1,请问这是因为min_count=1的结果最好还是为了与后续bert训练保持词表一致?或者还是其他的原因?然后请教下这个min_count参数的设置有什么经验吗?谢谢!!!

请问模型config中的vocab_size=5表示什么意思?

我看到在预训练和训练模型的encoder中都有config.vocab_size=5这个参数,请问这个表示什么意思和什么作用?在后面找了很久也没有找到用到这个的地方,而且输入encoder的直接是embeding向量吧,所以也没有用到bert encoder里面的embeding。那么这个vocab_size似乎是没用到?

有关模型细节的几个疑问

有几个疑问希望郭大能答疑解惑一下!

  1. Bert预训练时产生的输入端词典和输出端词典都是至多保留10W个ID,并且在分类时也是使用的相同词典,是不是可以理解为只使用了高频的ID进行预训练和分类。

  2. Bert预训练时,MASK、PAD、UNK 所对应的 embedding 是否都是 全0 。如果是的话,这三个 special token 成为等价的话,会不会影响预训练。

  3. Bert预训练时,输入端拼接了 word2vec 和 一个 16 维的 embeddings,这个16维的embeddings 是起到一个什么作用。

关于最后的结果

请问,最终要预测的年龄看起来是个连续的整数,但是看模型最后输出的是20个类别的概率,请问这是怎么对应上的

请问python的版本

hello,您好。我使用python 3.6 加载下载的w2v的table会报错如下:
File "run.py", line 94, in
args.embeddings_tables[x[0]] = gensim.models.KeyedVectors.load_word2vec_format(x[0], binary=False)
File "/Users/dwing/anaconda3/lib/python3.6/site-packages/gensim/models/keyedvectors.py", line 1549, in load_word2vec_format
limit=limit, datatype=datatype)
File "/Users/dwing/anaconda3/lib/python3.6/site-packages/gensim/models/utils_any2vec.py", line 276, in _load_word2vec_format
header = utils.to_unicode(fin.readline(), encoding=encoding)
File "/Users/dwing/anaconda3/lib/python3.6/site-packages/gensim/utils.py", line 368, in any2unicode
return unicode(text, encoding, errors=errors)
UnicodeDecodeError: 'utf-8' codec can't decode byte 0x80 in position 0: invalid start byte

将load_word2vec_format 函数改成load 的时候,加载data/sequence_text_user_id_product_id.128d时报错:
File "run.py", line 95, in
args.embeddings_tables[x[0]] = gensim.models.KeyedVectors.load(x[0])
File "/Users/dwing/anaconda3/lib/python3.6/site-packages/gensim/models/keyedvectors.py", line 1553, in load
model = super(WordEmbeddingsKeyedVectors, cls).load(fname_or_handle, **kwargs)
File "/Users/dwing/anaconda3/lib/python3.6/site-packages/gensim/models/keyedvectors.py", line 228, in load
return super(BaseKeyedVectors, cls).load(fname_or_handle, **kwargs)
File "/Users/dwing/anaconda3/lib/python3.6/site-packages/gensim/utils.py", line 435, in load
obj = unpickle(fname)
File "/Users/dwing/anaconda3/lib/python3.6/site-packages/gensim/utils.py", line 1398, in unpickle
return _pickle.load(f, encoding='latin1')
OSError: [Errno 22] Invalid argument

Word2Vector和BERT权重必须一致是什么意思

恭喜大佬再次喜提冠军, Word2Vector和BERT权重必须一致是什么意思?


这里提供两种方式获得预训练权重: 重新预训练或下载预训练好的权重

注: Word2Vector和BERT权重必须一致,即要么全部重新预训练,要么全部下载

vocab_dim_v1

请教一下,vocab_dim_v1=64是什么维度

尊敬的作者,请问提供的bert-small是否经过预训练?

非常感谢您愿意开源,感谢您精彩的工作,请问您的bert-small是否经过预训练(因为您的注解里面提到bert-small需要经过预训练)
不知道能否浪费您宝贵的时间,麻烦您发送一下经过预训练的bert-small,菜鸡的配置比较低(内存比较64G,显存24G)[email protected]
再次感谢您出色的工作
另能否小声提个建议,您上传资源能否选择国内的运营商,因为Google drive需要科学上网

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.