anubhavgupta3377 / text-classification-models-pytorch

483.0 14.0 136.0 12.62 MB

Implementation of State-of-the-art Text Classification Models in Pytorch

License: MIT License

Python 100.00%

deep-learning nlp convolutional-neural-networks pytorch fasttext seq2seq rcnn attention transformer classification

text-classification-models-pytorch's Issues

No non-linearity in the fasttext model

In the fasttext model:
def forward(self, x):
    embedded_sent = self.embeddings(x).permute(1,0,2)
    h = self.fc1(embedded_sent.mean(1))
    z = self.fc2(h)
    return self.softmax(z)

Why is there no call to relu on h?
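
For context, two stacked linear layers with no activation between them compose into a single linear map, which is consistent with fastText being described as a linear model over averaged embeddings. The sketch below checks that composition with plain Python lists. The weights and the helper names (linear, matmul) are made up for illustration and are not taken from the repository.

```python
def linear(x, weight, bias):
    """Apply y = W x + b to one vector x (weight is a list of rows)."""
    return [sum(w * xi for w, xi in zip(row, x)) + b
            for row, b in zip(weight, bias)]


def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))]
            for i in range(len(a))]


# Hypothetical weights: fc1 maps 3 -> 2, fc2 maps 2 -> 2.
W1, b1 = [[1.0, 2.0, 0.0], [0.0, -1.0, 3.0]], [0.5, -0.5]
W2, b2 = [[2.0, 1.0], [-1.0, 4.0]], [1.0, 0.0]

x = [1.0, 2.0, 3.0]  # stands in for the averaged sentence embedding

# Two layers applied in sequence, as in forward(): z = fc2(fc1(x)).
z_two_layers = linear(linear(x, W1, b1), W2, b2)

# The same map written as one layer: W = W2 W1, b = W2 b1 + b2.
W = matmul(W2, W1)
b = [v + c for v, c in zip(linear(b1, W2, [0.0, 0.0]), b2)]
z_one_layer = linear(x, W, b)
```

Inserting a ReLU between fc1 and fc2 would make the model genuinely two-layer; whether that improves accuracy is an empirical question this sketch does not answer.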

Why is the RCNN result only about 25%?

I don't know why. I only added this line to config.py:
max_sen_len = None

Can you give me some ideas? Thanks.

seq2seq-Attention: effect of <pad> and batch size at inference

Hi. Even with max_length set to None, a short sentence is still padded to the length of the longest sentence in its batch, which affects both training and prediction results.
For example, when predicting a single sentence, "0516酸菜鱼", the index input is tensor([[241542],
[ 7789],
[ 192],
[ 260]])
However, when the sentence 'Bird&Bird 香港鸿鹄律师事务所北京代表处鸿鹄知识产权代理(北京有限公司)' is added to the batch, the index input changes, where index 1 is "<pad>",
and the output differs:
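
One common way to keep "<pad>" positions from influencing attention is to mask their scores before the softmax. The sketch below is plain Python and is not the repository's code. PAD_IDX = 1 and the four token indices follow the issue text, while masked_softmax, the padded length, and the attention scores are hypothetical.

```python
import math

PAD_IDX = 1  # per the issue, index 1 is "<pad>"


def masked_softmax(scores, token_ids, pad_idx=PAD_IDX):
    """Softmax over scores that gives zero weight to pad positions."""
    keep = [t != pad_idx for t in token_ids]
    if not any(keep):
        raise ValueError("sequence contains only padding")
    # Subtract the max over real tokens for numerical stability.
    top = max(s for s, k in zip(scores, keep) if k)
    exps = [math.exp(s - top) if k else 0.0 for s, k in zip(scores, keep)]
    total = sum(exps)
    return [e / total for e in exps]


# The 4-token sentence alone, and the same sentence padded to length 7.
ids_alone = [241542, 7789, 192, 260]
ids_padded = ids_alone + [PAD_IDX] * 3

scores_alone = [0.2, 1.5, -0.3, 0.9]      # made-up attention scores
scores_padded = scores_alone + [0.7] * 3  # scores at pad positions are arbitrary

weights_alone = masked_softmax(scores_alone, ids_alone)
weights_padded = masked_softmax(scores_padded, ids_padded)
```

With the mask in place, the weights over the real tokens are the same whether or not the sentence is padded. Masking attention does not stop a recurrent encoder from stepping over pad tokens; in PyTorch, torch.nn.utils.rnn.pack_padded_sequence is the usual tool for that part.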

Model_Transformer

Hi, could you please explain the reasoning behind "Only encoder part of Transformer model is used for classification"?
I applied this model to my dataset and the accuracy is only 60%. I wonder whether the number of layers is a cause.

Use of permute in RCNN

On lines 53 and 61 of Text-Classification-Models-Pytorch/Model_RCNN/model.py, the function permute is used.

input_features = torch.cat([lstm_out,embedded_sent], 2).permute(1,0,2)
...
linear_output = linear_output.permute(0,2,1) # Reshaping fot max_pool

Could you please explain why it is necessary or useful to permute the dimensions of these tensors?
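
A shape-only walk-through may help here. It is a sketch under assumptions, not the repository's code: it assumes sequence-first tensors (as the permute calls elsewhere in the repository suggest), a bidirectional LSTM with batch_first=False, and pooling over the full sequence. The helpers permute and cat only track shapes, and all sizes are hypothetical.

```python
def permute(shape, *dims):
    """Return the shape produced by tensor.permute(*dims)."""
    return tuple(shape[d] for d in dims)


def cat(shape_a, shape_b, dim):
    """Return the shape produced by torch.cat([a, b], dim)."""
    assert all(a == b for i, (a, b) in enumerate(zip(shape_a, shape_b))
               if i != dim)
    out = list(shape_a)
    out[dim] += shape_b[dim]
    return tuple(out)


# Hypothetical sizes, chosen only to make each axis recognisable.
seq_len, batch, embed, hidden, hidden_linear = 20, 64, 300, 128, 100

embedded_sent = (seq_len, batch, embed)  # sequence-first embedding output
lstm_out = (seq_len, batch, 2 * hidden)  # assumed bidirectional LSTM output

# First permute: move the batch axis to the front.
input_features = permute(cat(lstm_out, embedded_sent, 2), 1, 0, 2)
# -> (batch, seq_len, 2 * hidden + embed)

# A linear layer only changes the last axis.
linear_output = input_features[:-1] + (hidden_linear,)

# Second permute: put seq_len last, because max_pool1d pools over the
# last axis of a (batch, channels, length) input.
pool_input = permute(linear_output, 0, 2, 1)
# -> (batch, hidden_linear, seq_len)

# Assumed pooling with kernel size seq_len: one value per feature.
pooled = pool_input[:-1] + (1,)
```

Under these assumptions the first permute makes everything downstream batch-first, and the second exists because 1-D pooling treats the last axis as the one to pool over, so the time axis has to be moved there.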

TypeError: forward() missing 1 required positional argument: 'mask'

I am using Model_Transformer.
When I run train.py, I get the following error:
TypeError: forward() missing 1 required positional argument: 'mask'

What is the reason?
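
The message means that some forward method is declared with a mask parameter but is being called without one. The issue does not show which call is responsible, so the following is only a stand-alone reproduction in plain Python; Layer and LayerWithDefault are hypothetical classes, not the repository's modules.

```python
class Layer:
    """Stand-in for a module whose forward() requires a mask."""

    def forward(self, x, mask):
        return [v if keep else 0 for v, keep in zip(x, mask)]

    def __call__(self, *args, **kwargs):
        # Like torch.nn.Module, calling the object dispatches to forward().
        return self.forward(*args, **kwargs)


class LayerWithDefault(Layer):
    """Same layer, but mask is optional and defaults to keeping everything."""

    def forward(self, x, mask=None):
        if mask is None:
            mask = [True] * len(x)
        return super().forward(x, mask)


try:
    Layer()([1, 2, 3])  # mask omitted: raises TypeError
    error_message = None
except TypeError as exc:
    error_message = str(exc)

result = LayerWithDefault()([1, 2, 3])  # works without a mask
```

Either passing the mask at the call site or giving it a default removes the error; which is appropriate depends on the calling code.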

RuntimeError: index out of range

Hi, when I run train.py in the "textCNN model" folder, I get an error and can't figure out what is wrong. Can you help me? Thank you very much.

Loaded 96000 training examples
Loaded 7600 test examples
Loaded 24000 validation examples
Epoch: 0
Traceback (most recent call last):
  File "/Users/y/Documents/Code/Text-Classification-Models-Pytorch-master/Model_TextCNN/train.py", line 43, in <module>
    train_loss,val_accuracy = model.run_epoch(dataset.train_iterator, dataset.val_iterator, i)
  File "/Users/y/Documents/Code/Text-Classification-Models-Pytorch-master/Model_TextCNN/model.py", line 85, in run_epoch
    y_pred = self.__call__(x)
  File "/anaconda3/python.app/Contents/lib/python3.6/site-packages/torch/nn/modules/module.py", line 489, in __call__
    result = self.forward(*input, **kwargs)
  File "/Users/y/Documents/Code/Text-Classification-Models-Pytorch-master/Model_TextCNN/model.py", line 45, in forward
    embedded_sent = self.embeddings(x).permute(1,2,0)
  File "/anaconda3/python.app/Contents/lib/python3.6/site-packages/torch/nn/modules/module.py", line 489, in __call__
    result = self.forward(*input, **kwargs)
  File "/anaconda3/python.app/Contents/lib/python3.6/site-packages/torch/nn/modules/sparse.py", line 118, in forward
    self.norm_type, self.scale_grad_by_freq, self.sparse)
  File "/anaconda3/python.app/Contents/lib/python3.6/site-packages/torch/nn/functional.py", line 1454, in embedding
    return torch.embedding(weight, input, padding_idx, scale_grad_by_freq, sparse)
RuntimeError: index out of range at /Users/administrator/nightlies/pytorch-1.0.0/wheel_build_dirs/conda_3.6/conda/conda-bld/pytorch_1544137972173/work/aten/src/TH/generic/THTensorEvenMoreMath.cpp:191

Process finished with exit code 1
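
An embedding lookup raises this error when an input index falls outside the embedding table, that is, when it is negative or at least the number of rows in the table. The traceback does not show which index is responsible. A minimal check like the one below, run on a batch before it reaches the model, can locate it. The function name and the example values are hypothetical.

```python
def out_of_range_indices(batch, num_embeddings):
    """Return the sorted token indices that a table of this size cannot look up."""
    return sorted({idx for row in batch for idx in row
                   if idx < 0 or idx >= num_embeddings})


# Hypothetical values: a table with 5 rows accepts indices 0..4 only.
num_embeddings = 5
batch = [[0, 3, 4], [2, 5, 1], [7, 0, 0]]

bad = out_of_range_indices(batch, num_embeddings)
```

A non-empty result suggests that the vocabulary used to index the text is larger than the embedding table the model was built with.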

NLLLoss

Hi, thanks for your code! However, I think there might be a bug with NLLLoss. According to the documentation (https://pytorch.org/docs/stable/generated/torch.nn.NLLLoss.html), nn.NLLLoss() expects log-probabilities, i.e. the output of a log-softmax. Here, model.py applies only softmax, so the loss is always negative. I have only checked the Transformer model.

Kind regards,
John
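
The sign problem described above can be reproduced with a few lines of arithmetic. For a single sample, NLLLoss returns minus the input value at the target index, so feeding it probabilities gives a value between -1 and 0, while feeding it log-probabilities gives the usual non-negative cross-entropy. The sketch below uses only the standard library; the logits and the helper names are made up.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of scores."""
    top = max(logits)
    exps = [math.exp(v - top) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def nll(inputs, target):
    """What NLLLoss computes for one sample: minus the input at the target index."""
    return -inputs[target]


logits, target = [2.0, 0.5, -1.0, 0.1], 0  # made-up scores for 4 classes
probs = softmax(logits)
log_probs = [math.log(p) for p in probs]

loss_on_softmax = nll(probs, target)          # between -1 and 0
loss_on_log_softmax = nll(log_probs, target)  # non-negative cross-entropy
```

In PyTorch the two standard pairings are nn.LogSoftmax followed by nn.NLLLoss, or raw logits passed to nn.CrossEntropyLoss.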

ag_news.train file

Hi,
What are the files named "ag_news.train" and "ag_news.test"?
