
absa_keras's Introduction

Aspect-based Sentiment Analysis

Keras implementation (TensorFlow backend) of aspect-based sentiment analysis models

Models

The following models are implemented (see the Performance section below for results):

  • TD-LSTM / TC-LSTM
  • AE-LSTM / AT-LSTM / ATAE-LSTM
  • MemNet
  • RAM
  • IAN
  • Cabasc

Pre-processing

  1. Download the GloVe embeddings (glove.42B.300d.zip), unzip the archive, and put the text file in raw_data:
wget https://nlp.stanford.edu/data/wordvecs/glove.42B.300d.zip
unzip glove.42B.300d.zip
mv glove.42B.300d.txt ./raw_data
  2. Run the pre-processing scripts (a sketch of what they produce follows below):
python3 process_raw.py
python3 preprocess.py
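
For reference, a hypothetical sketch of what the pre-processing produces: preprocess.py builds an embedding matrix from the GloVe file and saves per-dataset files such as ./data/twitter/word_glove.npy, which train.py later loads. The helper below is an assumption for illustration, not the repo's actual code:

import numpy as np

def build_embedding_matrix(vocab, glove_path='raw_data/glove.42B.300d.txt', embed_dim=300):
    # Read the whitespace-separated GloVe vectors into a dict.
    embeddings = {}
    with open(glove_path, encoding='utf-8') as f:
        for line in f:
            parts = line.rstrip().split(' ')
            embeddings[parts[0]] = np.asarray(parts[1:], dtype='float32')
    # Row i holds the vector of the word with index i; OOV words stay zero.
    matrix = np.zeros((len(vocab) + 1, embed_dim))
    for word, idx in vocab.items():
        if word in embeddings:
            matrix[idx] = embeddings[word]
    return matrix

# e.g. np.save('./data/twitter/word_glove.npy', build_embedding_matrix(vocab))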

Training

python3 train.py
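
Which datasets and models get trained is controlled by the train_model(...) calls at the bottom of train.py, plus two config flags. A minimal sketch, based on the excerpts quoted in the issues below (treat the exact signature as approximate):

# inside train.py
config.word_embed_trainable = False    # keep GloVe word embeddings fixed
config.aspect_embed_trainable = False  # keep aspect embeddings fixed

# train_model(data_folder, data_name, level, model_name)
train_model('laptop/term', 'laptop', 'word', 'atae_lstm')
train_model('twitter', 'twitter', 'word', 'td_lstm')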

Environment

  • python==3.6.4
  • keras==2.2.4
  • nltk==3.2.5
  • tensorflow==1.6.0

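Assuming an existing Python 3.6 environment, the pinned packages can be installed with pip:

pip install keras==2.2.4 nltk==3.2.5 tensorflow==1.6.0
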
Data Analysis

See data_analysis.md.

Performance

Note: results in ( ) are the performances of models with word embeddings fixed but aspect embeddings fine-tuned; results in [ ] are the performances of models with both word embeddings and aspect embeddings fixed.

  • Accuracy
model     | laptop (paper) | laptop (ours)            | restaurant (paper) | restaurant (ours)       | twitter (paper) | twitter (ours)
----------|----------------|--------------------------|--------------------|-------------------------|-----------------|-------------------------
td_lstm   | -              | 0.69905(0.7194)          | -                  | 0.7732(0.8008)          | 0.708           | 0.69219(0.7109)
tc_lstm   | -              | 0.69122(0.6912)[0.7178]  | -                  | 0.7642(0.7696)[0.79375] | 0.715           | 0.71387(0.7124)[0.72543]
ae_lstm   | 0.689          | 0.69122(0.6974)[0.69749] | 0.766              | 0.7651(0.7625)[0.78125] | -               | 0.68497(0.68641)[0.6820]
at_lstm   | -              | 0.69122(0.6880)[0.70689] | -                  | 0.7678(0.7812)[0.78571] | -               | 0.67196(0.7052)[0.70086]
atae_lstm | 0.687          | 0.69749(0.6927)[0.70376] | 0.772              | 0.7732(0.7732)[0.78482] | -               | 0.66907(0.6965)[0.6921]
memnet    | 0.7237         | 0.52821(0.5266)[0.53448] | 0.8095             | 0.6580(0.65)[0.65803]   | -               | 0.57369(0.5751)[0.5780]
ram       | 0.7449         | 0.70376(0.7225)[0.72727] | 0.8023             | 0.7937(0.8071)[0.81607] | 0.6936          | 0.69653(0.71387)[0.6979]
ian       | 0.721          | 0.6865(0.7100)           | 0.786              | 0.7732(0.7821)          | -               | 0.68208(0.71965)
cabasc    | 0.7507         | 0.6974(0.6990)[0.67554]  | 0.8089             | 0.7919(0.8080)[0.80357] | 0.7153          | 0.69219(0.69508)[0.6690]
  • Macro-F1
model     | laptop (paper) | laptop (ours)            | restaurant (paper) | restaurant (ours)       | twitter (paper) | twitter (ours)
----------|----------------|--------------------------|--------------------|-------------------------|-----------------|-------------------------
td_lstm   | -              | 0.64172(0.6636)          | -                  | 0.6653(0.6986)          | 0.690           | 0.6746(0.6898)
tc_lstm   | -              | 0.62847(0.6335)[0.6666]  | -                  | 0.6651(0.6492)[0.70170] | 0.695           | 0.6983(0.6848)[0.7122]
ae_lstm   | -              | 0.62486(0.6435)[0.6355]  | -                  | 0.6419(0.6370)[0.68319] | -               | 0.66644(0.6670)[0.65437]
at_lstm   | -              | 0.61267(0.6259)[0.65907] | -                  | 0.6630(0.6689)[0.6676]  | -               | 0.6553(0.68194)[0.6823]
atae_lstm | -              | 0.6246(0.63185)[0.6539]  | -                  | 0.6505(0.6657)[0.68006] | -               | 0.6651(0.67400)[0.67085]
memnet    | -              | 0.38574(0.4118)[0.2322]  | -                  | 0.3547(0.4040)[0.3395]  | -               | 0.48982(0.49209)[0.4989]
ram       | 0.7135         | 0.6474(0.6747)[0.66476]  | 0.7080             | 0.6887(0.7035)[0.72720] | 0.6730          | 0.66536(0.69679)[0.6685]
ian       | -              | 0.62007(0.6604)          | -                  | 0.6550(0.6768)          | -               | 0.65071(0.69550)
cabasc    | -              | 0.64601(0.6356)[0.5886]  | -                  | 0.6815(0.7201)[0.69357] | -               | 0.66790(0.67948)[0.6610]
  • Combining with ELMo Embedding

There are two ways to use ELMo in this project. Before training a model, change the configuration as follows:

  1. In config.py, set self.use_elmo to True and self.elmo_hub_url to None or https://tfhub.dev/google/elmo/2. When you run train.py, TensorFlow Hub will download the ELMo module and cache it in a local temporary directory, but the cached module will be deleted after a machine reboot.
  2. So, I recommend loading the ELMo module locally. First, download ELMo from https://tfhub.dev/google/elmo/2?tf-hub-format=compressed. Then rename the file to tfhub_elmo_2 (or whatever you want), untar it, and put it in the raw_data directory. The resulting layout is shown in the elmo_dir screenshot.
    Don't forget to set self.use_elmo in config.py to True and self.elmo_hub_url to raw_data/tfhub_elmo_2 (the directory you just created); a configuration sketch follows this list.
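
A minimal sketch of the relevant part of config.py (only the use_elmo and elmo_hub_url attributes come from this README; the surrounding class is assumed):

class Config:
    def __init__(self):
        # Option 1: let TensorFlow Hub download and cache the ELMo module.
        self.use_elmo = True
        self.elmo_hub_url = 'https://tfhub.dev/google/elmo/2'  # or None
        # Option 2 (recommended): point at the locally extracted module.
        # self.elmo_hub_url = 'raw_data/tfhub_elmo_2'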

Here are the results of combining the ELMo embedding:
(glove_vs_elmo chart)

  • Personal conclusions
  1. Surprisingly, I failed to achieve performance similar to what is reported in the MemNet paper. Or maybe there are bugs in the code?
  2. TD-LSTM performs unexpectedly well.
  3. Models with fixed embeddings are generally better than those with fine-tuned embeddings, which is consistent with the RAM paper.
  4. L2 regularization didn't help.
  5. I tried the ELMo embedding, but it didn't bring a performance gain.


absa_keras's Issues

Some questions

1. Why use SpatialDropout1D instead of Dropout?
2. Why does the right part use go_backwards=True instead of pad_sequences(padding='post', truncating='post')?
3. Why does the left part end at $t$ + (aspect word length) while the right part starts from the left of $t$?
4. The ATAE paper defines $h^* = \tanh(W_p r + W_x h_N)$, but the code uses Activation('tanh')(Add()([v1, v2])) (see the sketch below). Does this match the paper?
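
For reference, a minimal sketch of what that line computes, with assumed 300-dimensional shapes and placeholder Dense projections standing in for W_p and W_x:

from keras.layers import Input, Dense, Add, Activation
from keras.models import Model

r = Input(shape=(300,))    # attention-weighted sentence representation
h_n = Input(shape=(300,))  # last hidden state of the LSTM
v1 = Dense(300, use_bias=False)(r)    # W_p r
v2 = Dense(300, use_bias=False)(h_n)  # W_x h_N
h_star = Activation('tanh')(Add()([v1, v2]))  # h* = tanh(W_p r + W_x h_N)
model = Model(inputs=[r, h_n], outputs=h_star)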

Run issue

Hi,

I tried to run this code, but I got the error shown below.

(screenshot of the error: Screen Shot 2021-02-11 at 8.03.37 AM)

Could someone help, please?

Thank you.

Accuracy issue

Hello, I ran these datasets with your code and found that the results are 1% or 2% lower than those reported in the papers. What could be the problem? Thanks for your reply.

train_model 'cabasc' can't run

Hi,

I ran train.py from the previous commit, but only 'cabasc' failed, with the following error:

tensorflow.python.framework.errors_impl.InvalidArgumentError: Matrix size-incompatible: In[0]: [2336,300], In[1]: [1324,1324] [[{{node model_1/content_attention_1/MatMul_1}}]] [[{{node metrics/acc/Mean}}]]

At that time, I had commented out most of the train_model(...) calls and only ran the fixed word/aspect embedding part, i.e.

config.word_embed_trainable = False
config.aspect_embed_trainable = False
train_model('laptop/term', 'laptop', 'word', 'td_lstm')
train_model('laptop/term', 'laptop', 'word', 'tc_lstm')
train_model('laptop/term', 'laptop', 'word', 'ae_lstm')
train_model('laptop/term', 'laptop', 'word', 'at_lstm')
train_model('laptop/term', 'laptop', 'word', 'atae_lstm')
train_model('laptop/term', 'laptop', 'word', 'memnet')
train_model('laptop/term', 'laptop', 'word', 'ram')
train_model('laptop/term', 'laptop', 'word', 'ian')
# train_model('laptop/term', 'laptop', 'word', 'cabasc')

Only 'cabasc' was commented out, and it is also missing from the plot of the ELMo results above.

So I guess 'cabasc' cannot run under this configuration?

word_glove.npy file not found

When I try to run train.py, it shows me this error:

File "train.py", line 183, in <module>
    train_model('twitter', 'twitter', 'word', 'atae_lstm')
  File "train.py", line 46, in train_model
    model = SentimentModel(config)
  File "C:\Users\papa\Documents\ABSA_Keras\models.py", line 75, in __init__
    self.config.word_embed_type))
  File "C:\ProgramData\Anaconda3\lib\site-packages\numpy\lib\npyio.py", line 415, in load
    fid = open(os_fspath(file), "rb")
FileNotFoundError: [Errno 2] No such file or directory: './data/twitter/word_glove.npy'

New Dataset

Dear author(s),

What great work! I cloned your code, and it runs very well. I plan to run it on my own datasets for my research. However, I have some questions:

  1. May I directly replace the dataset in the data folder with my own?
  2. Must the data be in XML format?
  3. Which commands should I change when using a new dataset?

Thank you in advance.

Regards,
Novi
