yapengtian / av-robustness-cvpr21
Can audio-visual integration strengthen robustness under multimodal attacks?
License: MIT License
main_attack.py, line 550:
builder.build_sound / builder.build_frame
`fc_dim` is an unexpected argument. I deleted it and the code ran fine. Should it be removed?
Hi, sorry to disturb you, but I ran into some problems when trying to reproduce your results. My results are sometimes more than 10% lower than yours, so I suspect something is wrong in how I run your code. My procedure on the AVE dataset is as follows:
(1) Run extract_audio.py and extract_frames.py to get the audio and frame files.
(2) Run train_attack_AVE.sh.
(3) Run eval_attack_AVE.sh to evaluate the robustness of the model obtained in (2). Since the epsilons in your defense settings are all 0.006 while some in your attack settings are 0.012, I changed the 0.012 values in main_attack.py to 0.006 to match main_defense.py, so the defense can be compared with other methods under the same settings.
(4) Run train_defense_AVE.sh.
(5) Run eval_defense_AVE.sh to evaluate the defense approach you proposed. As I understand it, this step should reproduce the results in your paper, but it didn't.
Is there something wrong with my steps?
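For reference, the procedure above can be written as a single script. The script names are the ones quoted in this thread; treat this as a sketch of the intended order of operations, not a verified pipeline (paths and arguments depend on your checkout and dataset location):

```shell
#!/usr/bin/env bash
# Sketch of the AVE reproduction steps quoted above; run from the repo root.
set -e
python extract_audio.py       # (1) extract audio tracks from the AVE videos
python extract_frames.py      # (1) extract video frames
bash train_attack_AVE.sh      # (2) train under multimodal attacks
bash eval_attack_AVE.sh       # (3) evaluate attacks (eps aligned to 0.006)
bash train_defense_AVE.sh     # (4) train the proposed defense
bash eval_defense_AVE.sh      # (5) evaluate the defense
```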
Besides the procedure, I also have some other questions:
When I ran extract_audio.py and extract_frames.py, I found that more than 20 videos in the AVE dataset had no audio files. Is this normal?
Are your reported defense results on the AVE dataset actually obtained with 60 epochs rather than 30? I had to run 30 + 30 epochs to reach your numbers.
The epsilons are 0.06 and 0.12 in your paper but 0.006 and 0.012 in your code; are these typos? I used 0.006 and 0.012, assuming the values in the code are the correct ones.
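For context on why the factor of ten matters: in standard L-infinity adversarial attacks, eps directly bounds the per-element change on inputs scaled to [0, 1]. A minimal NumPy sketch (not the authors' code; `clamp_perturbation` is a hypothetical helper) of how the budget constrains a perturbation:

```python
import numpy as np

def clamp_perturbation(x, x_adv, eps):
    """Project x_adv back into the L-infinity eps-ball around x, then into [0, 1]."""
    delta = np.clip(x_adv - x, -eps, eps)
    return np.clip(x + delta, 0.0, 1.0)

x = np.full((3, 4, 4), 0.5)      # clean input scaled to [0, 1]
x_adv = x + 0.05                 # candidate perturbation of size 0.05
small = clamp_perturbation(x, x_adv, eps=0.006)  # budget from the code
large = clamp_perturbation(x, x_adv, eps=0.06)   # budget from the paper
print(np.abs(small - x).max())   # clipped down to the 0.006 budget
print(np.abs(large - x).max())   # 0.05 fits inside the 0.06 budget
```

So with eps = 0.006 the 0.05 perturbation is clipped almost entirely away, while with eps = 0.06 it passes through untouched; results under the two budgets are not comparable.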
Hi, thanks for your amazing work.
May I ask when the code will be updated? Looking forward to it!
Hello, I am wondering:
Why is the sampling rate for extracting waveforms 11025 Hz in README.md but 11000 Hz in extract_audio.py?
Why is the recommended number of epochs for the AVE dataset 100 in your paper but 30 in train_defense_AVE.sh?
Why are the frame and audio learning rates for defense training on AVE 1e-3 and 1e-4 respectively in your paper, but 1e-4 and 1e-3 in train_defense_AVE.sh?
Hi, thank you for sharing your work. I am trying to reproduce it, but it seems that 'data/val.csv' is missing?