Comments (1)
the same issue is coming while training from scartch
sh-4.4$ sh run.sh
2020-06-30 14:00:05 | INFO | fairseq_cli.train | Namespace(activation_dropout=0.1, activation_fn='gelu', adam_betas='(0.9, 0.999)', adam_eps=1e-08, all_gather_list_size=16384, arch='ngram_transformer_prophet_large', attention_dropout=0.1, best_checkpoint_metric='loss', bf16=False, bpe=None, broadcast_buffers=False, bucket_cap_mb=25, checkpoint_suffix='', clip_norm=0.1, cpu=False, criterion='ngram_language_loss', curriculum=0, data='cnndm/processed', data_buffer_size=10, dataset_impl=None, ddp_backend='no_c10d', decoder_attention_heads=16, decoder_embed_dim=1024, decoder_ffn_embed_dim=4096, decoder_layers=12, device_id=0, disable_ngram_loss=False, disable_validation=False, distributed_backend='nccl', distributed_init_method=None, distributed_no_spawn=False, distributed_port=-1, distributed_rank=0, distributed_world_size=1, distributed_wrapper='DDP', dropout=0.1, empty_cache_freq=0, encoder_attention_heads=16, encoder_embed_dim=1024, encoder_ffn_embed_dim=4096, encoder_layers=12, eval_bleu=False, eval_bleu_args=None, eval_bleu_detok='space', eval_bleu_detok_args=None, eval_bleu_print_samples=False, eval_bleu_remove_bpe=None, eval_tokenized_bleu=False, fast_stat_sync=False, find_unused_parameters=False, fix_batches_to_gpus=False, fixed_validation_seed=None, fp16=True, fp16_init_scale=128, fp16_no_flatten_grads=False, fp16_scale_tolerance=0.0, fp16_scale_window=None, keep_best_checkpoints=-1, keep_interval_updates=-1, keep_last_epochs=10, label_smoothing=0.1, left_pad_source='True', left_pad_target='False', load_alignments=False, load_from_pretrained_model='../prophetnet_large_pretrained_160G_14epoch_model.pt', load_sep=True, localsgd_frequency=3, log_format=None, log_interval=100, lr=[0.0001], lr_scheduler='inverse_sqrt', max_epoch=10, max_sentences=2, max_sentences_valid=2, max_source_positions=512, max_target_positions=512, max_tokens=None, max_tokens_valid=None, max_update=0, maximize_best_checkpoint_metric=False, memory_efficient_bf16=False, memory_efficient_fp16=False, min_loss_scale=0.0001, min_lr=-1, model_parallel_size=1, ngram=2, no_epoch_checkpoints=False, no_last_checkpoints=False, no_progress_bar=False, no_save=False, no_save_optimizer_state=False, nprocs_per_node=1, num_batch_buckets=0, num_buckets=32, num_workers=4, optimizer='adam', optimizer_overrides='{}', patience=-1, profile=False, quantization_config_path=None, relative_max_distance=128, required_batch_size_multiple=8, reset_dataloader=False, reset_lr_scheduler=False, reset_meters=False, reset_optimizer=False, restore_file='checkpoint_last.pt', save_dir='cnndm/finetune_cnndm_checkpoints', save_interval=1, save_interval_updates=0, seed=1, sentence_avg=False, share_all_embeddings=True, share_decoder_input_output_embed=True, skip_invalid_size_inputs_valid_test=True, slowmo_algorithm='LocalSGD', slowmo_momentum=None, source_lang=None, target_lang=None, task='translation_prophetnet', tensorboard_logdir='cnndm/finetune_cnndm_tensorboard', threshold_loss_scale=None, tokenizer=None, tpu=False, train_subset='train', truncate_source=False, update_freq=[32], upsample_primary=1, use_bmuf=False, use_old_adam=False, user_dir='./prophetnet', valid_subset='valid', validate_interval=1, warmup_init_lr=1e-07, warmup_updates=1000, weight_decay=0.01)
Traceback (most recent call last):
File "D:\windows_program\conda\envs\p\Scripts\fairseq-train-script.py", line 33, in <module>
sys.exit(load_entry_point('fairseq', 'console_scripts', 'fairseq-train')())
File "e:\fairseq\fairseq_cli\train.py", line 347, in cli_main
cli_main_helper(args)
File "e:\fairseq\fairseq_cli\train.py", line 385, in cli_main_helper
main(args)
File "e:\fairseq\fairseq_cli\train.py", line 64, in main
task = tasks.setup_task(args)
File "e:\fairseq\fairseq\tasks\__init__.py", line 17, in setup_task
return TASK_REGISTRY[args.task].setup_task(args, **kwargs)
File "e:\fairseq\fairseq\tasks\translation.py", line 226, in setup_task
raise Exception('Could not infer language pair, please provide it explicitly')
Exception: Could not infer language pair, please provide it explicitly
from prophetnet.
Related Issues (20)
- Can use_fp16 be used?
- Why is the GENIE result in AR-diffusion very different from the original paper? Also, you come from a team. HOT 1
- Character level
- Can't Find Pretrained Checkpoint of Prophetnet: HOT 1
- Unable to load the GENIE model HOT 1
- The datasets have no dev set? HOT 1
- Options Employed for Training or Inference on the CNN/DM Dataset HOT 1
- It seems that the core code of CRITIC, particularly the part involving Google API search, is not implemented HOT 4
- Missing key documents for AR-Diffusion HOT 1
- where is "mbr_select.py" in AR-Diffusion HOT 1
- Unable to run Genie_Finetune.py HOT 1
- “load_fairseq” not found in "AR-Diffusion/data_utils" HOT 3
- Question for GENIE Finetuning, how to specify epochs for training/finetuning? HOT 1
- “load_fairseq” not found in "AR-Diffusion/data_utils" HOT 1
- [AR-Diffusion] predict_xstart vs predict_x_start HOT 3
- AR-Diffusion data.name and exp.name HOT 2
- Request the execution code of llama2
- AR-diffusion: where the code for algorithm 1 is located? HOT 4
- (AR-Diffusion) RuntimeError: Error(s) in loading state_dict for CrossAttention_Diffusion_LM HOT 3
- what is the need for `num_samples` parameter in inference? HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from prophetnet.