MultiGPU Trainer: each process uses more memory than 1 GPU job
Distributed Training / Models
#7520
opened Oct 1, 2020 by
sshleifer
Overflow error: Can't convert negative value to unsigned int [RAG Model]
#7517
opened Oct 1, 2020 by
sashank06
2 of 2
[XLNet] attention_mask / input_mask - Why two `attention_mask` inputs?
#7512
opened Oct 1, 2020 by
patrickvonplaten
[Transfo-XL] Impossible to pass `attention_mask` to model
#7511
opened Oct 1, 2020 by
patrickvonplaten
[Reformer, Longformer, Roberta, GPT2, CTRL] attention_mask should be at second argument
#7510
opened Oct 1, 2020 by
patrickvonplaten
Turning the SQuAD dataset class into an iterator to save ram and redistribute time
#7503
opened Oct 1, 2020 by
mariusjohan
Functionality to pass first few tokens as input to the decoder in T5 model
#7502
opened Oct 1, 2020 by
ayushtiku5
Truncated Outputs while finetuning 'bart-base' on XSUM [Summarization Task]
#7500
opened Oct 1, 2020 by
yashgupta-7
How to generate data using beam search from a custom gpt2 model?
#7497
opened Oct 1, 2020 by
nrjvarshney
BertForSequenceClassification MSELoss() without normalizing using sigmoid/softmax
#7496
opened Oct 1, 2020 by
liusiyi641
Is the multiple-choice head for the pre-trained `LongformerForMultipleChoice` model pre-trained?
#7494
opened Oct 1, 2020 by
h56cho
`run_squad_trainer` doesn't actually use a Rust tokenizer + errors in `squad_convert_example_to_features` when using a Rust tokenizer
#7492
opened Sep 30, 2020 by
k8si
3 of 4
RAG: Can we have a document that explains the fine-tuning mechanism?
#7476
opened Sep 30, 2020 by
shamanez
Seq2SeqTrainer: add a fast test that doesn't learn anything but can run on CPU
#7466
opened Sep 30, 2020 by
sshleifer

