New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
one of the variables needed for gradient computation has been modified by an inplace operation
#15677
opened Feb 16, 2022 by
bjmajic
How can I use "accelerate launch" command to run training job on Multi-GPU?
#15675
opened Feb 16, 2022 by
JucyCherry
DebertaForMaskedLM cannot load the parameters in the MLM head
#15673
opened Feb 16, 2022 by
yardenTal1
Unable to generate chunks (If length is greater than 512 in bert), we can use to split into chunks
#15672
opened Feb 16, 2022 by
nithinreddyy
#15663
opened Feb 15, 2022 by
Ataago
2 of 4 tasks
cannot import name 'CONFIG_MAPPING' from 'transformers' (unknown location)
#15662
opened Feb 15, 2022 by
OmarMohammed88
GPT-2 pretrained model fails to load when TF v2 behaviour is disabled
#15661
opened Feb 15, 2022 by
rifatarefin
Is it fine if we do not pass the optimizer through accelerator.prepare() in DDP?
#15656
opened Feb 15, 2022 by
tuvuumass
Add support for ONNX-TensorRT conversion for GPT-J6B (and possible bug in rotary embedding)
#15640
opened Feb 13, 2022 by
tomerip
1 of 2 tasks
DDP training hangs with
run_glue.py
and run_seq2seq.py
#15618
opened Feb 11, 2022 by
Shivanshu-Gupta
2 of 4 tasks
T5 AttributeError: 'T5Encoder' object has no attribute 'main_input_name'
#15610
opened Feb 10, 2022 by
weilixiang
Previous Next
ProTip!
Find all open issues with in progress development work with linked:pr.