huggingface / transformers Public

Notifications
Fork 13.7k
Star 58.2k

Code
Issues 342
Pull requests 95
Actions
Projects 24
Wiki
Security
Insights

Code
Issues
Pull requests
Actions
Projects
Wiki
Security
Insights

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

342 Open 8,346 Closed

Author

Filter by author

Label

Filter by label

Use alt + click/return to exclude labels

or ⇄1�7 + click/return for logical OR

Projects

Filter by project

Milestones

Filter by milestone

Assignee

Filter by who’s assigned

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated

Most reactions

Tokenizer prepare_for_model Error inconsistency

#15679 opened Feb 16, 2022 by r-stiller

one of the variables needed for gradient computation has been modified by an inplace operation

#15677 opened Feb 16, 2022 by bjmajic

How can I use "accelerate launch" command to run training job on Multi-GPU?

#15675 opened Feb 16, 2022 by JucyCherry

model.generate() using a user specified keyword argument

#15674 opened Feb 16, 2022 by JH-lee95

DebertaForMaskedLM cannot load the parameters in the MLM head

#15673 opened Feb 16, 2022 by yardenTal1

Unable to generate chunks (If length is greater than 512 in bert), we can use to split into chunks

#15672 opened Feb 16, 2022 by nithinreddyy

Add Video Vision Transformer New model

#15666 opened Feb 15, 2022 by jegork

3 tasks

Why are certain models with a higher WER (on the eval set) performing better or as good as models with a lower WER 1�7 when tested on the test set?

#15664 opened Feb 15, 2022 by drishtishrrma

🤗 Transformers **Trainer** API raises exception on train if triggered from an already started ML Flow run.

#15663 opened Feb 15, 2022 by Ataago

2 of 4 tasks

cannot import name 'CONFIG_MAPPING' from 'transformers' (unknown location)

#15662 opened Feb 15, 2022 by OmarMohammed88

GPT-2 pretrained model fails to load when TF v2 behaviour is disabled

#15661 opened Feb 15, 2022 by rifatarefin

Is it fine if we do not pass the optimizer through accelerator.prepare() in DDP?

#15656 opened Feb 15, 2022 by tuvuumass

Inference API with GPT2 {"error": "Unknown error"}

#15650 opened Feb 14, 2022 by nbravulapalli

TrOCR not working anymore after 4.16.2 update

#15648 opened Feb 14, 2022 by thetruejacob

GPT-NeoX-20B Integration

#15642 opened Feb 13, 2022 by sdtblck

Add support for ONNX-TensorRT conversion for GPT-J6B (and possible bug in rotary embedding)

#15640 opened Feb 13, 2022 by tomerip

1 of 2 tasks

Loading a fairseq trained wav2vec2 model with transformers

#15635 opened Feb 12, 2022 by tensorfoo

TPU slow finetuning T5-base

#15621 opened Feb 11, 2022 by gennarovaccaro

Why is my train_samples_per_second graph growing?

#15619 opened Feb 11, 2022 by drunkinlove

DDP training hangs with run_glue.py and run_seq2seq.py

#15618 opened Feb 11, 2022 by Shivanshu-Gupta

2 of 4 tasks

T5 AttributeError: 'T5Encoder' object has no attribute 'main_input_name'

#15610 opened Feb 10, 2022 by weilixiang

Documentation of DataCollatorForLanguageModeling

#15609 opened Feb 10, 2022 by yarongef

❄1�7 T5 pre-training dataset

#15608 opened Feb 10, 2022 by pietrolesci

Auto tokenizer for question-context tokenization

#15605 opened Feb 10, 2022 by Arij-Aladel

gpt2 error using torch.jit.trace

#15598 opened Feb 10, 2022 by Biaocsu

Previous 1 2 3 4 5 … 13 14 Next

Previous Next

ProTip! Find all open issues with in progress development work with linked:pr.