Generate does not take into account config.decoder.eos_token_id #14905

Closed

NielsRogge opened this issue Dec 23, 2021 · 7 comments

Comments

@NielsRogge
Contributor

As reported by several users (see NielsRogge/Transformers-Tutorials#53 and the forum), the generate() method currently only takes config.eos_token_id into account to stop generation, not config.decoder.eos_token_id.

Hence, models created with EncoderDecoderModel/VisionEncoderDecoderModel/SpeechEncoderDecoderModel will not stop generation properly if config.eos_token_id is not set.
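A minimal workaround sketch, assuming a VisionEncoderDecoderModel and a placeholder checkpoint name: either mirror the decoder's eos_token_id onto the top-level config (which generate() does read), or pass eos_token_id explicitly to generate().

```python
from transformers import VisionEncoderDecoderModel

# "my-encoder-decoder-checkpoint" is a placeholder; substitute your own model.
model = VisionEncoderDecoderModel.from_pretrained("my-encoder-decoder-checkpoint")

# Workaround 1: copy the decoder's eos_token_id to the top-level config.
model.config.eos_token_id = model.config.decoder.eos_token_id

# Workaround 2: pass eos_token_id explicitly on every call, e.g.
# model.generate(pixel_values, eos_token_id=model.config.decoder.eos_token_id)
```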

cc @patrickvonplaten @patil-suraj

@patrickvonplaten
Member

Hmm, yeah, I think I'm fine with adding some if-statements to the generate() method.
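A hedged sketch of what such a fallback could look like; the helper name and structure below are hypothetical, not the actual generate() internals:

```python
def resolve_eos_token_id(config, eos_token_id=None):
    """Hypothetical helper: resolve the eos_token_id from the call argument,
    the top-level config, or the decoder config, in that order."""
    if eos_token_id is not None:
        return eos_token_id
    if getattr(config, "eos_token_id", None) is not None:
        return config.eos_token_id
    decoder_config = getattr(config, "decoder", None)
    if decoder_config is not None:
        # Fall back to the decoder's value only if it is actually set.
        return getattr(decoder_config, "eos_token_id", None)
    return None
```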

@patrickvonplaten
Member

Do you want to open a PR for it? :-)

@github-actions

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.

@thinksoso
Contributor

I will try to fix it.

@thinksoso
Contributor

thinksoso commented Jan 29, 2022

I opened PR #15403, but the CI failed. Analysing the CI failure log, I found there is a hidden assumption: if you don't pass an eos_token_id, the model is expected to generate until max_length. That is what this check enforces:

self.assertEqual(generated_output.shape, (input_ids.shape[0],) + (decoder_config.max_length,))

So simply adding self.config.decoder.eos_token_id is not enough.
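A hedged illustration of that hidden expectation (names reused from the assertion above, model setup assumed): when no eos_token_id is defined anywhere, generation must still run to decoder_config.max_length, so a fallback to config.decoder.eos_token_id has to leave the value unset in that case.

```python
# Sketch of the scenario the failing CI test exercises (assumed setup):
# neither the top-level config nor the decoder config defines eos_token_id,
# so generate() should only stop at max_length.
assert model.config.eos_token_id is None
assert model.config.decoder.eos_token_id is None

generated_output = model.generate(input_ids)
assert generated_output.shape == (input_ids.shape[0], decoder_config.max_length)
```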
cc @NielsRogge @patrickvonplaten

@Batese2001

Hi, the pull request above mentioned in passing that this had been fixed. Is that the case, or is this still open as indicated?

@NielsRogge
Contributor Author

Seems like this got fixed, closing the issue.
