Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[fix] fix token healing tests and usage errors #33931

Merged
merged 16 commits into from
Oct 16, 2024

Conversation

alpertunga-bile
Copy link
Contributor

What does this PR do?

Hello! As I described in the #33827; token healing functionality is failing in usage and tests.

This PR is fixing the token healing tests and usage bugs.

Fixes #33827

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you read the contributor guideline,
    Pull Request section?
  • Was this discussed/approved via a Github issue or the forum? Please add a link
    to it if that's the case.
  • Did you make sure to update the documentation with your changes? Here are the
    documentation guidelines, and
    here are tips on formatting docstrings.
  • Did you write any new necessary tests?

Who can review?

@gante

@ArthurZucker
Copy link
Collaborator

cc @gante! 🤗

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

Copy link
Member

@gante gante left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@alpertunga-bile thank you for pinning the issue, finding the cause and problems with the test setup, and opening a PR with the fix -- contributors like you make transformers shine 💛

LGTM 👍

Note for our future selves, in case we need more context about PR: check this comment

@gante gante requested a review from LysandreJik October 7, 2024 13:03
@gante
Copy link
Member

gante commented Oct 7, 2024

The red CI should get fixed after #33950 is merged. At that point, I'd like to kindly ask you to rebase and force-push, to re-run CI 🤗

@gante
Copy link
Member

gante commented Oct 7, 2024

@alpertunga-bile
Copy link
Contributor Author

The red CI should get fixed after #33950 is merged. At that point, I'd like to kindly ask you to rebase and force-push, to re-run CI 🤗

Uhh, my bad. Sorry, i understand it wrong. I will add comments to my changes and push it to re-run the CI when #33950 is merged.

@LysandreJik
Copy link
Member

cc @ArthurZucker as we just discussed it in person

@alpertunga-bile alpertunga-bile force-pushed the fix_token_healing branch 2 times, most recently from 81ced1c to 641b200 Compare October 9, 2024 17:11
@alpertunga-bile
Copy link
Contributor Author

@gante I ran CI again after the #33950 PR was merged but now a test named pipelines_torch is failing. After which PR do I need to run CI again?

@gante
Copy link
Member

gante commented Oct 10, 2024

@alpertunga-bile sorry about that :( #34067 -- this would be the PR

@gante gante requested a review from ArthurZucker October 10, 2024 17:10
@alpertunga-bile
Copy link
Contributor Author

@alpertunga-bile sorry about that :( #34067 -- this would be the PR

Thanks, it is passed through the tests now.

Copy link
Collaborator

@ArthurZucker ArthurZucker left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Nice, let's merge 🚀

@ArthurZucker ArthurZucker merged commit 98bad9c into huggingface:main Oct 16, 2024
24 checks passed
NielsRogge pushed a commit to NielsRogge/transformers that referenced this pull request Oct 21, 2024
* auto-gptq requirement is removed & model is changed & tokenizer pad token is assigned

* values func is changed with extensions & sequence key value bug is fixed

* map key value check is added in ExtensionsTree

* empty trimmed_ids bug is fixed

* tail_id IndexError is fixed

* empty trimmed_ids bug fix is updated for failed test

* too much specific case for specific tokenizer is removed

* input_ids check is updated

* require auto-gptq import is removed

* key error check is changed with empty list check

* empty input_ids check is added

* empty trimmed_ids fix is checked with numel function

* usage change comments are added

* test changes are commented

* comment style and quality bugs are fixed

* test comment style and quality bug is fixed
BernardZach pushed a commit to BernardZach/transformers that referenced this pull request Dec 5, 2024
* auto-gptq requirement is removed & model is changed & tokenizer pad token is assigned

* values func is changed with extensions & sequence key value bug is fixed

* map key value check is added in ExtensionsTree

* empty trimmed_ids bug is fixed

* tail_id IndexError is fixed

* empty trimmed_ids bug fix is updated for failed test

* too much specific case for specific tokenizer is removed

* input_ids check is updated

* require auto-gptq import is removed

* key error check is changed with empty list check

* empty input_ids check is added

* empty trimmed_ids fix is checked with numel function

* usage change comments are added

* test changes are commented

* comment style and quality bugs are fixed

* test comment style and quality bug is fixed
BernardZach pushed a commit to innovationcore/transformers that referenced this pull request Dec 6, 2024
* auto-gptq requirement is removed & model is changed & tokenizer pad token is assigned

* values func is changed with extensions & sequence key value bug is fixed

* map key value check is added in ExtensionsTree

* empty trimmed_ids bug is fixed

* tail_id IndexError is fixed

* empty trimmed_ids bug fix is updated for failed test

* too much specific case for specific tokenizer is removed

* input_ids check is updated

* require auto-gptq import is removed

* key error check is changed with empty list check

* empty input_ids check is added

* empty trimmed_ids fix is checked with numel function

* usage change comments are added

* test changes are commented

* comment style and quality bugs are fixed

* test comment style and quality bug is fixed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

bug in the token healing
5 participants
  NODES
Bugs 5
COMMUNITY 2
innovation 1
Note 1
Project 5
USERS 1