Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

populate quantization_config for kv-cache-scheme only configs #33874

Conversation

horheynm
Copy link
Contributor

@horheynm horheynm commented Oct 1, 2024

What does this PR do?

Follow up to https://github.com/huggingface/transformers/pull/31704/files.
Previously if quantization was done for the kv-cache pathway only from compressed-tensors, then quantization_config would not populate.
With this pr, it will now populate

Fixes # (issue)
Same as above

Who can review?

@SunMarc @younesbelkada @dsikka @mgoin

Copy link
Contributor

@kylesayrs kylesayrs left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Indeed, compression only occurs when there is a QuantizationScheme in the config groups or there is a kv cache scheme. No compression occurred <=> both of those fields are empty

Copy link
Member

@SunMarc SunMarc left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM !

@SunMarc SunMarc requested a review from LysandreJik October 2, 2024 11:10
@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

Copy link
Member

@LysandreJik LysandreJik left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Awesome

@LysandreJik LysandreJik merged commit 181c962 into huggingface:main Oct 2, 2024
22 checks passed
@horheynm horheynm deleted the compressed-tensors-quantizer-bug-fix branch October 2, 2024 12:42
NielsRogge pushed a commit to NielsRogge/transformers that referenced this pull request Oct 21, 2024
BernardZach pushed a commit to BernardZach/transformers that referenced this pull request Dec 5, 2024
BernardZach pushed a commit to innovationcore/transformers that referenced this pull request Dec 6, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

6 participants
  NODES
COMMUNITY 2
innovation 1
Project 5
USERS 1