populate quantization_config for kv-cache-scheme only configs #33874

horheynm · 2024-10-01T21:38:59Z

What does this PR do?

Follow up to https://github.com/huggingface/transformers/pull/31704/files.
Previously if quantization was done for the kv-cache pathway only from compressed-tensors, then quantization_config would not populate.
With this pr, it will now populate

Fixes # (issue)
Same as above

Who can review?

@SunMarc @younesbelkada @dsikka @mgoin

kylesayrs

Indeed, compression only occurs when there is a QuantizationScheme in the config groups or there is a kv cache scheme. No compression occurred <=> both of those fields are empty

SunMarc

LGTM !

HuggingFaceDocBuilderDev · 2024-10-02T11:35:06Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

LysandreJik

Awesome

…gface#33874)

populate quantization_config for kv-cache-scheme only configs

dc19bc1

kylesayrs approved these changes Oct 1, 2024

View reviewed changes

dsikka approved these changes Oct 1, 2024

View reviewed changes

SunMarc approved these changes Oct 2, 2024

View reviewed changes

SunMarc requested a review from LysandreJik October 2, 2024 11:10

LysandreJik approved these changes Oct 2, 2024

View reviewed changes

LysandreJik merged commit 181c962 into huggingface:main Oct 2, 2024
22 checks passed

horheynm deleted the compressed-tensors-quantizer-bug-fix branch October 2, 2024 12:42

NielsRogge pushed a commit to NielsRogge/transformers that referenced this pull request Oct 21, 2024

populate quantization_config for kv-cache-scheme only configs (huggin…

f5655c3

…gface#33874)

BernardZach pushed a commit to BernardZach/transformers that referenced this pull request Dec 5, 2024

populate quantization_config for kv-cache-scheme only configs (huggin…

4179a92

…gface#33874)

BernardZach pushed a commit to innovationcore/transformers that referenced this pull request Dec 6, 2024

populate quantization_config for kv-cache-scheme only configs (huggin…

266d5c4

…gface#33874)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

populate quantization_config for kv-cache-scheme only configs #33874

populate quantization_config for kv-cache-scheme only configs #33874

horheynm commented Oct 1, 2024 •

edited

Loading

kylesayrs left a comment

SunMarc left a comment

HuggingFaceDocBuilderDev commented Oct 2, 2024

LysandreJik left a comment

populate quantization_config for kv-cache-scheme only configs #33874

populate quantization_config for kv-cache-scheme only configs #33874

Conversation

horheynm commented Oct 1, 2024 • edited Loading

What does this PR do?

Who can review?

kylesayrs left a comment

Choose a reason for hiding this comment

SunMarc left a comment

Choose a reason for hiding this comment

HuggingFaceDocBuilderDev commented Oct 2, 2024

LysandreJik left a comment

Choose a reason for hiding this comment

horheynm commented Oct 1, 2024 •

edited

Loading