Skip to content

Conversation

ColinPeppler
Copy link
Contributor

Summary:

What

  1. restore changes from D79899622
  2. fixes numerical issue with llama

Llama numerical issue fix

In the original diff, I was not checking the value of std::optional<bool> symmetric only that a value existed...

https://www.internalfb.com/code/fbsource/[e97b4a76367b61af327d1aa2683d691730220493]/fbcode/deeplearning/fbgemm/fbgemm_gpu/experimental/gen_ai/src/kv_cache/kv_cache.cu?lines=2903%2C2911%2C2981-2982

This is now fixed with

bool use_symmetric_quantization = symmetric.value_or(false);
if (use_symmetric_quantization) {
  CALL_DEQUANTIZE_FP8_CACHE(false, true);
}

Differential Revision: D82320500

Copy link

netlify bot commented Sep 12, 2025

Deploy Preview for pytorch-fbgemm-docs ready!

Name Link
🔨 Latest commit 323989a
🔍 Latest deploy log https://app.netlify.com/projects/pytorch-fbgemm-docs/deploys/68cc3aa6db37b700082f67f1
😎 Deploy Preview https://deploy-preview-4868--pytorch-fbgemm-docs.netlify.app
📱 Preview on mobile
Toggle QR Code...

QR Code

Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify project configuration.

@meta-cla meta-cla bot added the cla signed label Sep 12, 2025
@facebook-github-bot
Copy link
Contributor

@ColinPeppler has exported this pull request. If you are a Meta employee, you can view the originating diff in D82320500.

@facebook-github-bot
Copy link
Contributor

@ColinPeppler has exported this pull request. If you are a Meta employee, you can view the originating diff in D82320500.

ColinPeppler added a commit to ColinPeppler/FBGEMM that referenced this pull request Sep 18, 2025
ColinPeppler added a commit to ColinPeppler/FBGEMM that referenced this pull request Sep 18, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants