-
Notifications
You must be signed in to change notification settings - Fork 555
Pull requests: pytorch/FBGEMM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[AMD] Add CK to dependencies to enable AMD build.
cla signed
#3929
opened Apr 3, 2025 by
jwfromm
Loading…
Pass in sharding position information to TBE to facilitate logging / dump / etc.
cla signed
fb-exported
#3927
opened Apr 3, 2025 by
levythu
Loading…
Support prefetch pipeline in bounds_check_indices
cla signed
fb-exported
#3923
opened Apr 3, 2025 by
sryap
Loading…
Cleanups to
StochasticRoundingRNGState
cla signed
fb-exported
module: rocm
#3922
opened Apr 3, 2025 by
q10
Loading…
Add tests for bounds_check_indices v2
cla signed
fb-exported
#3920
opened Apr 2, 2025 by
sryap
Loading…
[fbgemm_gpu] Enable ROCm builds for GenAI
ciflow/rocm
cla signed
module: rocm
#3910
opened Apr 1, 2025 by
q10
Loading…
Enable slow accumulation in fp8 grouped gemm
cla signed
fb-exported
#3904
opened Mar 31, 2025 by
jwfromm
Loading…
support permute_multi_embedding_function on torch.export
cla signed
fb-exported
#3897
opened Mar 28, 2025 by
zejunh
Loading…
Debug A100 too many resources requested for launch issue
cla signed
fb-exported
#3893
opened Mar 28, 2025 by
jianyuh
Loading…
Add NEON transpose kernel for half-precision
cla signed
#3892
opened Mar 28, 2025 by
skykongkong8
Loading…
Integrate D71065405 and D71079311 into stochastic rounding
cla signed
fb-exported
#3882
opened Mar 26, 2025 by
q10
Loading…
nested dispatching of segment_csr on cpu/gpu
cla signed
fb-exported
#3881
opened Mar 26, 2025 by
jeetkanjani7
Loading…
Improve Fused8BitRowwiseQuantizedSBFloatToFloatOrHalfNeon by 2%-10%
cla signed
fb-exported
#3879
opened Mar 25, 2025 by
Nicoshev
Loading…
Previous Next
ProTip!
Updated in the last three days: updated:>2025-04-01.