-
Notifications
You must be signed in to change notification settings - Fork 573
Pull requests: pytorch/FBGEMM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Fix cpuinfo not being initialized before checking for ARM SVE2
cla signed
fb-exported
#4121
opened May 13, 2025 by
MatzeB
Loading…
Trim constexpr from isA to improve Windows clang-cl support.
cla signed
#4119
opened May 13, 2025 by
ScottTodd
Loading…
Add MXFP4 PT reference quantization kernel and refactor CUTLASS FP4 GEMM
cla signed
fb-exported
#4117
opened May 13, 2025 by
jiawenliu64
Loading…
Support Triton unpacked MXFP4 quantization kernel
cla signed
fb-exported
#4116
opened May 13, 2025 by
jiawenliu64
Loading…
[fbgemm_gpu] Support ROCm 6.4 builds
ciflow/rocm
cla signed
module: rocm
#4114
opened May 12, 2025 by
q10
Loading…
introduce kernel for converting e4m3fn kv_cache to e4m3fnuz
cla signed
fb-exported
#4113
opened May 12, 2025 by
bradleyhd
Loading…
Support ordered read based on weight id in KVT
cla signed
fb-exported
#4108
opened May 10, 2025 by
emlin
Loading…
Update cmake requirement version
cla signed
fb-exported
#4100
opened May 9, 2025 by
gchalump
Loading…
Replace
C10_CUDA_KERNEL_LAUNCH_CHECK()
in the KernelLauncher
cla signed
fb-exported
#4097
opened May 8, 2025 by
q10
Loading…
Do FP8 rowwise bias addition in higher precision
cla signed
fb-exported
#4095
opened May 8, 2025 by
jwfromm
Loading…
Use bounds_check_indices v2 on ROCm
ciflow/rocm
cla signed
fb-exported
module: rocm
#4085
opened May 6, 2025 by
sryap
Loading…
Change GenAI OSS runner to fix OOM
cla signed
fb-exported
#4082
opened May 6, 2025 by
spcyppt
Loading…
Migrate TBE backward kernels to
FBGEMM_LAUNCH_KERNEL
cla signed
fb-exported
#4076
opened May 5, 2025 by
q10
Loading…
Back out "Simplify weight row cache load and evict routines"
cla signed
fb-exported
#4064
opened May 1, 2025 by
q10
Loading…
Integrate compat with Trunk and enable disabling BufferOps
cla signed
fb-exported
#4060
opened May 1, 2025 by
njriasan
Loading…
Simplify weight row cache load and evict routines
cla signed
fb-exported
#4050
opened Apr 30, 2025 by
q10
Loading…
Fix
int32_t
to auto
for code around WeightRow
cla signed
fb-exported
#4045
opened Apr 30, 2025 by
q10
Loading…
integrate dramKV with kvtensorwrapper
cla signed
fb-exported
#4043
opened Apr 29, 2025 by
steven1327
Loading…
torchrec support on kvzch emb lookup module
cla signed
fb-exported
#4035
opened Apr 28, 2025 by
duduyi2013
Loading…
Previous Next
ProTip!
Filter pull requests by the default branch with base:main.