Pull requests: pytorch/FBGEMM
Pull requests list
Add separate LSE correctness test
cla signed
fb-exported
meta-exported
#5185
opened Dec 4, 2025 by
Aya-ZIbra
fix l2 cache set and get
cla signed
fb-exported
meta-exported
#5184
opened Dec 4, 2025 by
zhaojuanmao
[fbgemm_gpu] Remove support for older architectures for PyPI releases
cla signed
#5181
opened Dec 4, 2025 by
q10
Add aarch64-specific EmbeddingSpMDM8Bit
cla signed
fb-exported
meta-exported
#5180
opened Dec 2, 2025 by
Nicoshev
Add CUDA implementation for fb::masked_select_jagged_1d()
cla signed
fb-exported
meta-exported
#5179
opened Dec 2, 2025 by
mfkaplan
enable detailed memory breakdown for fixed number of iterations
cla signed
fb-exported
meta-exported
#5170
opened Nov 24, 2025 by
ashuaibi7
Add robust field filtering in TBEDataConfig.from_json
cla signed
fb-exported
meta-exported
#5164
opened Nov 21, 2025 by
gchalump
Add type error suppressions for upcoming upgrade
cla signed
fb-exported
meta-exported
#5162
opened Nov 21, 2025 by
maggiemoss
Fix TBB cmake
cla signed
fb-exported
meta-exported
module: rocm
#5159
opened Nov 20, 2025 by
gchalump
Reduce FP16 kernel size by replacing ldr pairs with ldp
cla signed
fb-exported
meta-exported
#5150
opened Nov 19, 2025 by
mcfi
Add support rowwise_adagrad_wtith_counter on CPU
cla signed
fb-exported
meta-exported
#5146
opened Nov 18, 2025 by
gchalump
cutlass-fa3 new mask interface
cla signed
fb-exported
meta-exported
#5141
opened Nov 17, 2025 by
arsatis
Bump setuptools from 75.1.0 to 78.1.1 in /fbgemm_gpu
cla signed
dependencies
Pull requests that update a dependency file
python
Pull requests that update python code
#5139
opened Nov 16, 2025 by
dependabot
bot
Updated asmjit & adapted to its latest changes
cla signed
#5137
opened Nov 15, 2025 by
kobalicek
Slightly improve requantize_ AVX2 code performance
cla signed
fb-exported
meta-exported
#5135
opened Nov 14, 2025 by
mcfi
Extend raw_id_tracker to track ShardedManagedCollisionEmbeddingCollection
cla signed
fb-exported
meta-exported
#5128
opened Nov 14, 2025 by
FriedCosey
Enable arm64 convolution for fbgemm through the reference convolution APIs
cla signed
fb-exported
meta-exported
#5126
opened Nov 13, 2025 by
mcfi
Backward optimization for group_index_select_or_add_2d_kernel
cla signed
#5123
opened Nov 13, 2025 by
shbiswas834
embedding forward optimization for rocm
cla signed
module: rocm
#5120
opened Nov 12, 2025 by
JaxChen29
add group_index_select_or_add_2d_kernel optimizations
cla signed
module: rocm
#5119
opened Nov 12, 2025 by
shbiswas834
Draft