Skip to content
GitLab
Explore
Sign in
Overview
Active
Stale
All
clo-github/test_620699551
d3557f75
·
Enable QD8 AMX 16x16c4 GEMM and VNNI 16x16c4 IGEMM microkernel
·
Mar 31, 2024
clo-github/test_621860213
a9074a0d
·
F32 & F16 rsum microkernels return the sum instead of taking it as a parameter.
·
Apr 04, 2024
clo-github/test_606237475
923f4df9
·
Add iterative `vsqrt` microkernels for `x86_64`, which computes `x*rsqrt(x)`, i.e.
·
Apr 05, 2024
clo-github/test_623037691
240bd607
·
Enable AVX512 and AVX2 F32_RADDSTOREEXPMINUSMAX microkernels
·
Apr 09, 2024
clo-github/ds/clone-prs
d6500d26
·
Merge branch 'ds/clone-5912' into ds/clone-prs
·
Apr 11, 2024
clo-github/test_619155668
887c3d8b
·
Exported helper functions for transposition normalization.
·
Apr 11, 2024
clo-github/test_623781016
35c99f21
·
When no weight cache is provided to XNNPack, create one to share packed weights between operations.
·
Apr 12, 2024
clo-github/test_624151682
ea546fa9
·
Add `WAsm SIMD` microkernel for `f32-rsqrt`.
·
Apr 12, 2024
clo-github/test_624574039
8a7896ef
·
Change AMX k-block from 64 to 4 for faster testing.
·
Apr 13, 2024
clo-github/test_624768685
52df8e54
·
AMX QD8_F32_QC8W GEMM generate all tile sizes
·
Apr 14, 2024
clo-github/test_625052790
15c2312b
·
Don't attempt to call `cpuinfo_initialize()` unless `XNN_ENABLE_CPUINFO` is enabled.
·
Apr 15, 2024
clo-github/test_625223421
d427ff73
·
F16 GEMM use F16-FP32ACC for improved performance
·
Apr 16, 2024
clo-github/test_625303192
6ff7b9ec
·
Mean op can handle arbitrary reduction axis
·
Apr 16, 2024
clo-github/test_625303190
c6131ce2
·
Rsum ukernels accumulate into output.
·
Apr 16, 2024
clo-github/test_625284727
3d01cf78
·
Fix missing `#include`s in `XNNPACK/test` subdirectory.
·
Apr 16, 2024
clo-github/test_625063668
5db1d29c
·
Use the `ReplicableRandomDevice` instead of...
·
Apr 16, 2024
clo-github/test_625273447
c1aa9784
·
Member functions in `class` definitions need not be marked as `inline`.
·
Apr 16, 2024
clo-github/test_626845388
2bd7b24c
·
F16-RMAX using sign compliment for scalar microkernel
·
Apr 21, 2024
clo-github/test_626510308
56ec7042
·
Improve GEMM unittest performance
·
Apr 22, 2024
clo-github/test_627355959
a564ce78
·
Accumulating AVX rdsum microkernels
·
Apr 24, 2024
Prev
1
…
3
4
5
6
7
8
9
10
11
…
13
Next