18
18
# can be repeated indefinitely, as any build with debug symbols can be sampled
19
19
# adds about 20% runtime sample conversion overhead (todo: reduce)
20
20
# no noticeable overhead unless perfdata is actually sampling
21
- # todo: might get better results when gathering and applying samples on same CPU
22
-
23
- # top performers from tests on bdver2 - rough comparison with official project binaries
24
- # 1. 11% faster - gcc-13.3.1 USE="-clang pgo" CXXFLAGS="-march=native -O3 -flto -fno-profile-partial-training"
25
- # 2. 06% faster - gcc-13.3.1 USE="-clang -pgo" CXXFLAGS="-march=native -O3"
26
- # 3. 02% faster - clang-18.1.8 USE="clang perfdata-sample-use" CXXFLAGS="-march=native -O3 -flto -fno-profile-sample-accurate -fno-sample-profile-use-profi" with samples from skylake
21
+ # better results when gen and use on same CPU arch
22
+
23
+ # rough comparisons with official project binaries
24
+ # top performers on bdver2
25
+ # 1. 11% faster - gcc-13.3.1 USE="pgo" CXXFLAGS="-march=native -O3 -flto -fno-profile-partial-training"
26
+ # 2. 06% faster - gcc-13.3.1 USE="-pgo" CXXFLAGS="-march=native -O3"
27
+ # 3. 02% faster - clang-18.1.8 USE="perfdata-sample-use" CXXFLAGS="-march=native -O3 -flto -fno-profile-sample-accurate -fno-sample-profile-use-profi" with samples from skylake
28
+ # top performers on skylake
29
+ # 1. 16% faster - gcc-13.3.1 USE="pgo" CXXFLAGS="-march=native -O3 -flto"
30
+ # 2. 14% faster - clang-18.8.1 USE="-pgo perfdata-sample-use" CXXFLAGS="-march=native -O3 -flto -fno-profile-sample-accurate" with samples from skylake
31
+ # 3. 12% faster - clang-18.8.1 USE="pgo" CXXFLAGS="-march=native -O3 -flto"
27
32
#
28
33
29
34
EAPI=8