
RHEL 6 PERFORMANCE & TUNING

Rodrigo Freire
Sr. Technical Account Manager
22/May/2014

Agenda

Performance Tuning Theory

RHEL6 Performance Improvements

CPU Performance Tuning/Power Management

Memory/NUMA Performance Tuning

Network Performance Tuning


Performance Tuning Theory


Performance Tuning Food Groups


CPU

Memory

I/O

Network

Performance or Reliability?

Faster transactions? Or efficiency?

Tradeoffs

Risks

Cost


Basic OS Setup

Disable unnecessary services and use runlevel 3

Avoid disk access in the critical path

Consider disabling filesystem journaling and {a,dir}time updates

Ever considered running swapless? (vm.swappiness)

CPU isolation?

Be aware of the BIOS making power-management decisions
(see the sketch below)
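
A minimal sketch of these basics on RHEL 6; the service names, mount point, and swappiness value are illustrative assumptions, not recommendations from this deck:

# disable services you do not need (names are examples)
chkconfig bluetooth off
chkconfig cups off
# boot to runlevel 3: set "id:3:initdefault:" in /etc/inittab
# drop atime/diratime updates on a data filesystem (mount point is an example)
mount -o remount,noatime,nodiratime /data
# tame swapping without going fully swapless
sysctl -w vm.swappiness=10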


Efficiency decision

Bandwidth: maximum possible throughput

Throughput: current bandwidth usage

High-efficiency approach: larger and fewer packets

Low-latency approach: send data immediately (more and smaller packets)


More bandwidth?

Cutting edge costs

Non-linear $/performance

Benefits?


In a nutshell...

TEST, MEASURE, TEST, MEASURE, TEST...

Be patient, accurate, and methodical... it's iterative

Know your hardware!!!!!!!!!

Be aware of the latency vs throughput balancing act

This cannot be stressed enough...


The enemy of extreme low latency: batching/coalescing

Avoid disk when you can...

Use tools such as SystemTap and perf (a quick perf sketch below)
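
A quick, hedged profiling example with perf; the 10-second sampling window is arbitrary:

# sample all CPUs (with call graphs) for 10 seconds, then summarize hotspots
perf record -a -g sleep 10
perf report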


RHEL6 Performance
Improvements


Performance Improvements in RHEL6

Component     Feature
CPU/Kernel    NUMA; ticketed spinlocks; Completely Fair Scheduler;
              extensive use of Read-Copy-Update (RCU);
              scales up to 64 vCPUs per guest
Memory        Large-memory optimizations: Transparent Huge Pages,
              ideal for virtualization
Networking    vhost-net, a kernel-based virtio with better throughput
              and latency; SR-IOV for ~native performance; RFS/XPS
Block         AIO, MSI, scatter-gather


tuned profile summary...

Tunable                              default   enterprise-  virtual-  virtual-  latency-     throughput-
                                               storage      host      guest     performance  performance
kernel.sched_min_granularity_ns      4ms       10ms         10ms      10ms      -            10ms
kernel.sched_wakeup_granularity_ns   4ms       15ms         15ms      15ms      -            15ms
vm.dirty_ratio                       20% RAM   40%          10%       40%       -            40%
vm.dirty_background_ratio            10% RAM   -            5%        -         -            -
vm.swappiness                        60        10           -         30        -            -
I/O scheduler (elevator)             CFQ       deadline     deadline  deadline  deadline     deadline
Filesystem barriers                  On        Off          Off       Off       -            -
CPU governor                         ondemand  performance  -         -         performance  performance
Disk read-ahead                      -         4x           -         -         -            -
Disable THP                          -         -            -         -         Yes          -
Disable C-states                     -         -            -         -         Yes          -
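
Applying one of these profiles is a one-liner with tuned-adm:

# list the available profiles, switch to one, then confirm
tuned-adm list
tuned-adm profile enterprise-storage
tuned-adm active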


Block Devices


Available Technologies


Solid-State Device

Spinning HDD


Virtualization Tuning: I/O elevators - OLTP

[Chart: Performance impact of I/O elevators on an OLTP workload, host running the deadline scheduler. Y-axis: transactions per minute, 0-300K; series: noop, CFQ, deadline; X-axis: 1, 2, and 4 guests.]


But if you use central storage...

Use the deadline elevator (example below)

The storage array has its own optimization algorithms and caches

Is the VM image on the storage?

CPU Performance Tuning


A word about CPU power management...

C- and P-states
You probably don't always need what you paid for...
Recent chips from major vendors slow themselves down
(called P-states)
or lower voltages and disable portions of the core, like timers
(called C-states)
and spin them back up on demand.
This adds latency.

Monitoring:
Use powertop, or turbostat from the kernel source (sketch below)
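
A hedged monitoring sketch; turbostat ships in the kernel source tree and may need to be built before use:

# interactive overview of wakeups, governors, and C-state residency
powertop
# report C-state/P-state residency while a command runs
turbostat sleep 5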


CPU Tuning

Variable frequencies
Multiple cores
Power-saving modes (cpuspeed governors):
performance
ondemand
userspace

Examples:
echo "performance" > \
/sys/devices/system/cpu/cpu0/cpufreq/scaling_governor
Best of both worlds: cron jobs that switch the governor mode using
tuned-adm (sketch below)
tuned-adm profile {default,latency-performance}
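
A minimal cron sketch of that "best of both worlds" idea; the schedule and profile choice are illustrative assumptions:

# crontab: low latency during business hours, defaults overnight
0 8 * * 1-5 /usr/sbin/tuned-adm profile latency-performance
0 18 * * 1-5 /usr/sbin/tuned-adm profile default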


Scheduler Policies

TS SCHED_NORMAL (aka SCHED_OTHER, the default policy)

FF SCHED_FIFO (realtime policy, first in, first out)

Don't set your RTPRIO to 99: that will starve kernel threads
that need to run sometimes.
There is no way to fully isolate a core for 100% userspace
processing. Recent study in a previous slide...

RR SCHED_RR, same as FIFO but with a defined quantum

SCHED_RR is only useful with more than one task at the same priority.

SCHED_BATCH, ISO SCHED_ISO, IDL SCHED_IDLE

Change the policy programmatically, or with chrt (examples below)

# ps -emo pid,pcpu,psr,nice,cmd,rtprio,policy
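
Hedged chrt examples; the priority values and PID are illustrative:

# run a command under SCHED_FIFO at priority 10
chrt -f 10 ./myapp
# inspect, then change, the policy of a running process (PID 1234 is an example)
chrt -p 1234
chrt -r -p 5 1234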


Your hardware might be fooling you!

SMIs

BIOS power governors

Broken BIOS!


Memory & NUMA


Transparent Huge Pages

Standard page: 4 kB
Huge page: 2048 kB

512 times larger!
Less L1 TLB consumption
Memory intensive: WIN!

[root@rfreire ~]# (disable THP)
[root@rfreire ~]# time memhog 1g
real 0m0.600s
user 0m0.186s
sys  0m0.412s

[root@rfreire ~]# (enable THP)
[root@rfreire ~]# time memhog 1g
real 0m0.303s
user 0m0.199s
sys  0m0.100s

Memory Tuning: Transparent Hugepages

Introduced in RHEL 6.0
Anonymous memory only (swappable, can be disabled)
Can coexist with traditional hugepages
Does not require application support (anonymous memory)
RHEL 6.2 added counters, explained in transhuge.txt:
# egrep 'trans|thp' /proc/vmstat
nr_anon_transparent_hugepages 2018
thp_fault_alloc 7302
thp_fault_fallback 0
thp_collapse_alloc 401
thp_collapse_alloc_failed 0
thp_split 21
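
Checking and toggling THP at runtime; the redhat_ prefix in this path is an assumption based on the RHEL 6 kernel (upstream kernels use /sys/kernel/mm/transparent_hugepage):

# show the active mode (the bracketed value)
cat /sys/kernel/mm/redhat_transparent_hugepage/enabled
# disable, or re-enable, without rebooting
echo never > /sys/kernel/mm/redhat_transparent_hugepage/enabled
echo always > /sys/kernel/mm/redhat_transparent_hugepage/enabled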


NUMA - What is it?

Non-Uniform Memory Access

CPU-Bound

Central memory controller?

Access Cost


NUMA

Multi-socket/multi-core architecture used for scaling

RHEL5/6 are completely NUMA-aware
Additional, significant performance gains come from
enforcing NUMA locality.

How do you enforce NUMA locality?

numactl -c1 -m1 ./command
The command executes on CPUs in socket 1,
and memory allocations are served out of memory node 1.

NUMA automation is an area of significant
research and investment by both Red Hat and the
community:
AutoNUMA, schedNUMA, numad (inspection sketch below)
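
Before pinning anything, inspect the topology; both tools ship with the numactl package:

# nodes, their CPUs, and free memory per node
numactl --hardware
# per-node allocation counters; numa_miss indicates remote allocations
numastat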

NUMA

[Diagram: NUMA node topology]

NUMA Topology and the PCI Bus

A server may have more than one PCI bus.

Optimal performance reduces/eliminates inter-node cross-talk: install the
NIC in the slot local to the node that your application will run on. Use
systemtap numa_faults.stp. irqbalance will learn this soon.
In the case below, the NIC is on PCI bus 0001; CPUs 1,3,5,7 are local
to that PCI slot.

lspci output:
0001:06:00.0 Ethernet controller: Solarflare Communications SFC9020 [Solarstorm]
# cat /sys/devices/pci0000\:00/0000\:00\:00.0/local_cpulist
0,2,4,6
# cat /sys/devices/pci0001\:40/0001\:40\:00.0/local_cpulist
1,3,5,7
# dmesg | grep "NUMA node"
pci_bus 0000:00: on NUMA node 0 (pxm 0)
pci_bus 0001:40: on NUMA node 1 (pxm 1)
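
For NICs there is a shortcut, assuming the interface is named eth0; a value of -1 means the platform did not report a node:

# NUMA node the network device hangs off
cat /sys/class/net/eth0/device/numa_node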


NUMA Topology and PCI Bus

[Diagram: PCI buses attached to different NUMA nodes]

Network


Network Determinism

Do you really need to use TCP?

If so, experiment with the TCP_NODELAY socket option (Nagle; sketch below)
From Wikipedia:

if there is new data to send
    if the window size >= MSS and available data is >= MSS
        send complete MSS segment now
    else
        if there is unconfirmed data still in the pipe
            enqueue data in the buffer until an acknowledge is received
            ^^^ latency ^^^ more noticeable in high-RTT/WAN environments
        else
            send data immediately
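
In application code this is a setsockopt(TCP_NODELAY) call; from the shell you can experiment with socat, assuming its nodelay address option (which sets TCP_NODELAY); the host and port here are illustrative:

# connect with Nagle disabled and compare round-trip latency
socat - TCP:server.example.com:5001,nodelay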


Buffer Bloat

Buffers are everywhere... www.bufferbloat.net

What is buffer bloat?

Latency caused by excessive buffering
A side-effect of ignoring latency in the race for
greater throughput

http://www.bufferbloat.net/projects/bloat/wiki/Introduction

Find out about your buffers.
Use 'ss -e' or 'netstat -nt' (sketch below)
NIC ring buffers and the new Byte Queue Limits

http://linuxplumbersconf.org/2011/ocw/sessions/171

https://lwn.net/Articles/454390/
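
Inspecting buffers at two layers; eth0 is an assumed interface name:

# per-socket details, including buffer/memory information
ss -e
# NIC ring buffer sizes, current vs. hardware maximum
ethtool -g eth0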


Multiqueue Networking (aka RSS)

Invented to allow Linux networking to scale along with the hardware
2-socket/8-core machines are extremely common; optimize for this use-case
A hash of src/dst IP:PORT determines the receiving CPU (per-queue IRQs below)
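
A multiqueue NIC shows one interrupt line per queue; eth0 is an assumed name:

# one IRQ per RX/TX queue on a multiqueue NIC
grep eth0 /proc/interrupts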

But the OS should handle all of this...

The Performance Group couldn't agree more!

Out-of-the-box performance experience is the highest priority:
Defaults work for the majority of use-cases
Auto-tuning where they don't
Hand-tuning as a last resort (RPS sketch below)
irqbalance is being taught about PCI bus locality
(local_cpulist)
The kernel already knows; RHEL6.3+ will set it for you.
numad can automatically balance NUMA node
utilization to avoid NUMA faults.
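
When hand-tuning is warranted, RPS steers receive processing to a CPU mask per queue; eth0 and the mask are illustrative:

# steer receive processing for queue 0 to CPUs 0-3 (bitmask 0xf)
echo f > /sys/class/net/eth0/queues/rx-0/rps_cpus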


Some useful resources

Performance Tuning Guide:
https://access.redhat.com/site/documentation/en-US/Red_Hat_Enterprise_Linux/6/html-single/Performance_Tuning_Guide/index.html
SystemTap:
https://access.redhat.com/site/solutions/5441
https://access.redhat.com/site/documentation/en-US/Red_Hat_Enterprise_Linux/6/html/SystemTap_Beginners_Guide/index.html
Benchmarking tools:
https://access.redhat.com/site/solutions/173863
Tuned:
https://access.redhat.com/site/documentation/en-US/Red_Hat_Enterprise_Linux/6/html/Power_Management_Guide/Tuned.html
Seekwatcher & blktrace:
https://access.redhat.com/site/documentation/en-US/Red_Hat_Enterprise_Linux/6/html/Performance_Tuning_Guide/ch06s03.html
Process schedulers:
https://access.redhat.com/site/documentation/en-US/Red_Hat_Enterprise_MRG/2/html/Realtime_Reference_Guide/chap-Realtime_Reference_Guide-Priorities_and_policies.html
Memory tuning:
https://access.redhat.com/site/solutions/16995
Numad:
https://access.redhat.com/site/articles/223693
RPS and RSS:
https://access.redhat.com/site/solutions/62869


THANK YOU!

Rodrigo Freire
[email protected]
http://people.redhat.com/rfreire/cce-perftun-bsb.pdf
