vroc-vs-hba-performance-comparison
(Intel® VROC)
Detailed Comparison to RAID HBA
Notices and Disclaimers
Performance varies by use, configuration and other factors. Learn more at www.Intel.com/PerformanceIndex.
Performance results are based on testing as of dates shown in configurations and may not reflect all publicly available updates. See backup for configuration details. No
product or component can be absolutely secure.
© Intel Corporation. Intel, the Intel logo, and other Intel marks are trademarks of Intel Corporation or its subsidiaries. Other names and brands may be claimed as the
property of others.
Agenda:
1. Architecture and Feature Comparison
2. Key findings
3. Intel® Optane™ SSD Comparisons
4. Test Configuration Details
5. Pass-thru Mode (No RAID) Comparison
6. RAID0/1/5/10 Performance Results
7. Detailed RAID0/5 Review (Latency, CPU%, Power)
Intel® VROC onboards RAID HBA functionality onto Intel® Xeon® CPUs¹
¹ Intel VROC and Intel VMD are available on all generations (Gen. 1, 2, and 3) and SKUs (Bronze, Silver, Gold, and Platinum) of Intel Xeon Scalable Processors
Intel Optane Group 5
Intel® VROC vs RAID HBA
Major RAID Features | HW RAID HBA | Intel® VROC | Comment
Error Handling/Isolation | √ | √ | Both architectures isolate SSD error/event handling to reduce OS crash/reboot
Reliable data storage | √ | √ | Enterprise data protection, even when power loss occurs
In-band Management Tools | √ | √ | Various UEFI, GUI, and CLI utilities for each
Out-of-band RAID Config. | √ | X | Intel VROC has OOB on roadmap for upcoming releases
Full NVMe SSD x4 Bandwidth | X | √ | Intel VROC + Intel VMD allows full x4 access to SSDs, no HW uplink
RAID Processing Location | On HBA | On Intel® Xeon | Uses powerful Intel® Xeon® CPU to RAID the fast NVMe* SSDs. Better scaling for heavy workloads (see Detailed CPU Review)
Supported RAID Levels | 0/1/5/6/10/50/60 | 0/1/5/10 | RAID6/50/60 not needed for perf./AFR of NVMe SSDs
Write back cache | DRAM + BBU | Integrated Caching + Intel® Optane™ SSD | Replace DRAM WB Cache + BBU with persistent Intel® Optane™ media
SED Key Management | On HBA | Platform Integrated | Intel VROC uses platform protocols and remote KMS to manage keys
Idle Power¹ | 577W | 562W | Tested 15W reduction in idle power usage with Intel VROC
¹ See backup for configuration details. For more complete information about performance and benchmark results, visit www.intel.com/benchmarks.
Key Findings
*CPU usage is measured as total platform CPU % consumption, which includes workload generation, storage stack (RAID) usage, and background activity.
It is reported as “Cores Used” = reported CPU% × number of cores on the system (64 cores).
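The "Cores Used" definition in the footnote can be sketched as a small helper. This is only an illustration: the function name `cores_used` is hypothetical, the 12.5% reading is made up for the example and is not a measured value from the deck, and the helper assumes CPU% is reported on a 0 to 100 scale.

```python
def cores_used(cpu_percent: float, total_cores: int = 64) -> float:
    """Convert a platform-wide CPU utilization percentage into 'cores used'.

    total_cores defaults to 64, the core count of the test system.
    Assumes cpu_percent is on a 0-100 scale.
    """
    if not 0.0 <= cpu_percent <= 100.0:
        raise ValueError("cpu_percent must be between 0 and 100")
    return cpu_percent / 100.0 * total_cores


if __name__ == "__main__":
    # Hypothetical reading, not a value taken from the slides.
    print(cores_used(12.5))
```

Under this reading, a platform-wide utilization of 12.5% on the 64-core system corresponds to 8 cores used.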
▪ 161% more IOPS on RAID0
▪ 50% more IOPS on RAID5
▪ Intel VROC RAID5 > HBA RAID10 performance

RAID0 provides higher performance metrics but with lower resource usage (CPU)
• ↑ IOPS
• ↓ Latency
• ↓ CPU Usage
• ↑ Bandwidth
[Figure: RAID0, VROC vs. HBA, reads/writes. Panels: IOPS (higher is better), Latency in µsec (lower is better), CPU Cores Used (lower is better), Bandwidth in MB/sec (higher is better). Legible data labels: IOPS 2,177,404; latency 111 and 43; cores used 9 and 8; bandwidth 22,760.]

[Figure: RAID5, VROC vs. HBA, reads/writes. Panels: IOPS (higher is better), Latency in µsec (lower is better), CPU Cores Used (lower is better), Bandwidth in MB/sec (higher is better). Legible data labels: IOPS 743,373; latency 461 and 362; cores used 7 and 3; bandwidth 4,488 and 2,973.]
• ↑ IOPS
• ↓ Latency
• ↑ CPU Usage*
• ↓ Bandwidth
*RAID5 uses 4 more cores but delivers up to 380K additional IOPS

Up to 50% more Read/Write IOPS
Up to 50% more Bandwidth
[Figure: 1x Pass-thru Comparisons and 4x Pass-thru Comparisons for 100% Writes, 70/30 R/W, and 100% Read. Left axes: IOPS (higher is better); right axes: average latency in µsec (lower is better). Series: VROC and HBA pass-thru IOPS, VROC and HBA pass-thru average latency.]
Intel VROC provides unimpeded access to storage for lower latency I/O
▪ Single Drive, 100% Write: {40% IOPS ↑, 32% Latency ↓}
▪ Single Drive, 100% Read: {29% IOPS ↑, 23% Latency ↓}
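As a rough plausibility check on the two bullets above, the IOPS gain and the latency drop can be multiplied together. If the number of in-flight I/Os were the same for the VROC and HBA runs (an assumption; the slide does not state the queue depth here), the product of IOPS and average latency should stay about constant, so the ratio below should land near 1.0. The helper name is hypothetical.

```python
def iops_latency_ratio(iops_gain_pct: float, latency_drop_pct: float) -> float:
    """Ratio of (IOPS x latency) for VROC relative to HBA.

    A value near 1.0 is what one would expect if both runs kept the same
    number of I/Os in flight (assumed here, not stated in the deck).
    """
    return (1 + iops_gain_pct / 100) * (1 - latency_drop_pct / 100)


# Percentages taken from the single-drive bullets above.
for label, gain, drop in [("100% Write", 40, 32), ("100% Read", 29, 23)]:
    print(f"{label}: {iops_latency_ratio(gain, drop):.3f}")
```

Both ratios come out within about 5% of 1.0, so the quoted IOPS and latency changes are mutually consistent under the constant-queue-depth assumption.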
[Figure: 1x Pass-thru Comparisons and 4x Pass-thru Comparisons. Left axes: IOPS (higher is better); right axes: CPU Cores Used (lower is better). Series: VROC and HBA pass-thru IOPS, VROC and HBA pass-thru CPU Cores Used.]
IO | 1x WΔ | 4x WΔ
70/30 | 17W | 30W
Read | 22W | 46W
[Figure: two panels of IOPS by RAID level (RAID0, RAID5, RAID10, RAID1).]
Intel VROC Read Performance scales to maximum 4x SSD Spec (~2.8M IOPS RAID0/5/10)
▪ HBA hits 2.2M IOPS Bottleneck; Intel VROC delivers up to 27% more IOPS on RAID0/5/10 reads
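The "up to 27% more IOPS" figure follows from the rounded numbers in the two lines above. A minimal sketch of that arithmetic, using a hypothetical helper name and the rounded ~2.8M and 2.2M values rather than the exact chart labels:

```python
def percent_more(new: float, baseline: float) -> float:
    """Relative gain of `new` over `baseline`, in percent."""
    if baseline <= 0:
        raise ValueError("baseline must be positive")
    return (new - baseline) / baseline * 100.0


# Rounded values from the text: ~2.8M IOPS (VROC) vs. 2.2M IOPS (HBA).
print(round(percent_more(2.8e6, 2.2e6), 1))  # 27.3
```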
Intel VROC RAID0/5 reads provide:
• ↑ IOPS
• ↓ Latency
• ↓ Power Consumption
[Figure: RAID0 and RAID5 reads, VROC vs. HBA. Panel: IOPS (higher is better); legible data labels: 2,811,404; 2,808,785; 2,255,698; 2,196,795. The remaining panels did not survive extraction.]
[Figure: IOPS and CPU Cores Used for 1x Pt, 4x Pt, RAID0, and RAID5, two panels. Series: VROC IOPS, HBA IOPS, VROC CPU Cores Used, HBA CPU Cores Used.]
HOW?
See backup for configuration details. Results may vary
[Figure: RAID5 64-Thread and RAID5 256-Thread, VROC vs. HBA. Panel: IOPS (higher is better); legible data labels: 219,960; 214,286; 161,031; 161,029.]
RAID5 writes require high CPU%
• Highest per IOP of any Intel VROC supported RAID level
RAID HBA offload generates iowait at higher workloads:
• If the limits of the HBA architecture are reached (more IO), host CPU usage ramps up in iowait%
• Iowait could be wasted cycles, depending on the application
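One way to observe the iowait% mentioned above on a Linux host is to difference two samples of the aggregate `cpu` line in `/proc/stat`. The sketch below is an illustration only: the function name and the two sample lines are hypothetical, and it is not the measurement method used for the results in this deck.

```python
def iowait_percent(before: str, after: str) -> float:
    """Share of CPU time spent in iowait between two aggregate 'cpu' lines.

    Each argument is the first line of /proc/stat, whose fields are:
    user nice system idle iowait irq softirq steal guest guest_nice.
    """
    def fields(line: str) -> list[int]:
        parts = line.split()
        if not parts or parts[0] != "cpu":
            raise ValueError("expected the aggregate 'cpu' line")
        return [int(x) for x in parts[1:]]

    deltas = [b - a for a, b in zip(fields(before), fields(after))]
    # Guest time is already counted inside user/nice, so sum the first 8 only.
    total = sum(deltas[:8])
    if total <= 0:
        raise ValueError("samples must be taken at different times")
    return deltas[4] / total * 100.0


# Hypothetical samples, not captured from the test system.
BEFORE = "cpu 1000 0 500 8000 200 0 50 0 0 0"
AFTER = "cpu 1400 0 700 8550 600 0 100 0 0 0"
print(iowait_percent(BEFORE, AFTER))
```

With these made-up samples, 400 of the 1,600 elapsed ticks are iowait, so the helper reports 25%. On a real system the two lines would be read from `/proc/stat` a fixed interval apart.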