掌握效能 新一代開放式高效伺服器 · 2015-08-17 · •IBM Cloud Provisioning •IBM...
Transcript of 掌握效能 新一代開放式高效伺服器 · 2015-08-17 · •IBM Cloud Provisioning •IBM...
掌握效能 - 新一代開放式高效伺服器
張名安
Linux銷售打敗Windows!奪2014在臺伺服器OS市占龍頭
根據IDC發布最新的臺灣伺服器市場研究結果,Linux作業系統的營收成長幅度高得嚇人,在2014年的市占率首度擊敗Windows,竄升成為銷售第一大的作業系統平臺,營收衝破1億6千萬美元,搶下了超過4成的市占率。
OpenPOWER Foundation 聯盟日益成長
Boards / Systems
I/O / Storage / Acceleration
Chip / SOC
System / Software / Services
Implementation / HPC / Research
Complete member list at www.openpowerfoundation.org
Power平臺的開放優勢
• 開放的管理框架,開放的系統,開放的核心技術
Linux and
POWER 處理器發展藍圖
2004 2007 2010 2014
POWER7/7+45/32 nm
POWER822nm
Eight CoresOn-Chip eDRAMPower-Optimized CoresMemory Subsystem ++SMT++Reliability +VSM & VSXProtection Keys+
POWER6/6+65/65 nm
Dual CoreHigh Frequencies Virtualization +Memory Subsystem +AltivecInstruction RetryDynamic Energy MgmtSMT +Protection Keys
POWER5/5+130/90 nm
Dual CoreEnhanced ScalingSMTDistributed Switch +Core Parallelism +FP Performance +Memory Bandwidth +Virtualization
More CoresSMT+++Reliability ++Huge bandwidthPCIe Gen3 I/OFPGA SupportTransactional MemoryPCIe Acceleration
POWER9Extreme Analytics
OptimizationExtreme Big Data
OptimizationOn-chip accelerators
Power Security優勢
• The PowerVM hypervisor has
never had a reported security
vulnerability and provides the
bullet-proof security that customers
demand for mission-critical
workloads
• Dare to compare – search any
security tracking DB and compare
Power against x86
Search term or hypervisor(unfiltered)
NIST NVDResults 2013
NIST NVDResults 2015
ProcesorArchitecture
VMware 640 823 X86
Xen 153 245 X86
VMware ESX 95 107 X86
KVM 58 91 X86
VMware vSphere 48 54 X86
Windows Server 2012 43 133 X86
Oracle VM 24 58 X86
Hyper-V 3 5 X86
PowerVM 0 0 POWER
Source: National Vulnerability database Mar 2015
https://web.nvd.nist.gov/view/vuln/search
NVD is the U.S. government repository of standards based vulnerability management data. This data enables automation of vulnerability management, security
measurement, and compliance (e.g. FISMA).
Common Vulnerabilities and Exposures (CVE® ) is a dictionary of common names (i.e., CVE Identifiers) for publicly known information security vulnerabilities.
CVE’s common identifiers make it easier to share data across separate network security databases and tools, and provide a baseline for evaluating the coverage
of an organization’s security tools. If a report from one of your security tools incorporates CVE Identifiers, you may then quickly and accurately access fix
information in one or more separate CVE-compatible databases to remediate the problem.
Still 0reported security
breeches on the
PowerVM hypervisor
0
5000
10000
15000
20000
25000
POWER S824
2s/24c/192t
IBM POWER8
Fujitsu RX300 S8
2s/24c/48t
Intel Xeon Ivy Bridge
ERP – SAP 2-Tier (Users)
2.1x Performance
1) Results are based on best published results on Xeon E5-2697 v2 from the top 5 Intel system vendors.
2) SAP results are based on the two-tier SAP SD standard application benchmark running SAP enhancement package 5 for the SAP ERP 6.0 application. Results valid as of June 2, 2014. Source: http://www.sap.com/benchmark
3) SPECjbb2013 results are valid as of 7/7/2014. For more information go to http://www.specbench.org/jbb2013/results
4) SPECcpu2006 results are submitted as of 4/22/2014. For more information go to http://www.specbench.org/cpu2006/results/
0
20000
40000
60000
80000
100000
120000
140000
160000
180000
POWER S824
2s/24c/192t
IBM POWER8
Cisco UCS C240 M3
2s/24c/48t
Intel Xeon Ivy Bridge
Java – SPECjbb2013 (Max-
jOPS) 2.7x Performance
0
200
400
600
800
1000
1200
1400
1600
1800
2000
POWER S824
2s/24c/192t
IBM POWER8
Dell PowerEdge
T620
2s/24c/48t
Intel Xeon Ivy
Bridge
SPECint_rate2006
1.8x Performance
0
200
400
600
800
1000
1200
1400
1600
SPECfp_rate2006
2x Performance
Dell
PowerEdge
T620
2s/24c/48t
Intel Xeon
Ivy Bridge
AIX LoP (RHEL)POWER S824
2s/24c/192t
IBM POWER8
AIX LoP (RHEL)
Nearly Equal
Performance
with both
AIX and LINUX
on POWER8
Nearly Equal
Performance
with both
AIX and LINUX
on POWER8
相對 24 核 Intel Xeon 伺服器,基準測試值性能提升最多 2.7 倍
IBM POWER8 伺服器性能之王者
PowerLinux vs X86Linux: 客戶優勢
PowerLinux vs X86Linux: 客戶優勢
Power大資料,企業級的大資料——讓大數據”快”起來
開源的BigData 企業級的BigData
HDFS
MapReduce
HBase
Hive
Impala
HBase增強
Hive增強
BigSQL
PlatformSymphony 資料處理
計算密集型工作負載( SOA )
資料密集型工作負載( M / R )
分散式檔案系統(GPFS-FPO)
流式數據即時處理InforSphere Streams
Linux on Power
Storm
數據”公務艙”
Model GL4
Analytics and CloudModel GL6
PetaScale Storage
Model GL2
Analytics Focused
Model GS1 Model GS2 Model GS4 Model GS6
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24
FC 5887
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24
FC
5887
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24
FC
5887
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24
FC
5887
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24
FC
5887
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24
FC
5887
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24
FC
5887
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24
FC
5887
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24
FC
5887
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24
FC
5887
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24
FC
5887
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24
FC
5887
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24
FC
5887
Elastic Storage Server Model GS
Elastic Storage Server Model GL
併發查詢數 20 50 100 150 200
E公司平均查詢時間 (s) 1.98 3.47 5.6 7.098 9.59
IBM平均查詢時間(s) 0.85 1.31 1.88 2.47 3.4
IBM / E公司 (per core) 6.96 7.95 8.91 8.64 8.46
More than6.96x faster per core
ESS彈性存儲,拓展了大資料的基礎,具備大容量,低延時,可彈性擴展等特性
基於GPFS增強的文件處理及管理功能,Power發揮了SNC集群的超強處理能力
IBM Power 創新結合 Flash 和 CAPI 來優化 NoSQL 應用Significant Cost Savings for In-Memory NoSQL Data Stores
The Market: Explosive growth of new mobile, social apps requiring lightening fast response at high volume
Enabled by in-memory NoSQL, Key Value Stores like Redis
Ordered (key, value) pairs provide type of in-memory, lightening fast distributed hash table
Plays an important role in many large websites
GitHub, Amazon, Facebook, Twitter & more…
The Issue: x86 memory limited by max RAM
• Scale-out x86 servers limited memory size
• Results in costly, complex infrastructure
Load Balancer
500GB Cache
Node
WWW
500GB Cache
Node500GB Cache
Node500GB Cache
Node1U x86 server (24)
512 GB memory
The POWER8 + CAPI Flash as RAM Advantage:
• New FLASH as RAM for Redis in-memory apps
• Provides means for large FLASH exploitation
• Lower cost memory, greater workload density
• Dramatically reduce costs to deliver services
• Can be offered as a cloud-based service or as
an on-premise solution for enterprises
24:1 server
consolidation3
Up to 3x
lower TCA
24U
Power S822L/S812L
Ubuntu 14.10
FlashSystem 840
2TB to 40 TB Flash
WWW
4U
The Solution: POWER8 + CAPI FLASH as RAM
- Up to 40 TB in 4U
極速三劍客:CAPI+ Flash+ Redis
Traditional
PCIeDrivers
20,000Instructions
POWER8
CAPI
< 500Instructions
TraditionalPCIe
CAPIDevice
NoSQL on Power——讓IO延遲”低”起來指令
1/40
IOPS6x
延遲0.5x
Java on power——讓Java會話”併發”起來
• 靈活的大記憶體頁特性,使得記憶體消耗型Java應用能夠更高效的利用記憶體。
• IBM JVM針對Power平臺的專門優化,使得Java應用在PowerLinux平臺能發揮更好的性能。
• 併發多執行緒技術(SMT4/SMT8)使得多執行緒Java
應用能獲得非常好的併發性能。
• Java指令預取(pre-fetch)技術,大大提高指令執行效率。
• PowerVM在虛擬化性能方面的優勢,保障Java應用在虛擬化環境中能獲得比x86更好的性能。
• IBM專業的技術團隊提供針對應用的性能優化,保障不同應用在PowerLinux平臺能獲得最佳性能。
E5-2600v3 E7-x800 v2 POWER 7+ POWER8
SMT options 1, 2* 1, 2* 1, 2, 4 1, 2, 4, 8
Memory Bandwidth
51.2-68.3 GB/s 68-85GB/s** 100 – 180 GB/sec230 - 410 GB/sec
1 Performance comparisons based on published SPECjbb2013 results as of October 1, 2014http://www.spec.org/jbb2013/results/ . All IBM benchmark results will be submitted to spec.org on October 15, 2014.
• POWER8 demonstrates the fastest
Java code1 performance in the
industry- 2.84xthan E5-2699 v3
- 3.92xthan E7-4890 v2
- 4.53xthan Oracle T5
JPA
AME
Power Java stack•Power處理器本身的性能優勢,使得CPU密集型Java應用獲得更好的性能。
- 8核(Power7/7+)/12核(Power8)封裝- 高主頻(Power7+ 4.2GHz/Power8 4GHz)
- 嵌入式L3 cache( Power7+ 80M/ Power8 96M)- 併發多執行緒技術SMT4(Power7/7+)/SMT8(Power8)- 更高的記憶體通道頻寬( Power7/7+ 100GB/ Power8 230GB)
Virtualization on Power——讓資源”活”起來
• Power Cloud /Resource pool
–基礎虛擬化(資源池)
–Entry Cloud
–Advanced Cloud
• IBM提供更加高效靈活的虛擬化–更高的資源利用率
–動態的資源調配
–開放標準的生態環境Openstack支持
PowerVM,PowerKVM,Docker*支持
Unix/Linux mixed資源池支援
Number of Virtual Servers On the Physical
Server(32cores)
% C
PU利用率
0
5
10
15
20
25
30
35
1 2 3 4
20.04
15.06
9.86
5.225.75
12.806
22.8
31.07
IBM Power 750Competitor
資源浪費
3% CPU overheadwith 2 VMs
11% CPU overheadwith 4 VMs
7% CPU overheadwith 3 VMs
Orchestrate: Advanced Cloud
• IBM Cloud Provisioning
• IBM Cloud Orchestrator
Automate: Entry Cloud
• IBM SmartCloud Entry for Power*
• IBM Cloud Monitoring
• IBM Cloud Cost Management
Integrate: VIRTUALIZED FOUNDATION
• From Entry to Enterprise, pre-configured solutions* built on POWER7+ andLinux w/PowerKVM and POWER8
• PowerVM, PowerVC1*
• Systems Director / VMControl*
• Leverage PowerSC for extra security
• Utilitze PowerHA for increased availability
Ad
va
ncin
g C
lou
d C
ap
ab
ilitie
s
Rapid provisioning, scalability, and advanced image management for virtualized environments. Orchestrate the development, deployment, and management of enterprise clouds
Basic cloud functions including simple self-service interface and infrastructure with automated provisioning
* included in one-button e-config pre-configured Power Cloud Solutions1 All statements regarding IBM future direction and intent are subject to change or withdrawal without notice, and represent goals and objectives only.
Flexibility Factors VMware vSphere 4 & 5 PowerVM
動態CPU資源 Yes Yes
動態記憶體資源 Add(but not Remove) Yes
動態IO資源 Some (network only) Yes
VM直接IO設備訪問 Some (with recent Intel Xeon) Yes
Hypervisor I/O隔離 No Yes
虛擬化實現方式 3rd Party SoftwareBuilt-in
Firmware
高效的資源
動態的資源開放標準的資源
© 2014 International Business Machines Corporation
For ComputingFor resourceFor Java
Linux on Power主要應用場景和方案
• Scenario: Bigdata
• Product/Solution–處理節點:P7+/P8 Powerlinux node
–資料節點:ESS storage server
–Openpower node(TBD)
• Advantage–4+ performance
–enterprise Symphony and GPFS
15
Bigdata HANA NoSQL
HPCLinux resource poolPowrelinux application server
• Scenario: Compete with X86 HANA
• Product/Solution
–Powerlinux for HANA
• Advantage
–Performance
–CAPI accelerate(TBD)
• Scenario: Redis NoSQL server
• Product/Solution
–Power/CAPI/Flash
• Advantage
–12x Flash capacity than memory.
–High IOPS and low latency like
memory.
• Scenario: Java application server
• Product/Solution:
–P7+/P8 Powerlinux node
• Advantage
–2-4x thread
–IBM Native JVM
–Java performance advisor
• Scenario: Linux resource pool
• Product/Solution
–PowerVM/PowerKVM based
dynamice linux resource pool.
• Advantage
–Embedded enterprise VM
–Dynamic resource allocation
across LDP and IFL
• Scenario: HPC
• Product/Solution
–S824L for FNC and Oil HPC
–AIX Power for meteorological HPC
–ESS use for IO node.
• S824L HPC server and ESS.
• Advantage
–Power floating performance.
• Leverage Power performance, Mem and IO bandwidth and CAPI to improve Bigdata/HANA/NoSAL capacity
快 寬 低
並 活 合
For Data
POWER8 Scale-out Servers –
Innovation to Put Data to Work
POWER8 (1 and 2 Socket) systems
Expanded Linux focus: Linux, KVM, and OpenStack
OpenPOWER Innovations
1 & 2 Socket Power Systems
S812L S822L S822 S814 S824L S824
• 1-socket, 2U• Up to 12 cores• 512 GB Memory• 6 PCIe Gen 3• Linux only• PowerVM &
PowerKVM• CAPI support• August 29, 2014
• 2-socket, 2U• Up to 24 cores• 1 TB memory• 9 PCIe Gen3• Linux only• PowerVM &
PowerKVM• CAPI support• June 10, 2014
• 2-socket, 2U• Up to 20 cores• 1 TB memory• 9 PCIe Gen 3• AIX & Linux• PowerVM• CAPI support• June 10, 2014
• 1-socket, 4U• Up to 8 cores• 512 GB memory• 7 PCIe Gen 3• AIX, IBM i, Linux• PowerVM• CAPI support• June 10, 2014
• 2-socket, 4U• Up to 24 cores• Linux only• NVIDIA GPU
accelerator • October 31, 2014
• 2-socket, 4U• Up to 24 cores• Up to 2 TB memory• 11 PCIe Gen 3• AIX, IBM i, Linux• PowerVM• CAPI support• December, 2014 for 2
TB of Memory• June 10, 2014 for
1 TB Memory
New Features!
Scale-out
強 強
POWER8 Processor Is Purpose Built Which Help Deliver Premium Performance
* Intel calls this Hyper-Threading Technology (No HT and with HT)
*32KB running in “Non-RAS mode” 16KB results in better RAS
**85GB running in “Non-RAS mode” and dual-device error NOT supported
1 = The Micro-architecture features above contributed to premium performance for the POWER8 systems compared to Ivy Bridge systems
Sandy Bridge EP
E5-x6xx
Ivy Bridge EP
E5-26xx v2
Ivy Bridge EX
E7-88xx v2
Haswell EP
E5-26xx v3
POWER 7+ Systems
POWER8
Clock rates (GHz)
1.8–3.6 1.7-3.7 1.9-3.4 1.6-3.5 3.1–4.4 GHz 3.0-4.1 GHz
SMT options 1,2* 1, 2* 1, 2* 1, 2* 1, 2, 4 1, 2, 4, 8
Max Threads / sock
16 24 30 36 32 96
Max L1 Cache 32KB 32KB* 32KB* 64 KB 32KB 64KB
Max L2 Cache 256 KB 256 KB 256 KB 256KB 256 KB 512 KB
Max L3 Cache 20 MB 30 MB 37.5 MB 45 MB 80 MB 96 MB
Max L4 Cache 0 0 0 0 0 128 MB
Memory Bandwidth
31.4-51.2 GB/s
42.6-59.7 GB/s
68-85**
GB/s
51-68
GB/s
100 – 180 GB/sec
230 - 410 GB/sec
Power Performance優勢
• X86對市場宣稱的性能往往基於其頂級規格處理器的性能基準測試結果,例如E5-2699v3或E7-4890v2.
• POWER處理器在SPECcpu,SPECjbb,SAP 2-tier等多項測試中基本保持對X86頂級規格處理器2倍以上的領先優勢.
–兩倍以上性能於頂級X86處理器
–四倍以上性能于優秀X86處理器
–六倍以上性能于普通X86處理器
–八倍以上性能於一般X86處理器
X86 E7 Cores GHz X86 E5 Cores GHz
E7-4890 v2 15 2.8 E5-2699 v3 18 2.3
E7-4880 v2 15 2.5 E5-2698 v3 16 2.3
E7-4870 v2 15 2.3 E5-2697 v3 14 2.6
E5-2695 v3 14 2.3
E7-4860 v2 12 2.6 E5-2690 v3 12 2.6
E5-2680 v3 12 2.5
E7-4850 v2 12 2.3 E5-2670 v3 12 2.3
E5-2660 v3 10 2.6
E7-4830 v2 10 2.2 E5-2650 v3 10 2.3
E5-2640 v3 8 2.6
E5-2630 v3 8 2.4
E7-4820 v2 8 2.0 E5-2620 v3 6 2.4
E5-2609 v3 6 1.9
E7-4809 v2 6 1.9 E5-2603 v3 6 1.6
E5-2699v3 E7-4890v2 POWER8/3.5GHz 性能倍數
# Cores 36 60 24
SAP 2-Tier 16500 25000 21212 1.9-2.1
SPECjbb2013 (max-jOPS)
195119 274000 361293 2.7-3.29
SPECint_rate2006 1430 2400 1750 1.8
SPECfp_rate2006 965 1770 1370 1.9-2.1
1) IBM Power System S824 on the two-tier SAP SD standard application benchmark running SAP enhancement package 5 for the SAP ERP 6.0 application;
4 processors / 24 cores / 96 threads, POWER8; 3.52GHz, 512 GB memory, 21,212 SD benchmark users, running AIX® 7.1 and DB2® 10.5, Certification
# 2014016. Source: http://www.sap.com/benchmark All results valid as of October 3, 2014
2) Dell PowerEdge R730, on the two-tier SAP SD standard application benchmark running SAP enhancement package 5 for the SAP ERP 6.0 application; 2
processors/36 cores/72 threads, Intel Xeon Processor 2699v3; 2.30 GHz, 256 GB memory; 16,500 SD benchmark users, running RHEL 7 and SAP ASE
16; Certification # 2014033. Source: http://www.sap.com/benchmark.
3) SPECcpu2006 results are submitted as of 9/8/2014. For more information go tohttp://www.specbench.org/cpu2006/results/
4) SPECjbb2013 results are submitted as of 10/15//2014. For more information go tohttp://www.specbench.org/jbb2013/results
5) SPECjEnterprise2010 results are valid as of 9/8/2014. For more information go tohttp://www.specbench.org/jEnterprise2010/results/
6) Oracle eBS 12.1.3 Payroll Batch Extra Large Kit and are current as of 3/24/2014. For more information go
tohttp://www.oracle.com/us/solutions/benchmark/apps-benchmark/results-166922.html
7) Siebel 8.1.1.4 PSPP Kit and are current as of 3/24/2014. For more information go tohttp://www.oracle.com/us/solutions/benchmark/white-papers/siebel-
167484.html
E5-2600v3 E7-x800v2 POWER 7+ POWER8
Clock rates 1.7-3.7GHz 1.9-3.4 GHz 3.1-4.4 GHz 3.0-4.15 GHz
SMT options 1, 2* 1, 2* 1, 2, 4 1, 2, 4, 8
Cores per socket 18 15 8 12
Max L1 Cache 32KB* 32KB* 32KB 64KB
Max L2 Cache 256 KB 256 KB 256 KB 512 KB
Max L3 Cache 45 MB 37.5 MB 80 MB 96 MB
Max L4 Cache 0 0 0 128 MB
Memory Bandwidth
51.2-68.3 GB/s 68-85** GB/s 100 – 180 GB/sec 230 - 410 GB/sec
• 技術創新驅動的Power領先優勢– 更高的處理器頻率
– 2-4倍的執行緒數以及L1/L2/L3緩存容量
– 獨特的記憶體控制器技術和L4緩存
– 2倍以上的記憶體頻寬
– 事物記憶體
– SMP互聯
– CAPI(一致性加速處理介面)技術
POWER8 系統-為新興工作負載及開放平臺而生
“數”之本• POWER8 創建更為高效和經濟的雲平臺
- 執行緒 : 動態ST & 並行多執行緒SMT2/4/8 提供了更為高效的混合雲計算平臺並極大地減少了軟體許可權上的費用
- PowerLinux KVM Micro-Threading技術使得虛擬機器使用率提升顯著,4倍於VM密度上的性能
- 加強電力管理,助力客戶構建綠色資料中心
- PowerKVM 為客戶在PowerLinux伺服器上提供了更為開放的虛擬化選擇
“雲”之根
• POWER8 為大資料分析應運而生 - 融合動態ST & 並行多執行緒SMT2/4/8, 記憶體壓縮
- Transactional memory 技術極大地降低了大量串列指令的等待時間,提高系統的整體性能
- PCIe-3 顯著增加了頻寬,減少了I/O延時,比Gen2技術提升了2倍的性能
OpenPOWER 基金會• 利用OpenPOWER基金會-唯一開放的伺服器平臺來打破伺服
器研發的壁壘
• CAPI與 FPGA 技術結合,為ISV及合作夥伴基於POWER8的應用開發提供優化環境
Throughput differentiation through increased threading
Single thread differentiation through:
– Large frequency boost with power-gating
– ECO mode for more cache, memory
Transactional Memory in POWER8
• 對Little Endian的支持 – 更容易從x86移植到Power 平臺
#EJ0L
Huge cache
PCIe Gen3 SAS RAID Adapters
- 低延時,連貫地訪問整個系統記憶體
- PCIe 槽位上的加速器實現高頻寬的需求 (x16 PCIe Gen 3)
- 全功能轉換及虛擬化的能力 (虛擬位址能力)
POWER8
CAPP
Coherence Bus
CustomHardware
Application
PSL
FPGA or ASIC