IDF Session Presentation Template

Transcript of IDF Session Presentation Template

Page 1: IDF Session Presentation Template

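Inspired by silicon, driven by action, creating the future together!

Media Cloud Based on Intel® Graphics Virtualization Technology (Intel® GVT-g) and OpenStack*

Xiao Zheng – Software Engineer, Intel Corporation

SFTS002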

Page 2: IDF Session Presentation Template

Agenda

• Media cloud

• Media cloud infrastructure

• Case study: virtualized media server

• Optimizing for the virtualized media server

Page 3: IDF Session Presentation Template

Media Cloud

Page 4: IDF Session Presentation Template

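Media Processing Opportunity

Internet video traffic is projected to grow at a compound annual rate of 29% and to account for 69% of consumer traffic by 2017.

Source: Cisco* Systems Inc., Visual Networking Index (VNI), 2013, 2015

[Chart: Consumer Internet traffic, in thousands of petabytes per month. Series: Internet video; file sharing; Web, email, and data; online gaming.]

[Chart: China online video industry market size, 2011-2018. Series: online video market size (hundred millions); year-over-year growth (%).]

Source: 2015.1 iResearch Inc., www.iresearch.com.cn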

Page 5: IDF Session Presentation Template

Industry Outlook

Video delivery: storage/streaming, transcoding; offline/real-time processing; OTT, VOD, IPTV

Video analytics: search, surveillance

Page 6: IDF Session Presentation Template

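Media Cloud

• Flexibility

• Manageability

• Provisioning

• High availability

• Scaling on standard hardware

Video delivery: storage/streaming, transcoding; offline, real-time

Video analytics: search, surveillance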

Page 7: IDF Session Presentation Template

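Media Cloud: Media Delivery Example

1. Video files are stored in cloud storage.

2. Media is transcoded before delivery to the CDN.

3. Media is transcoded for clients at edge devices.

[Diagram labels: content provider; content delivery network (CDN); service provider; transport edge (RAN, IP, cable); data center; edge cloud; OTT, VOD, offline video …; real-time, video conferencing, video awareness; SDN, NFV; servers; VMs; laptop.]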

Page 8: IDF Session Presentation Template

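Technical Barriers

• Without GPU acceleration, transcoding throughput on low-cost CPUs is very low.

• DSP solutions are difficult to integrate into a cloud environment.

• No GPU virtualization.

• No cloud orchestration.

• No GPU instance awareness, no vGPU scheduling, and no vGPU resource monitoring.

Media cloud based on the Intel® graphics virtualization solution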

Page 9: IDF Session Presentation Template

Media Cloud Infrastructure

Page 10: IDF Session Presentation Template

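Basic Architecture Diagram

[Diagram; recoverable labels follow.]

Cloud infrastructure: OpenStack* Nova controller; OpenStack network node; OpenStack storage/image; host OS or VM

GPU instances (guest VMs): Ubuntu* 14.04 LTS; Windows* 7 64-bit; Windows 8 32-bit

Media server workloads: video conferencing; real-time transcoding; VoD streaming

Guest software: Intel® HD Graphics driver; Intel® Media SDK

Hypervisors: Xen* Hypervisor (XenGT); KVM Hypervisor (KVMGT)

Hardware: Intel® Xeon™ E3 processor; 4th-generation Intel® Core™ i7 processor

Legend: ISV/non-Intel; Intel hardware; Intel component; includes Intel contributions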

Page 11: IDF Session Presentation Template

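GPU Virtualization Requirements

• Performance: direct GPU acceleration

• Functionality: compatibility with multiple standards

• Sharing: sharing across multiple VMs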

Page 12: IDF Session Presentation Template

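GPU Virtualization Approaches

Intel® Graphics Virtualization Technology (Intel® GVT)

API forwarding (Intel® GVT-s)
  Pros: high performance, sharing
  Cons: no media processing/GPGPU, compatibility limitations

Direct pass-through (Intel GVT-d)
  Pros: performance, functionality
  Cons: no sharing

Full GPU virtualization (Intel GVT-g)
  Pros: performance, functionality, sharing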

Page 13: IDF Session Presentation Template

Intel® Graphics Virtualization Technology (Intel® GVT-g)

Intel® GVT-g supports vGPU-based sharing.

• Intel® GVT-g for Xen* (XenGT)

• Intel® GVT-g for KVM (KVMGT)

Performance (relative to native): 3DMark: 80%; H.264 transcoding: 90%

Features: runs native drivers; DirectX* 11.1; OpenGL* 4.2; OpenCL* 1.2; MediaSDK 16.2

Sharing: multiple VMs; Ubuntu* guests supported; Windows* 7 x32/x64 supported; Windows 8 x32/x64 supported

Page 14: IDF Session Presentation Template

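Orchestration for the Media Cloud

• Middleware extension
  - Support for the Intel® Graphics Virtualization Technology API (libvirt)

• OpenStack* extensions
  - GPU instance (flavor)
  - GPU-aware scheduling: find matching vGPU capabilities; QoS
  - GPU resource monitoring and allocation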
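
As a rough illustration of the GPU-aware scheduling item above, the following Python sketch filters hosts by their monitored vGPU resources and picks one that matches the requested flavor. Every name here (`GpuFlavor`, `Host`, `pick_host`, `allocate`, and the capability strings) is hypothetical and is not part of the OpenStack or libvirt APIs. The sketch only assumes that each host reports a free vGPU count, free graphics memory, and a set of supported capabilities.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Set


@dataclass
class GpuFlavor:
    """Hypothetical GPU instance flavor: what a VM asks for."""
    name: str
    graphics_mem_mb: int
    capabilities: Set[str] = field(default_factory=set)


@dataclass
class Host:
    """Hypothetical view of a compute node's monitored vGPU resources."""
    name: str
    free_vgpus: int
    free_graphics_mem_mb: int
    capabilities: Set[str] = field(default_factory=set)


def host_matches(host: Host, flavor: GpuFlavor) -> bool:
    """Return True if the host can back one more vGPU of this flavor."""
    return (
        host.free_vgpus > 0
        and host.free_graphics_mem_mb >= flavor.graphics_mem_mb
        and flavor.capabilities <= host.capabilities
    )


def pick_host(hosts: List[Host], flavor: GpuFlavor) -> Optional[Host]:
    """Filter by vGPU capability, then prefer the most free graphics memory."""
    candidates = [h for h in hosts if host_matches(h, flavor)]
    if not candidates:
        return None
    return max(candidates, key=lambda h: h.free_graphics_mem_mb)


def allocate(host: Host, flavor: GpuFlavor) -> None:
    """Record the allocation so later decisions see the updated resources."""
    host.free_vgpus -= 1
    host.free_graphics_mem_mb -= flavor.graphics_mem_mb


hosts = [
    Host("node-a", free_vgpus=1, free_graphics_mem_mb=512,
         capabilities={"h264_encode"}),
    Host("node-b", free_vgpus=2, free_graphics_mem_mb=1536,
         capabilities={"h264_encode", "opencl"}),
    Host("node-c", free_vgpus=0, free_graphics_mem_mb=2048,
         capabilities={"h264_encode", "opencl"}),
]
flavor = GpuFlavor("media.transcode", graphics_mem_mb=1024,
                   capabilities={"h264_encode"})
chosen = pick_host(hosts, flavor)
```

In this toy data set only `node-b` has both a free vGPU and enough graphics memory. Calling `allocate(chosen, flavor)` afterwards models the resource-monitoring side: the next request for the same flavor finds no suitable host.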

Page 15: IDF Session Presentation Template

Case Study: Virtualized Media Server

Page 16: IDF Session Presentation Template

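Video Conferencing Use Case

4K video conferencing in the media-platform NFV demo (Mobile World Congress 2015)

[Diagram labels: 1080p camera; 4K camera; 4K display; OpenStack* media cloud; media server pool (x86 platforms with Intel® Processor Graphics); encoding, decoding, and composition on Intel graphics.]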

Page 17: IDF Session Presentation Template

Media-Plane Applications Move to Intel® Architecture

Joint Intel and Huawei* demo at MWC15

• H264/MPEG2/VC1/JPEG/MJPEG decoding

• H264/MPEG2 encoding/transcoding

• Video post-processing

Page 18: IDF Session Presentation Template

Resource Dashboard Monitoring

GPU resource monitoring

Page 19: IDF Session Presentation Template

Optimizing the Virtualized Media Server

Page 20: IDF Session Presentation Template

New Challenges

• Memory-intensive workloads

• Real-time processing

• Mixed media/3D workloads

These increase the GPU virtualization overhead!

Page 21: IDF Session Presentation Template

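Optimization

[Chart: 15ch-720p_2Mbps_transcoding throughput, axis 0 to 800 fps. Previously: 37.1 fps (5% of native). Optimized: 716.85 fps (95% of native). Native performance: 757 fps.]

Optimizations: increased system memory; cross-engine synchronization support; smart shadow GPU page tables

[Chart labels: 70%, 20%, 10%]

Configuration: i7-4770, Ubuntu* 14.04 LTS guest, 4 GB memory, 1.5 GB graphics memory, MediaSDK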
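
The percentages on this slide follow from its three throughput figures. The short Python check below uses only the numbers reported on the slide (the variable names are illustrative) to compute each result as a share of native throughput, along with the overall speedup.

```python
# Throughput reported on the slide for 15ch-720p 2 Mbps transcoding (fps).
native_fps = 757.0
previous_fps = 37.1
optimized_fps = 716.85

# Share of native performance before and after the optimizations.
previous_share = previous_fps / native_fps
optimized_share = optimized_fps / native_fps

# Overall speedup from the combined optimizations.
speedup = optimized_fps / previous_fps

print(f"previous:  {previous_share:.1%} of native")
print(f"optimized: {optimized_share:.1%} of native")
print(f"speedup:   {speedup:.1f}x")
```

The two shares round to the 5% and 95% labels on the chart, and the speedup works out to a little over 19x.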

Page 22: IDF Session Presentation Template

Performance Summary

Near-native performance!

[Chart: High-density workload: 1080p transcoding. Series: native, vMediaServer. Axis range: 0 to 450.]

[Chart: High-density workload: 480p transcoding at 1ch-D1, 5ch-D1, 10ch-D1, 20ch-D1, 25ch-D1, and 30ch-D1. Series: native, vMediaServer. Axis range: 0 to 1400.]

Configuration: i7-4770, Ubuntu* 14.04 LTS guest, 4 GB memory, 1.5 GB graphics memory, MediaSDK

Page 23: IDF Session Presentation Template

CPU Transcoding vs. GPU Transcoding

Intel GVT-g delivers a significant performance boost!

[Chart: vCPU vs. Intel® Graphics Virtualization Technology (Intel® GVT-g) total fps (1ch x 1080p MPEG2->720p H264 transcoding) at 1, 2, 4, and 6 vCPUs. Axes: fps throughput (0 to 180) and CPU utilization (0% to 120%). Series: IA software solution; Intel GVT-g based software solution; vCPU %; Intel GVT vCPU %. Data labels in extraction order: 22.579, 37.426, 66.11, 60.925, 159.448, 131.142, 137.496, 142.4.]

IA: Intel® architecture

Lab data. Configuration: i7-4770, Windows* 7 x64 guest, 4 GB memory, 1.5 GB graphics memory, MediaSDK

Page 24: IDF Session Presentation Template

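Summary

• The media processing industry is growing quickly year over year, creating a very large market opportunity.

• A media cloud built on Intel® Graphics Virtualization Technology (Intel® GVT-g) delivers near-native performance, flexibility, scalability, and relatively lower cost.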

Page 25: IDF Session Presentation Template

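For More Information

• You can download the PDF of this session's presentation from the technical session catalog on the IDF website, www.intel.com/idfsessionsSZ. It is also linked from the catalog's home page.

• Video recordings of all sessions will be available through the catalog after April 13.

• More online information:
  - Project: https://01.org/xen/blogs
  - Demo video: https://www.youtube.com/watch?v=V2i8HCcAnY8
  - Demo video 2: http://v.youku.com/v_show/id_XNzQ5MDg1MTg4.html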

Page 26: IDF Session Presentation Template

Other Technical Sessions

Session ID | Title | Day | Time | Room
DATS004 | High-Density Media Solutions with Intel® Quick Sync Video | Thursday | 14:30 – 15:30 | JING 景

Page 27: IDF Session Presentation Template

Legal Notices and Disclaimers

Intel technologies’ features and benefits depend on system configuration and may require enabled hardware, software or service activation. Learn more at intel.com, or from the OEM or retailer.

No computer system can be absolutely secure.

Tests document performance of components on a particular test, in specific systems. Differences in hardware, software, or configuration will affect actual performance. Consult other sources of information to evaluate performance as you consider your purchase. For more complete information about performance and benchmark results, visit http://www.intel.com/performance.

Cost reduction scenarios described are intended as examples of how a given Intel-based product, in the specified circumstances and configurations, may affect future costs and provide cost savings. Circumstances will vary. Intel does not guarantee any costs or cost reduction.

This document contains information on products, services and/or processes in development. All information provided here is subject to change without notice. Contact your Intel representative to obtain the latest forecast, schedule, specifications and roadmaps.

Statements in this document that refer to Intel’s plans and expectations for the quarter, the year, and the future, are forward-looking statements that involve a number of risks and uncertainties. A detailed discussion of the factors that could affect Intel’s results and plans is included in Intel’s SEC filings, including the annual report on Form 10-K.

The products described may contain design defects or errors known as errata which may cause the product to deviate from published specifications. Current characterized errata are available on request.

No license (express or implied, by estoppel or otherwise) to any intellectual property rights is granted by this document.

Intel does not control or audit third-party benchmark data or the web sites referenced in this document. You should visit the referenced web site and confirm whether referenced data are accurate.

Intel, Core and the Intel logo are trademarks of Intel Corporation in the United States and other countries.

OpenCL and the OpenCL logo are trademarks of Apple Inc. used by permission by Khronos.

*Other names and brands may be claimed as the property of others.

© 2015 Intel Corporation.

Page 28: IDF Session Presentation Template

Risk Factors

The above statements and any others in this document that refer to plans and expectations for the first quarter, the year and the future are forward-looking statements that involve a number of risks and uncertainties. Words such as "anticipates," "expects," "intends," "plans," "believes," "seeks," "estimates," "may," "will," "should" and their variations identify forward-looking statements. Statements that refer to or are based on projections, uncertain events or assumptions also identify forward-looking statements. Many factors could affect Intel's actual results, and variances from Intel's current expectations regarding such factors could cause actual results to differ materially from those expressed in these forward-looking statements. Intel presently considers the following to be important factors that could cause actual results to differ materially from the company's expectations.

Demand for Intel’s products is highly variable and could differ from expectations due to factors including changes in the business and economic conditions; consumer confidence or income levels; customer acceptance of Intel’s and competitors’ products; competitive and pricing pressures, including actions taken by competitors; supply constraints and other disruptions affecting customers; changes in customer order patterns including order cancellations; and changes in the level of inventory at customers. Intel’s gross margin percentage could vary significantly from expectations based on capacity utilization; variations in inventory valuation, including variations related to the timing of qualifying products for sale; changes in revenue levels; segment product mix; the timing and execution of the manufacturing ramp and associated costs; excess or obsolete inventory; changes in unit costs; defects or disruptions in the supply of materials or resources; and product manufacturing quality/yields. Variations in gross margin may also be caused by the timing of Intel product introductions and related expenses, including marketing expenses, and Intel’s ability to respond quickly to technological developments and to introduce new features into existing products, which may result in restructuring and asset impairment charges.

Intel's results could be affected by adverse economic, social, political and physical/infrastructure conditions in countries where Intel, its customers or its suppliers operate, including military conflict and other security risks, natural disasters, infrastructure disruptions, health concerns and fluctuations in currency exchange rates. Results may also be affected by the formal or informal imposition by countries of new or revised export and/or import and doing-business regulations, which could be changed without prior notice. Intel operates in highly competitive industries and its operations have high costs that are either fixed or difficult to reduce in the short term. The amount, timing and execution of Intel’s stock repurchase program and dividend program could be affected by changes in Intel’s priorities for the use of cash, such as operational spending, capital spending, acquisitions, and as a result of changes to Intel’s cash flows and changes in tax laws. Product defects or errata (deviations from published specifications) may adversely impact our expenses, revenues and reputation. Intel’s results could be affected by litigation or regulatory matters involving intellectual property, stockholder, consumer, antitrust, disclosure and other issues. An unfavorable ruling could include monetary damages or an injunction prohibiting Intel from manufacturing or selling one or more products, precluding particular business practices, impacting Intel’s ability to design its products, or requiring other remedies such as compulsory licensing of intellectual property. Intel’s results may be affected by the timing of closing of acquisitions, divestitures and other significant transactions. A detailed discussion of these and other factors that could affect Intel’s results is included in Intel’s SEC filings, including the company’s most recent reports on Form 10-Q, Form 10-K and earnings release.

Rev. 1/15/15

Page 29: IDF Session Presentation Template

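Media Cloud Summary

Media cloud requirement | Dedicated hardware (DSP, FPGA, ASIC) | Pure software | MSS on Intel GVT
Transcoding throughput | Good | Good, but too expensive | Good
Real-time and low latency | Good | Difficult, and too expensive | Good
Virtualization | Difficult | Easy | Easy
Cloud integration | Difficult | Easy | Easy
Development, operation, and upgrade cost | High | Low | Low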

Page 30: IDF Session Presentation Template

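High-Density Media Workloads: GPU Memory Impact

• The default graphics memory (GM) partition for each VM is 512MB.

• fps for 5ch-1080p transcoding drops by 90%.

[Chart: Impact of graphics address space partitioning. fps of 5ch-1080p transcoding with GM sizes of 1024MB, 896MB, 768MB, 640MB, and 512MB. Series: vMediaServer. Axis range: 0 to 350 fps.]

Configuration: i7-4770, Ubuntu* 14.04 LTS guest, 4 GB memory, graphics memory 512MB-1024MB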

Page 31: IDF Session Presentation Template

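Smart Shadow Page Table

[Diagram labels: Gen GFX adapter; VMM; two guest operating systems, each with a PPGTT; a shadow page table for each guest.]

• Memory-intensive workloads generate a large number of PPGTT accesses.

• Optimization: cache the guest's PPGTT accesses and synchronize them to the shadow page table in a single pass, only when the GPU hardware is about to access the shadow PPGTT.

[Chart: Single VM: total fps for 1080p transcoding at 1ch-1080, 2ch-1080, 3ch-1080, 4ch-1080, 5ch-1080, 7ch-1080, and 10ch-1080. Series: previous, optimized. Axis range: 0 to 400.]

Configuration: i7-4770, Ubuntu* 14.04 LTS guest, 4 GB memory, graphics memory 768MB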
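
The batching idea behind the smart shadow page table can be sketched in Python as follows. This is a toy model under stated assumptions, not the Intel GVT-g implementation: the class and method names are hypothetical, a page table is modeled as a dictionary, the guest-to-host translation is a made-up fixed offset, and `sync_count` stands in for the costly per-update work a hypervisor would otherwise do on every guest write.

```python
from typing import Dict


class SmartShadowPageTable:
    """Toy model: guest PPGTT writes are cached, then synced in one batch."""

    def __init__(self) -> None:
        self.guest: Dict[int, int] = {}   # guest PPGTT: index -> guest frame
        self.shadow: Dict[int, int] = {}  # shadow PPGTT used by the GPU
        self.dirty: Dict[int, int] = {}   # cached writes not yet in shadow
        self.sync_count = 0               # number of shadow synchronizations

    def guest_write(self, index: int, frame: int) -> None:
        """Record a guest page-table write without touching the shadow."""
        self.guest[index] = frame
        self.dirty[index] = frame

    def _translate(self, frame: int) -> int:
        """Hypothetical guest-to-host frame translation (fixed offset)."""
        return frame + 0x1000

    def sync(self) -> None:
        """Apply all cached writes to the shadow table in a single pass."""
        if not self.dirty:
            return
        for index, frame in self.dirty.items():
            self.shadow[index] = self._translate(frame)
        self.dirty.clear()
        self.sync_count += 1

    def gpu_access(self, index: int) -> int:
        """Called when the GPU is about to use the shadow table: sync first."""
        self.sync()
        return self.shadow[index]


table = SmartShadowPageTable()
for i in range(1000):  # a memory-intensive workload: many PPGTT writes
    table.guest_write(i, i * 2)
host_frame = table.gpu_access(10)
```

In this model the thousand guest writes trigger a single synchronization, at the point where the GPU first needs the shadow table. Later GPU accesses with no new guest writes do not synchronize again.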

Page 32: IDF Session Presentation Template

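Cross-Engine Synchronization

Mixed media/3D workloads

CPU cross-engine synchronization: a large number of interrupts go to the hypervisor.

Hardware cross-engine synchronization: the number of interrupts is cut in half, and idle slots are used.

[Diagram: RCS and VCS engine timelines for each approach, showing Obj0, Obj1, and Obj2 interleaved with idle slots.]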