Samsung SDS BrighticsTM v2.0 Suite · -Real-time analysis, text analytics, prescriptive analytics...

27
Copyright 2016 Samsung SDS Co., Ltd. All rights reserved Product Overview Brightics TM v2.0 Suite Samsung SDS

Transcript of Samsung SDS BrighticsTM v2.0 Suite · -Real-time analysis, text analytics, prescriptive analytics...

Page 1: Samsung SDS BrighticsTM v2.0 Suite · -Real-time analysis, text analytics, prescriptive analytics and deep learning functionalities to be offered (2nd half of the year) Easy-to-use

Copyright ⓒ 2016 Samsung SDS Co., Ltd. All rights reserved

Product Overview

BrighticsTM

v2.0 Suite

Samsung SDS

Page 2: Samsung SDS BrighticsTM v2.0 Suite · -Real-time analysis, text analytics, prescriptive analytics and deep learning functionalities to be offered (2nd half of the year) Easy-to-use

I. Brightics개요

- 빅데이터 분석플랫폼의 필요성

- Why Brightics?

II.Brightics Architecture

- Brightics 구조

- Key Components of Brightics

III. Brightics in Detail

- 사용자 중심의 Web 기반 통합 분석환경

- Visual & Interactive Data Search 및 시각화

편리한 분석 Modeling 환경

- Automated Installation/Configuration

- 손쉬운 서비스 관리

IV. Brightics Use Cases

V. 기술문의

- Requirement

- 문의

I. Brightics Overview

II. Brightics Architecture

III.Brightics in Detail

IV.Brightics Use Cases

V. Technical Support

AGENDA

Page 3: Samsung SDS BrighticsTM v2.0 Suite · -Real-time analysis, text analytics, prescriptive analytics and deep learning functionalities to be offered (2nd half of the year) Easy-to-use

IBrightics Overview

Page 4: Samsung SDS BrighticsTM v2.0 Suite · -Real-time analysis, text analytics, prescriptive analytics and deep learning functionalities to be offered (2nd half of the year) Easy-to-use

4/29 Copyright ⓒ 2016 Samsung SDS Co., Ltd. All rights reserved.

Analytics platform is required to offer a variety of fundamental big data technologies and integrated

user environment.

Why Big Data Analytics Platform?

I. Brightics Overview

■□□□□

Requirements

• Rapid processing and analysis of

massive data, which is impossible with

traditional analysis approaches

• Analysis tool and automation

functionality to support different levels of

users and meet different goals

• Technology suitable for big data

visualization to analyze/verify large-scale

data

• Decision support functionality optimized for complex customer

environment

StatisticsAnalytics

BusinessIntelligence

IT

Required Functionality for Analysis Platform

Advanced statistical/analysis functions to analyze

massive and diverse data

Production environment for easy modeling and

rapid verification

Interactive visualization of big

data

Real-time analysis of

unstructured data

Page 5: Samsung SDS BrighticsTM v2.0 Suite · -Real-time analysis, text analytics, prescriptive analytics and deep learning functionalities to be offered (2nd half of the year) Easy-to-use

5/29 Copyright ⓒ 2016 Samsung SDS Co., Ltd. All rights reserved.

BrighticsTM

is an integrated platform for optimal big data analysis at complicated enterprise environments.

Why BrighticsTM

?

I. Brightics Overview

■□□□□

High performance big data analytics

platform to enable rapid analysis of

super large volume of data at a low cost

which is impossible with traditional analysis

tools

Intuitive and interactive web-based visual

analytics environment to help find potential

insights from data

Advanced analysis functionality supporting AI

including prescriptive analytics and deep learning

Page 6: Samsung SDS BrighticsTM v2.0 Suite · -Real-time analysis, text analytics, prescriptive analytics and deep learning functionalities to be offered (2nd half of the year) Easy-to-use

IIBrightics Architecture

Page 7: Samsung SDS BrighticsTM v2.0 Suite · -Real-time analysis, text analytics, prescriptive analytics and deep learning functionalities to be offered (2nd half of the year) Easy-to-use

7/29 Copyright ⓒ 2016 Samsung SDS Co., Ltd. All rights reserved.

Offers Hadoop ecosystem-based BrighticsTM

server and user analysis/visualization tool.

Brightics™ Architecture

II. Brightics Architecture

□■□□□

RDB

Sensor Device Equipment

BrighticsTM

Brightics Core

Jun ‘16 Dec ‘16

VisualAnalytics

Real-timeAnalytics

TextAnalytics

DeepLearning

Data to analyzeVisualization

of data analysis results

Page 8: Samsung SDS BrighticsTM v2.0 Suite · -Real-time analysis, text analytics, prescriptive analytics and deep learning functionalities to be offered (2nd half of the year) Easy-to-use

8/29 Copyright ⓒ 2016 Samsung SDS Co., Ltd. All rights reserved.

Key Components of Brightics

Data Preparation

Interface with various data sources

-Support for RDB, HDFS, CSV files

-Real-time data, unstructured text data (2nd half of the year)

1

• Table/chart data search

• Data visualization

Data Search

2

• Data upload

• Data filtering

• Missing data

processing

DataPreparation

1

Modeling

3

Model Assessment

4

• Data upload

• Data filtering

• Missing data

processing

DataPreparation

1

• Table/chart data search

• Data visualization

Data Search

2

Data Search

Visual & interactive data search and visualization

-Seamless data interface to analysis data and visualization charts enables intuitive data analysis

Fast distributed data processing

-Diverse high-performance statistics and advanced analysis functions rapidly analyze large-scale data

2

II. Brightics Architecture

□■□□□

Page 9: Samsung SDS BrighticsTM v2.0 Suite · -Real-time analysis, text analytics, prescriptive analytics and deep learning functionalities to be offered (2nd half of the year) Easy-to-use

9/29 Copyright ⓒ 2016 Samsung SDS Co., Ltd. All rights reserved.

Key Components of Brightics

• Table/chart data search

• Data visualization

Data Search

2

Model Assessment

4

• Data upload

• Data filtering

• Missing data

processing

DataPreparation

1

• Table/chart data search

• Data visualization

Data Search

2

• Function

selection/application

• Workflow

• Model sharing and report

Modeling

3

Modeling

Interface with various data sources

-Integrated analysis environment for visual analytics to see data and workflow at a glance

-Real-time analysis, text analytics, prescriptive analytics and deep learning functionalities to be offered (2nd half of the year)

Easy-to-use analysis modeling

-Quick menu (Delete, connect, copy) to minimize the number of mouse clicks in case of creating workflow

-Function search with key word selection and entry and multiple pre-processing functions available

Model sharing and report

-Import and export functionality to share a model and reporting functionality for desired chart and tables

3

II. Brightics Architecture

□■□□□

Page 10: Samsung SDS BrighticsTM v2.0 Suite · -Real-time analysis, text analytics, prescriptive analytics and deep learning functionalities to be offered (2nd half of the year) Easy-to-use

10/29 Copyright ⓒ 2016 Samsung SDS Co., Ltd. All rights reserved.

Key Components of Brightics

• Table/chart data search

• Data visualization

Data Search

2

• Data upload

• Data filtering

• Missing data

processing

Data Preparation

1

• Function

selection/application

• Workflow

• Model sharing and report

Modeling

3

• Model simulation

• Model assessment

(ROC et al)

ModelAssessment

4

Model assessment

Model simulation

-Simulation results of analysis model according to the selection of data and parameters (To be offered, 2nd half of the year)

Analysis model assessment

- Based on evaluation function, visualize the accuracy of analysis functions with figures and graphs to offer a guideline for

suitable model selection

4

Installation/management

Automated installation/configuration

-Service installation for big data analytics in one click available for non-big data experts

Ease of service management

-Real-time monitoring and management over installed services

5

II. Brightics Architecture

□■□□□

Page 11: Samsung SDS BrighticsTM v2.0 Suite · -Real-time analysis, text analytics, prescriptive analytics and deep learning functionalities to be offered (2nd half of the year) Easy-to-use

IIIBrightics in Detail

Page 12: Samsung SDS BrighticsTM v2.0 Suite · -Real-time analysis, text analytics, prescriptive analytics and deep learning functionalities to be offered (2nd half of the year) Easy-to-use

12/29 Copyright ⓒ 2016 Samsung SDS Co., Ltd. All rights reserved.

1) User-oriented Web-based Integrated Analysis Environment

III. Brightics in Detail

□□■□□

Data and workflow handling in a single window

• Workflow visualization

Visualization of functions by category and execution status to see analysis flow at a glance

• Input data visualization

Charts and table-based visualization to select necessary variables for processing/analysis functions

• Immediate function executionRun function immediately after changing variables to enhance the ease of testing

• Automatic display of outputsAutomated chart recommendations to best represent outputs of the analysis function

Key CharacteristicsInputdata

visualization

Workflow

visualization

Immediate

function

execution

Output data

visualization

Page 13: Samsung SDS BrighticsTM v2.0 Suite · -Real-time analysis, text analytics, prescriptive analytics and deep learning functionalities to be offered (2nd half of the year) Easy-to-use

13/29 Copyright ⓒ 2016 Samsung SDS Co., Ltd. All rights reserved.

2) Visual & Interactive Data Search and Visualization

III. Brightics in Detail

□□■□□

Seamless Data/Chart Interface and Report

Key Characteristics

Select chart data

Select data filter

• A variety of charts and reportsAnalyze entire or partial data with 14 charts

from various perspectives and offer reports

on outputs

• Visual & interactive data searchSeamless interface to analysis data and

visualized charts enables to intuitively select

and analyze data

• Easy to select and analyze dataData filter enables users to automatically

obtain and immediately analyze detailed and

additional data

Page 14: Samsung SDS BrighticsTM v2.0 Suite · -Real-time analysis, text analytics, prescriptive analytics and deep learning functionalities to be offered (2nd half of the year) Easy-to-use

14/29 Copyright ⓒ 2016 Samsung SDS Co., Ltd. All rights reserved.

3) Easy-to-use Modeling Environment

III. Brightics in Detail

□□■□□

Keyword-based function search &assessment of analysis models

Key Characteristics

• Ease of function selection Offer function search with keyword entry

and selection and detailed description and

examples of functions

• Myriad of large-scale analysis

functions Up to 120 data pre-processing, cluster

analysis/identification, regression analysis

functions (to be offered 2nd half of the year)

• Comparison and assessment of

analysis functionsBased on evaluation function, visualize

the accuracy of analysis functions with

figures and graphs to offer a guideline on

suitable models (2nd half of the year)

Keyword search

Output of evaluation function

Analysis

function 1

Analysis

function 2

Evaluation

function

e.g. Logistic Regression

e.g. Decision Tree

e.g.)ROC

Analysis function 1

Analysis function 2

Page 15: Samsung SDS BrighticsTM v2.0 Suite · -Real-time analysis, text analytics, prescriptive analytics and deep learning functionalities to be offered (2nd half of the year) Easy-to-use

15/29 Copyright ⓒ 2016 Samsung SDS Co., Ltd. All rights reserved.

4) Automated Installation

III. Brightics in Detail

□□■□□

Web UI-based easy installation and duplicate analysis server

Key Characteristics

• Installation in one click

General users can easily go

through entire installation and

configuration process on web UI

(Enter only server information and

ID/password)

• Advanced installation

Advanced users can select and

configure desired services only

• Duplicate analysis server

Duplicate server for front office

provides fault-tolerant services

Installation status monitoring

Advanced installation

Page 16: Samsung SDS BrighticsTM v2.0 Suite · -Real-time analysis, text analytics, prescriptive analytics and deep learning functionalities to be offered (2nd half of the year) Easy-to-use

16/29 Copyright ⓒ 2016 Samsung SDS Co., Ltd. All rights reserved.

5) Ease of Service Management

III. Brightics in Detail

□□■□□

Real-time monitoring and management over installed services

Key Characteristics

• Service management

Check whether installed service is

properly working and promptly start/stop

service

• Job and resource monitoring

Monitor Hadoop clusters and diverse job

resources

• Configuration change and

prompt application

Right after modifying memory and

clusters, restart and promptly apply

changes

Service management

Job and resource monitoring

Hadoop Job Spark JobResource for

analysis

Page 17: Samsung SDS BrighticsTM v2.0 Suite · -Real-time analysis, text analytics, prescriptive analytics and deep learning functionalities to be offered (2nd half of the year) Easy-to-use

IVBrightics Use Cases

Page 18: Samsung SDS BrighticsTM v2.0 Suite · -Real-time analysis, text analytics, prescriptive analytics and deep learning functionalities to be offered (2nd half of the year) Easy-to-use

18/29 Copyright ⓒ 2016 Samsung SDS Co., Ltd. All rights reserved.

Enhanced credibility of Cello Square’s external logistics service by collecting/analyzing logistics

risks such as natural disaster, weather, and social issues and assessing shipment risks

Assessment of 21 risks in total including marine/air/in-land transportation and structured

+unstructured data

Text mining: extracts logistics risks-related data from news articles

Ordinal classification by the intensity of risks

Real-time data collection/analysis: RSOE, OpenWeatherMap, Factiva data

Developed logistics risk assessment methodology by leveraging failure mode effect analysis

Cello Square Risk Monitoring

IV. Brightics Use Cases

□□□■□

Business

Values

Secured

technology

Page 19: Samsung SDS BrighticsTM v2.0 Suite · -Real-time analysis, text analytics, prescriptive analytics and deep learning functionalities to be offered (2nd half of the year) Easy-to-use

19/29 Copyright ⓒ 2016 Samsung SDS Co., Ltd. All rights reserved.

Faster-decision making with dramatically shorter prediction time driven by automated and advanced

analysis

Suggestion on right promotion timing through demand/sales forecast

More efficient business management by forecasting new product lifecycle

Product Demand/Sales Forecast

IV. Brightics Use Cases

□□□■□

Sell-out forecast

- Identifies factors affecting sales amount and estimates their impacts to predict sales amount

Forecast calibration

- Enhances the forecast accuracy by reflecting promotion effects and time-series forecast of individual models

Sales patterns by group

Peculiarities of weekly sales

• Estimate sell-out pattern by sales volume (High/Med/Low)

• Estimate and reflect the bounds of seasonal product

group’s sales volume

• Sales characteristics per month/week

① Adjust top 25% by DC

② Estimate the number of key SKUs sold and total

sales amount by week

Past 52 weeks-based

time-series forecast of individual

SKUs by DC/product group

Forecast

Calibration

Flexible structure to dynamically reflect attributes from adding customers and extending forecast weeks

2013W01

~ this week

Sell-Out

Weekly update of plan week

Promotion Plan

Sales result-based

patterns

Sales trend

by product

• Correct past prediction errors for same DC/weekly sales

• Adjust forecasts during zero-promotion period

• Adjust after constantly estimating sell-out patterns over 8 weeks

Business

Values

Secured

technology

Page 20: Samsung SDS BrighticsTM v2.0 Suite · -Real-time analysis, text analytics, prescriptive analytics and deep learning functionalities to be offered (2nd half of the year) Easy-to-use

20/29 Copyright ⓒ 2016 Samsung SDS Co., Ltd. All rights reserved.

Big data analytics-based innovative failure prevention framework responding to the

complexity and rapid changes to data centers to create new businesses

- Environment for prompt failure analysis (data storage by management entity → data integration by service)

- Multi-faceted anomaly detection (Threshold → performance and log data analysis-based)

Log analysis-based anomaly detection: computes the times words/patterns appeared and normal

range

Correlation analysis-based anomaly detection: analyzes correlations of time-series data such as

performance, logs and application

Enhanced Anomaly Detection Algorithm

IV. Brightics Use Cases

□□□■□

Abnormal cpu_util

Abnormal correlation

coefficients of cpu_util and

mem_usage

time

time

Abnormal correlation coefficients

of cpu_util and Avg_Response_

Time

Abnormal correlation

coefficients of cpu_util and

Java_Message_

Service_Count

time

time

Abnormal correlation coefficients

of cpu_util and "Denied“

counts

Abnormal correlation

coefficients of cpu_util and

no. of approval requests

time

time

Type

DB

WEB-WAS

WEB-WAS

Logs

Application-specific

metric

mem_usage

Avg_Response_Time

Java_Message_

Service_Count

"Denied“ counts

No. of approval requests

DB cpu_util

Business

Values

Secured

technology

Page 21: Samsung SDS BrighticsTM v2.0 Suite · -Real-time analysis, text analytics, prescriptive analytics and deep learning functionalities to be offered (2nd half of the year) Easy-to-use

21/29 Copyright ⓒ 2016 Samsung SDS Co., Ltd. All rights reserved.

Building Energy Analysis

IV. Brightics Use Cases

□□□■□

Energy demand forecast-based peak power decrease to save energy cost and offer operation guidelines

Energy leak reduction and suggestion on equipment change timing by leveraging sensor anomaly detection technology

Building energy demand forecast

- Baseline verification (Interaction Regression, Similarity Based Model)

- Time-series prediction (Time-series decomposition, ARIMA, Weighted Moving Average)

Visualization to verify suspect factors

- Time-series analysis (Time series K-means, Dynamic Time Warping)

- Anomaly detection (Dynamic Bound, Asymmetric Outlier Remove)

Energy baseline model Peak power demand estimation

Sensor anomaly detection

Business

Values

Secured

technology

Page 22: Samsung SDS BrighticsTM v2.0 Suite · -Real-time analysis, text analytics, prescriptive analytics and deep learning functionalities to be offered (2nd half of the year) Easy-to-use

22/29 Copyright ⓒ 2016 Samsung SDS Co., Ltd. All rights reserved.

Resident’s Living Patten Profile Analysis

IV. Brightics Use Cases

□□□■□

Machine learning and statistical analysis to assume the profiles of each generation

Time-series data analysis for event prediction

Created generation-specific profiles of living pattern by comprehensively analyzing diverse data

collected from home

Analytics platform for tailored data push service to generation according to profiles

Commercialized analytics functionality for B2B business such as linking to B2B advertising

business tailored to lifestyle

Expanded analytics-based data push service

Families that leave home for school or office

between 6 and 7 o’clock in the morning on weekdays

Senior couples who usually stay at home, not going

outside

Families with children that go out frequently on weekends

It is likely to rain this afternoon. Please

close the window and take an umbrella

before you go out.

Don’t forget to bring your smartphone.

You have barely gone outside for the past

3 days. At 4 pm, the weather is clear with

low UV index. How about taking a walk?

We will have strong winds with high

density of fine dust this afternoon.

Children are advised to refrain from

outdoor activities.

산책

자외선좋음

Sunny

미세먼지나쁨

외부활동자체

WindRAIN

우산

스마트폰

Business

Values

Secured

technology

Page 23: Samsung SDS BrighticsTM v2.0 Suite · -Real-time analysis, text analytics, prescriptive analytics and deep learning functionalities to be offered (2nd half of the year) Easy-to-use

23/29 Copyright ⓒ 2016 Samsung SDS Co., Ltd. All rights reserved.

Security Analytics Model Development

IV. Brightics Use Cases

□□□■□

Developed malicious code analysis model based on analytics rather than traditional rule-based

signature approach to respond to APT (Advanced Persistent Threat) attacks

Developed a model to detect signs of (personal) information leak by analyzing employees’

document usage patterns

Developed a model to detect potentially compromised PCs by analyzing security log data

Developed a model to detect a suspicious user who leaked personal information

Models to detect hacking suspect PCs

Factors to identify potentially compromised

PCs

Data traffic

Access interval/cycle

User-agent peculiarities

Document attachment and modification

Random Forest Clustering

Security LogAttributes factors for

extraction

0

5

10

15

20

1 2 3 4 5 6 7 8 109

Frequency

Task type

Usage patterns by user

Business

Values

Secured

technology

Page 24: Samsung SDS BrighticsTM v2.0 Suite · -Real-time analysis, text analytics, prescriptive analytics and deep learning functionalities to be offered (2nd half of the year) Easy-to-use

VTechnical Support

Page 25: Samsung SDS BrighticsTM v2.0 Suite · -Real-time analysis, text analytics, prescriptive analytics and deep learning functionalities to be offered (2nd half of the year) Easy-to-use

25/29 Copyright ⓒ 2016 Samsung SDS Co., Ltd. All rights reserved.

Requirements

TM

※ 분석 목적 및 데이터 크기에 따라 달라질 수 있습니다

Web Browser: Internet Explorer 11 or higher or Chrome

OS : Linux Multi Node or Single Node

CPU : 4 Cores per node

RAM : 8G+ per node

HDD : 100G+ per node

Hadoop 2.6

Spark 1.6

JVM 1.8 or higher

Alluxio 1.0

PostgreSql

Ⅴ. 기술 문의

□□□□■

Page 26: Samsung SDS BrighticsTM v2.0 Suite · -Real-time analysis, text analytics, prescriptive analytics and deep learning functionalities to be offered (2nd half of the year) Easy-to-use

26/29 Copyright ⓒ 2016 Samsung SDS Co., Ltd. All rights reserved.

V. Technical Support

□□□□■

Samsung is

the best partnerfor your Company

Email

[email protected]

Product support

•Jaeyoung LEE (R&D Center) +82-2-6155-3664

Policy support

•EunJoo LEE (R&D Center) +82-2-6155-3680

Inquiries

Contact

Copyright ⓒ 2016 Samsung SDS Co., Ltd. All rights reserved.

Page 27: Samsung SDS BrighticsTM v2.0 Suite · -Real-time analysis, text analytics, prescriptive analytics and deep learning functionalities to be offered (2nd half of the year) Easy-to-use

www.samsungsds.com

Copyright ⓒ 2016 Samsung SDS Co., Ltd. All rights reserved.