The Potential of Big Data in the Cloud - README | SK플래닛...
Transcript of The Potential of Big Data in the Cloud - README | SK플래닛...
Copyright © 2012 Accenture All rights reserved. Copyright © 2012 Accenture All rights reserved.
• How to apply Big Data & Analytics
• What is it? Definitions, Technology and Data Science
• The Big Data Market inside and outside the cloud
• Some use cases
2
Agenda
Copyright © 2012 Accenture All rights reserved. Copyright © 2012 Accenture All rights reserved.
Resistance is futile Competitive advantage No one size fits all It’s different
Top 4 things about Big Data and Analytics
3
Copyright © 2012 Accenture All rights reserved. Copyright © 2012 Accenture All rights reserved.
Source: An IDC White Paper - sponsored by EMC. As the Economy Contracts, the Digital Universe Expands. May 2009.
.
Complex, Unstructured
Relational
New kinds of data
Structured data vs. Unstructured data growth
Our ability
to analyze
Analysis
gap
4
Copyright © 2012 Accenture All rights reserved. Copyright © 2012 Accenture All rights reserved.
Big Data Technologies
New technologies, new approaches
Source: Wordle for Credit Suisse, Does Size Matter Only?, September 2011
5
An Illustrative Customer Experience: We Detect a Customer’s Promotion
6
Existing Customer with a Current Account, Bank Detects Financial Improvement, Suggests Options (Customer Retention Scenario)
Very simple low-pass filter on transaction record
Comparisons made between Jane’s historical spending vs saving behaviour and those
of other customers
Jane has recently been promoted. An alert is triggered that her direct deposit amounts have jumped this month.
Financial recommendation system settles on advice to propose to Jane based on successful peers experiencing a similar trend.
• Improved Awareness of Customer: • Behavioural data captured
and stored for future use • Enhance segmentation and
enabling targeted offerings
• Improved Ability to Correlate Customers: • Allow for better targeting • Develop more agile
response capability
Social activity trends logged, fed back into a validation and improvement loop
Communications logged, retained for analysis,
incremental improvements
• Sentiment analysis: • Identify customer
perception about brand • Improve segmentation • Help with personalised
and targeted offerings
Bank engages Jane via web, SMS, and/or phone call to present suggestions and guidance, e.g., upgrading to a premium account.
Cu
sto
me
r Jo
urn
ey
Dat
a In
sigh
t B
usi
ne
ss
Val
ue
Jane enjoys better control and more financial security, broadcasts this success explicitly and implicitly.
Opportunity Detection
• Increased Customer Engagement: • An opportunity to improve
the relationship between the bank and its customer
Correlation and Prediction Proposition Reduced Churn
Web site screen shot
An Illustrative Customer Experience: Location-based Mobile Shopping Recommendations
7
Existing Customer with the Bank’s Mobile App Installed on his Mobile Device (Mobile Recommendations Scenario)
App sends home location of customer
Further calculations possible to compare customers on the
basis of daily routines
John is moving through town on foot, on transit, or in his car.
• Improved Data Quality: • Behavioural data captured
and stored for future use • Can be further analysed and
used to develop further offerings
• Improved Customer Insight: • Fuller understanding of
customer behaviour
Further analysis possible to improve targeting and
engagement
Records kept of which notifications result in
behavior and under what circumstances
• Improved brand perception: • Positive customer
experience of bank in the mobile space
• Cutting-edge tools
Mobile app raises a notification to John, and John tries out a new shop.
Cu
sto
me
r Jo
urn
ey
Dat
a In
sigh
t B
usi
ne
ss
Val
ue
• Improved Customer Insight: • More detailed analysis of
what drives customers financially and socially
John comes within a physical threshold of a shop where similar customers tend to shop but he does not.
Location Observation Proposition Correlation Reduced Churn
John finds mobile app useful and as a result has increased engagement with other offerings of the bank.
Bank storefront
An Illustrative Customer Experience: Suggesting Mortgage and Savings Plans for Newly Engaged Customers
8
Existing Customer with a Current Account, Bank Infers Future Marriage, Suggests Options (Mortgage and Savings Plan Scenario)
Comparing user behavior against historical library of spending behaviors of
all users
Outlier spending detected quickly and rules of
engagement applied automatically
Jim has been dating Julie. His spending habits have trended away from his usual nights out with friends, toward more romantic, pricier restaurants.
User-sim system recognizes this trend, and when Jim makes an extraordinarily large purchase at a local jeweler an alert is raised.
• Improved Awareness of Customer: • Behavioural data captured
and stored for future use • Enhance segmentation and
enabling targeted offerings
• Improved Ability to Flag Outlier Behaviour: • Possible to react quickly to
changing conditions and target more effectively
Social activity trends logged, fed back into a validation and improvement loop
Analysis used to predict customer’s future needs and
target appropriate offers
• Increased Customer Loyalty: • Long-term customers
provide the bank with even more opportunity to make smart suggestions
Analysis suggests that users with similar behaviour to Jim are likely to buy a house within 6 months. Jim currently does not have enough savings for a deposit so the bank emails a savings plan offer tailored to Jim’s needs.
Cu
sto
me
r Jo
urn
ey
Dat
a In
sigh
t B
usi
ne
ss
Val
ue
Jim enjoys an increased feeling of security as a customer of the bank, given their inclination to suggest ways he can save for his future.
Opportunity Detection
• Increased Cross Sell and Up Sell: • An opportunity to increase
cross sell and up sell rates to existing customers based on detailed analysis
Correlation and Prediction Proposition Increased Loyalty
Bank web site
Opportunity Areas
9
Sell more to existing
customers
Sell more to new customers
Retain more customers
Reduce risk exposure
Reduce cost to sell
Reduce cost to serve
• Proactively contact customers based on behavioural triggers and key life stages
• Improve action prompts based on social insight
• Provide personalised pricing based on recent circumstances and predicted changes
• Convert more leads into sales by using social data indicators during interactions
• Improve measurement and monitoring of cancellation propensity
• Proactively target customers with high risk of churn with specific high value services • Send pre-delinquency customer
messages • Add an additional layer ( of predicted
circumstances) in approval process of financial aid requests
• Pre-assess customers reducing invitations to non-eligible or bad debt customers
• Improve Forecast and planning process based on insight
• Proactively inform customers about service issues and next steps
• Include and generate relevant service prompts
• Use innovative technologies to store/retrieve data
Big Data
Copyright © 2012 Accenture All rights reserved. Copyright © 2012 Accenture All rights reserved.
Big Data Analytics is a shift in the mindset of how we think about analytics as an internal component to the organization
Focuses on letting data be productized in a way that drives meaningful insights in a rapid fashion and innovation to exploit missed opportunities in areas previously unlooked
Big Data Analytics
What is it?
10
Copyright © 2012 Accenture All rights reserved. Copyright © 2012 Accenture All rights reserved.
Everything will be analyzed
The three Vs
Structured Unstructured
Batch
Real-time
Velocity
Variety Source: IDC
Distributed,
ETL
Relational,
ETL
In-memory,
NoSQL, Event processing,
EDW
Event
processing, Distributed+
NoSQL
Volume
11
Copyright © 2012 Accenture All rights reserved. Copyright © 2012 Accenture All rights reserved.
Big Data Analytics vs. traditional analytics
Where do they differ?
Technology Skills Processes & Organization
Big
Data
An
aly
tics
Tra
dit
ion
al
An
aly
tics
Assumes condensed,
structured, and feature rich datasets that can be modeled: relational
databases, data warehouses, dashboards
Basic knowledge of
reporting and analysis tools, few specialized resources
“Siloed” data
organizations Only specific “views” of
data visible across the enterprise
A stack of tools that
enables an organization to build a framework that allows them to extract
useful features from a large dataset to further
understand how to model their data.
Advanced analytical,
mathematical and statistical knowledge required to develop new
models – the data scientist
Data is productized and
shared across the enterprise
Dedicated data organizations with well-
defined data management processes and ownership
12
Copyright © 2012 Accenture All rights reserved. Copyright © 2012 Accenture All rights reserved.
MapReduce and Hadoop
MapReduce revolutionized how we handle large amounts of data, Hadoop made it simple and affordable
• Originally designed and first developed in
Google as part of their efforts to more
efficiently index the web
• MapReduce splits input data into smaller
chunk that can be processed in parallel • Scales linearly with number of nodes
• Yahoo’s implementation of MapReduce
• Open source, top-level project in the
Apache Foundation
• Designed to run on commodity software
(Linux) and hardware (consumer-grade computers with directly attached storage)
• Large ecosystem of additional
components (both open source and
commercial) 13
Copyright © 2012 Accenture All rights reserved.
Analytics-Focused Massively Parallel Processing
(MPP) Software Platforms
Distributed In-memory
Big Data and Analytics in the Enterprise
Many technology choices in a rapidly changing environment. Which one is right for you?
Cloud
Hardware Optimized MPP Data Warehouses
Distributed Non-Relational Storage and Processing
Big Data-Enabled Intelligence and Analysis
14
Copyright © 2012 Accenture All rights reserved. Copyright © 2012 Accenture All rights reserved.
Technology
Augmenting existing analytics with Big Data technologies
Emerging Data
Technologies
Existing Analytics Tools and
Investments
Big Data Analytics
15
Copyright © 2012 Accenture All rights reserved. Copyright © 2012 Accenture All rights reserved.
It’s not just Hadoop
What are traditional analytics vendors doing about it?
Distributed In-memory
16
Copyright © 2012 Accenture All rights reserved. Copyright © 2012 Accenture All rights reserved.
The impact of Big Data Analytics on our landscapes
Hybrid landscapes, where old and new converge
ERP CRM Web Logs Time
Series Files Social
Relational DBs
Enterprise DW
Real-time analytics
HDFS
HBase
MapReduce
Hive
Data Services (REST, WS)
Pig
ETL
Internal apps,
customer-facing
apps, mobile
apps Analysis tools (SAS, SPSS, R,
Tableau)
17
Copyright © 2012 Accenture All rights reserved. Copyright © 2012 Accenture All rights reserved.
Data science
“The sexy job in the next 10 years will be statisticians”
– Hal Varian, Chief Economist at Google
Data scientists are the next-generation analytics professional, responsible for turning the data into insight
Data Science and the skill gap
Closing the loop – it’s not just about technology skills
18
Copyright © 2012 Accenture All rights reserved. Copyright © 2012 Accenture All rights reserved.
Some examples
“Cool” Cloud Vendors of Big Data Analytics
Cloud Analytics reference models for Asset Management,
Banking, HighTech, Insurance and Retail
their business analytics platform is used by leading corporations
in many industries, including automotive, commercial real
estate, restaurants and entertainment, fast moving
consumer goods, retail franchising,
and telecommunications.
They leverage Force.com platform as a service as well as
traditional big data toolset to develop Geographical Intelligence
for sales reps.
They develope software for BI SaaS potential service
providers, both private or public.
19
Copyright © 2012 Accenture All rights reserved. Copyright © 2012 Accenture All rights reserved.
Business challenge
• Database growth at 2 TB per month
• Traffic and Data size double every 6
months
• Total storage required reach 2
Petabytes in 2015
• Poor Oracle performance, very costly to
scale
• Siloed database systems
• Proliferation of home-grown tools
• Decentralized business rules and reporting data
Solving real problems with Big Data Analytics
Case study 1: Large storage systems vendor
Technologies used
• Processing – Hadoop, Hive, Pig, HBase
• Log processing – Flume
• Monitoring – Ganglia
• Business Intelligence – Pentaho
Delivered Results
• Highly scalable data processing platform
• Centralized data storage
• Cluster utilized by all teams and groups
• Increased efficiency of data consumption
• Foundation for BDaaS offering
20
Copyright © 2012 Accenture All rights reserved. Copyright © 2012 Accenture All rights reserved.
Technologies • Processing – Hadoop, Hive
• Log archiving – Flume • Data retrieval – CouchDb
Delivered Results • Highly scalable data platform
• Various data mining and machine
learning algorithms
• Centralized data storage
• Cluster utilized by all teams and groups • Increased efficiency of data
consumption
• Innovation across all teams
• Established Central Analytics team and
private cloud
Business challenge
• Enormous amount of Customer,
Transaction and Click-through data.
• Inability of existing Relational stores to
power the various batch queries and
computations.
• Data residing in different stores spread
across the company
Solving real problems with Big Data Analytics
Case 2: Global retailer
21
Copyright © 2012 Accenture All rights reserved. Copyright © 2012 Accenture All rights reserved.
Business Challenge
• Lack of agility in data processing and
analysis
• Business and Data Analysts forced to
wait inordinate amount of time to
explore the data
• Difficulty in ingesting new sources of
data without exhaustive ETL
processes
• Inability to apply advanced analytic
and statistical functions to a large data
set
Solving real problems with Big Data Analytics
Case 3: Large insurance company
Technologies used
• Processing – Hadoop, Hive, Pig,
• Analytics – Greenplum, R, Madlib
• Visualization – Tableau, Karmasphere, Alpine
Miner
Delivered Results
• Agile BI platform
• Multiple options for data ingestion and
processing for different business scenarios
• Hadoop as an economical platform for data
processing and Greenplum to ease, expedite
and enhance the data processing
22
Copyright © 2012 Accenture All rights reserved. Copyright © 2012 Accenture All rights reserved.
Wrapping up
Big Data is challenging current patterns of thought
Cost-effective
computing and
storage
Data
“explosion”
Everything can be
stored
Cheap large scale
computing power
readily available
Data everywhere:
structured,
unstructured,
other people’s
data, geolocation
data
Big Data and Analytics
Resistance is futile
Are the path to competitive advantage and create value
There are many ways to go about
it Compared to traditional analytics,
they’re different; adapt or become irrelevant
23
Copyright © 2012 Accenture All rights reserved. Copyright © 2012 Accenture All rights reserved. 24
Accenture Technology Vision
http://bit.ly/accenturetechnologyvision2012
Strong advice on data for 2012