Apache Storm by Skidmarkii

7
Apache Storm 김태우 Tumblr : Skidmarkii [email protected]

Transcript of Apache Storm by Skidmarkii

Page 1: Apache Storm by Skidmarkii

Apache Storm

김태우

Tumblr : Skidmarkii

[email protected]

Page 2: Apache Storm by Skidmarkii

What is STORM?

• From Apache Storm

>> a free and open source distributed realtime computation system.

• From Udacity

>> platform for analyzing realtime streams of data as they arrive.

Page 3: Apache Storm by Skidmarkii

Storm is TOOLKIT

Streams of dataAnalysis result

Storm

is made of Java or Python

Page 4: Apache Storm by Skidmarkii

COMPONENTS of Storm

Topology

Spout

Bolt

Source of data

Do something

그림 출처 : https://storm.apache.org/

Page 5: Apache Storm by Skidmarkii

For TWEET

Stream of Tweet Parse Bolt Count Bolt

그림 출처 : https://storm.apache.org/

Page 6: Apache Storm by Skidmarkii

GROUPINGS

• Shuffle Grouping

>> Randomly distrubutes tuples.

• Fields Grouping

>> Same field, Same task.

• All Grouping

>> Replicates tuples.

• Global Grouping

>> Sends all tuples to a single bolt.

Page 7: Apache Storm by Skidmarkii

I'M GOING TO...

• Do tutorial

>> Set up

>> Examples run

• And more...

>> Udacity.com

>> github