Distributed computing and data processing frameworks

Slide 3

The big data challenge

Slide 4

The major hardware restrictions

• Processor time
• Memory
• Hard drive space
• Network bandwidth

Slide 5

Memory (RAM)

Slide 6

Hard drive space

Slide 10

MapReduce

Slide 12

Resilient Distributed Datasets (RDDs)

Slide 14

Storm framework

Slide 15

Discussion
