Big Data SMACK: A Guide to Apache Spark, Mesos, Akka, Cassandra, and Kafka

Length: 264 pages
Edition: 1st ed.
Language: English
Publisher: Apress
Publication Date: 2016-09-29
ISBN-10: 1484221745
ISBN-13: 9781484221747
Sales Rank: #2079599


Description

This book explains how to integrate a full-stack open source big data architecture and how to choose the correct technology (Scala/Spark, Mesos, Akka, Cassandra, and Kafka) for every layer. Big data architecture is becoming a requirement for many different enterprises. So far, however, the focus has largely been on collecting, aggregating, and crunching large datasets in a timely manner. In many cases now, organizations need more than one paradigm to perform efficient analyses.

Big Data SMACK explains each of the full-stack technologies and, more importantly, how best to integrate them. It provides detailed coverage of the practical benefits of these technologies and incorporates real-world examples in every situation. The book focuses on the problems and scenarios solved by the architecture, as well as the solutions provided by every technology. It covers the six main concepts of big data architecture and how to integrate, replace, and reinforce every layer:

The language: Scala
The engine: Spark (SQL, MLlib, Streaming, GraphX)
The container: Mesos, Docker
The view: Akka
The storage: Cassandra
The message broker: Kafka

What You’ll Learn

How to build a big data architecture without resorting to complex Greek-letter architectures.
How to build a cheap but effective cluster infrastructure.
How to produce the queries, reports, and graphs that the business demands.
How to manage and exploit unstructured and NoSQL data sources.
How to use tools to monitor the performance of your architecture.
How to integrate all the technologies and decide which to replace and which to reinforce.

Who This Book Is For

This book is for developers, data architects, and data scientists who want to integrate the most successful big data open stack architecture and choose the correct technology in every layer.

Table of Contents

Chapter 1. Big Data, Big Problems
Chapter 2. Big Data, Big Solutions
Chapter 3. The Language: Scala
Chapter 4. The Model: Akka
Chapter 5. Storage: Apache Cassandra
Chapter 6. The View
Chapter 7. The Manager: Apache Mesos
Chapter 8. The Broker: Apache Kafka
Chapter 9. Fast Data Patterns
Chapter 10. Big Data Pipelines
Chapter 11. Glossary

