LIBRISTO
LIBROAMANTO
obligatoriu
Faceți parte dintr-o comunitate de iubitori de cărți din întreaga lume și beneficiați de o mulțime de avantaje Creați-vă un cont gratuit
0
Transport gratuit la punctele de livrare Pick Up peste 349.00 lei
Packeta 15.00 lei Cargus 28.00 lei Easybox 20.00 lei FAN 20.00 lei Punct FAN 16.00 lei Punct DPD 17.00 lei Curier Sameday 24.00 lei Curier DPD 25.00 lei

Livrare gratuită pentru comenzile peste 349,00 lei.

Apache Spark 4.0

Build High-Performance Data Engineering Pipelines with Spark SQL, Structured Streaming, and Modern Cluster Architectures

Limba englezăengleză
Carte Carte broșată
Carte Apache Spark 4.0 Yila Harrison
Codul Libristo: 51319811
Editura Independently published, februarie 2026
Build High-Performance Data Engineering Pipelines with Spark SQL, Structured Streaming, and Modern C... Descrierea completă
? points 42 b Nou Nou
91.68 lei
În depozitul extern Expediem în 9-15 zile

30 de zile pentru retur bunuri


Ar putea de asemenea, să te intereseze


Modern Computer Vision with PyTorch Yeshwanth Reddy / Carte Carte broșată
common.buy 366.44 lei

Build High-Performance Data Engineering Pipelines with Spark SQL, Structured Streaming, and Modern Cluster Architectures

Apache Spark has become the backbone of modern data engineering - but knowing Spark isn't the same as mastering it in production.

Apache Spark 4.0 is a deeply practical, production-focused guide for data engineers, platform engineers, and analytics professionals who want to build scalable, fault-tolerant, high-performance data pipelines using Spark SQL, Structured Streaming, and modern cluster architectures.

This book goes far beyond surface-level tutorials. It teaches you how Spark actually works under the hood - and how to use that knowledge to design systems that scale.

You won't just learn Spark APIs.
You'll learn how to think like the Spark engine.


What You'll Master

Inside this book, you will learn how to:

  • Understand Spark's execution model: jobs, stages, tasks, DAGs, Catalyst, and Tungsten

  • Write high-performance Spark SQL queries and choose efficient join strategies

  • Design batch, streaming, and hybrid pipelines that scale

  • Optimize memory, CPU, shuffle behavior, and partitioning

  • Build real-time pipelines with Structured Streaming

  • Deploy Spark on Kubernetes and modern cloud architectures

  • Diagnose slow jobs and production failures with confidence

  • Apply operational best practices for reliability and fault tolerance

  • Design complete end-to-end data engineering systems

Each chapter builds progressively - from core fundamentals to advanced architectural decisions - ensuring you develop both tactical skills and strategic judgment.


Built for Real-World Production

This book is not theoretical.

Every concept is explained clearly, then grounded in practical Spark applications. You will learn how to:

  • Prevent silent data corruption

  • Handle skewed data and large shuffles

  • Tune Spark configurations that actually matter

  • Debug production failures under pressure

  • Design pipelines that survive real workloads

If you work with large-scale data, this book gives you the mental models and tools needed to operate Spark with confidence.


Who This Book Is For

This book is ideal for:

  • Data Engineers building batch and streaming pipelines

  • Analytics Engineers optimizing Spark SQL workloads

  • Platform Engineers managing Spark clusters

  • Developers moving from Spark basics to production mastery

  • Teams adopting Spark 4.0 and modern cluster architectures

If you already know basic Spark and want to move into performance tuning, reliability, and architecture design - this book is for you.


Why Apache Spark 4.0 Matters

Spark 4.0 represents a refinement of Spark's execution engine, adaptive query behavior, and production readiness. This book shows you how to leverage those improvements without guesswork.

Instead of memorizing settings or copying code snippets, you'll understand:

  • Why Spark behaves the way it does

  • How execution plans translate into real resource usage

  • When Spark is the right tool - and when it isn't

That clarity is what separates average Spark users from high-impact data engineers.


Build Systems That Scale

Data systems fail when engineers treat Spark as a black box.

This book removes that black box.

By the end, you will be able to design and deploy robust, high-performance data pipelines - from ingestion to analytics - using Spark SQL, Structured Streaming, and modern cluster architectures.

Actriță & Poliglotă
EWA KASP pentru
Redă videoclipul
Ewa Kasp
Libristo are cea mai mare selecție de literatură în limbi străine. De aceea îmi cumpăr cărțile de aici.

Informații despre carte

Titlu complet Apache Spark 4.0
Limba engleză
Legare Carte - Carte broșată
Data publicării 2026
Număr pagini 172
EAN 9798249316587
Codul Libristo 51319811
Greutatea 311
Dimensiuni 178 x 254 x 9
Dăruiește această carte chiar astăzi
Este foarte ușor
1 Adaugă cartea în coș și selectează Livrează ca un cadou 2 Îți vom trimite un voucher în schimb 3 Cartea va ajunge direct la adresa destinatarului

Logare

Conectare la contul de utilizator Încă nu ai un cont Libristo? Crează acum!

 
obligatoriu
obligatoriu

Nu ai un cont? Beneficii cu contul Libristo!

Datorită contului Libristo, vei avea totul sub control.

Creare cont Libristo
Consilier de cărți Libroamiko
Bună ziua, sunt Libroamiko, vă pot ajuta?