Beginning Apache Spark 2: With Resilient Distributed Datasets, Spark SQL, Structured Streaming and Spark Machine Learning library
Autor Hien Luuen Limba Engleză Paperback – 16 aug 2018
Develop
applications
for
the
big
data
landscape
with
Spark
and
Hadoop.
This
book
also
explains
the
role
of
Spark
in
developing
scalable
machine
learning
and
analytics
applications
with
Cloud
technologies.Beginning
Apache
Spark
2gives
you
an
introduction
to
Apache
Spark
and
shows
you
how
to
work
with
it.
Along
the
way,
you’ll
discover
resilient
distributed
datasets
(RDDs);
use
Spark
SQL
for
structured
data;
and
learn
stream
processing
and
build
real-time
applications
with
Spark
Structured
Streaming.
Furthermore,
you’ll
learn
the
fundamentals
of
Spark
ML
for
machine
learning
and
much
more.
After
you
read
this
book,
you
will
have
the
fundamentals
to
become
proficient
in
using
Apache
Spark
and
know
when
and
how
to
apply
it
to
your
big
data
applications.
What
You
Will
Learn
- Understand Spark unified data processing platform
- How
to
run
Spark
in
Spark
Shell
or
Databricks
- Use
and
manipulate
RDDs
- Deal with structured data using Spark SQL through its operations and advanced functions
- Build
real-time
applications
using
Spark
Structured
Streaming
- Develop
intelligent
applications
with
the
Spark
Machine
Learning
library
Who
This
Book
Is
For
Programmers
and
developers
active
in
big
data,
Hadoop,
and
Java
but
who
are
new
to
the
Apache
Spark
platform.
Preț: 229.39 lei
Preț vechi: 286.74 lei
-20% Nou
Puncte Express: 344
Preț estimativ în valută:
43.90€ • 46.31$ • 36.59£
43.90€ • 46.31$ • 36.59£
Carte disponibilă
Livrare economică 12-26 decembrie
Livrare express 27 noiembrie-03 decembrie pentru 42.73 lei
Preluare comenzi: 021 569.72.76
Specificații
ISBN-13: 9781484235782
ISBN-10: 1484235789
Pagini: 310
Dimensiuni: 178 x 254 x 23 mm
Greutate: 0.76 kg
Ediția:1st ed.
Editura: Apress
Colecția Apress
Locul publicării:Berkeley, CA, United States
ISBN-10: 1484235789
Pagini: 310
Dimensiuni: 178 x 254 x 23 mm
Greutate: 0.76 kg
Ediția:1st ed.
Editura: Apress
Colecția Apress
Locul publicării:Berkeley, CA, United States
Cuprins
1.
Introduction
to
Apache
Spark
2.
Working
with
Apache
Spark
3.
Resilient
Distributed
Dataset
4.
Spark
SQL
-
Foundation
5.
Spark
SQL
-
Advanced
6.
Spark
Streaming
7.
Spark
Streaming
Advanced
8.
Machine
Learning
with
Spark.
Notă biografică
Hien
Luu
has
extensive
experience
in
designing
and
building
big
data
applications
and
scalable
web-based
applications.
He
is
particularly
passionate
about
the
intersection
between
big
data
and
machine
learning.
Hien
enjoys
working
with
open
source
software
and
has
contributed
to
Apache
Pig
and
Azkaban.
Teaching
is
also
one
of
his
passions,
and
he
serves
as
an
instructor
at
the
UCSC
Silicon
Valley
Extension
school
teaching
Apache
Spark.
He
has
given
presentations
at
various
conferences
such
a
QCon
SF,
QCon
London,
Seattle
Data
Day,
Hadoop
Summit,
and
JavaOne.
Textul de pe ultima copertă
Develop
applications
for
the
big
data
landscape
with
Spark
and
Hadoop.
This
book
also
explains
the
role
of
Spark
in
developing
scalable
machine
learning
and
analytics
applications
with
Cloud
technologies. Beginning
Apache
Spark
2 gives
you
an
introduction
to
Apache
Spark
and
shows
you
how
to
work
with
it.
Along
the
way,
you’ll
discover
resilient
distributed
datasets
(RDDs);
use
Spark
SQL
for
structured
data;
and
learn
stream
processing
and
build
real-time
applications
with
Spark
Structured
Streaming.
Furthermore,
you’ll
learn
the
fundamentals
of
Spark
ML
for
machine
learning
and
much
more.
After
you
read
this
book,
you
will
have
the
fundamentals
to
become
proficient
in
using
Apache
Spark
and
know
when
and
how
to
apply
it
to
your
big
data
applications.
You
will:
- Understand
Spark
unified
data
processing
platform
- Use
and
manipulate
RDDs
- Deal with structured data using Spark SQL
- Build
real-time
applications
using
Spark
Structured
Streaming
- Develop
intelligent
applications
with
the
Spark
Machine
Learning
library
Caracteristici
A
tutorial
on
the
Apache
Spark
platform
written
by
an
expert
engineer
and
trainer
using
and
teaching
Spark
One of the very first books on the new Apache Spark 2.1
Apache Spark is the leading alternative to Hadoop
One of the very first books on the new Apache Spark 2.1
Apache Spark is the leading alternative to Hadoop