Do you need a personalized offer for your team? Contact us at academy@esolutions.ro or call 0753.029.187

Apache Spark

Spark
COURSE FEATURES
Course Duration
Duration
3 Days
Skill Level
Skill level
All levels info Course tailored to participants' expertise.
Course Language
Language
Romanian
Course Type
Type
In-class / Online-live
Assesments
Assessments
Yes
Price for Custom Training
Custom Training
1800 Eur / Day
Price for Open Class Course
Open Class
850 Eur / Participant

Apache Spark has gained popularity for its ability to handle diverse data processing workloads, including batch processing, real-time data streaming, machine learning, and graph processing.

Its flexibility, performance, and rich set of libraries make it a versatile choice for building data-intensive applications and performing complex analytics on large-scale datasets.

This introduction course covers the main concepts in Apache Spark:

  • the core (architecture, RDDs/Dataframes/Datasets, transformations & actions, DAG)

  • the SQL engine

  • the streaming engine

  • machine learning libraries

It also highlights the possible usage of Spark in different use cases like: ETL, Analytics and Machine Learning.

During the course, we will build an end-to-end case with Spark, from data input, data cleaning, data storage, and machine learning. We will work in a cloud environment and we will use Apache Zeppelin for all the Spark coding/exercises (Scala).

This course is taught in Romanian, with course materials available in either English, or Romanian. The course can be customized - on request.

DAY
1

TOPICS

12

TOPICS
12
A brief history of Spark
Where Spark fits in the big data landscape
Apache Spark vs. Apache MapReduce: An overall architecture comparison
Cluster Architecture: cluster manager, workers, executors; Spark Context; Cluster Manager Types; Deployment scenarios
How Spark schedules and executes jobs and tasks
Resilient Distributed Datasets : Fundamentals & hands on exercises
Ways to create an RDD: Parallelize Collection; Read from external data source; From existing RDD
Introduction to Transformations and Actions
Caching
How transformations lazily build up a Directed Acyclic Graph (DAG)
Shuffling
Hands on: using Spark for ETL
DAY
2

TOPICS

5

TOPICS
5
What are DataFrames/Datasets vs RDD’s
The DataFrames/Datasets API
Spark SQL
Creating and running DataFrame operations
Reading from multiple data sources and hands on exercises: HDFS; noSQL; Hive
DAY
3

TOPICS

6

TOPICS
6
When to use Spark Streaming
DStream
Structured streaming: Building a Spark streams out of Kafka topics; Windowing & Aggregation; Register a Spark DF stream
Spark’s MLlib and MLlib Pipeline API (Spark.ml) for Machine Learning
Spark MLlib and Spark.ml
Machine Learning Examples: Collaborative filtering: Alternating Least Squares; Classification and regression

Contact Us

Feel free to leave us your thoughts so we can discover the solution together!

EMAIL

academy@esolutions.ro

Get in touch

0753.029.187

Our address

20 Constantin Budisteanu Street , 1 st. District, Bucharest

By clicking on "Send" button above, you agree with our terms of use.

EMAIL

academy@esolutions.ro

Get in touch

0753.029.187

Our address

20 Constantin Budisteanu Street , 1 st. District, Bucharest

Related Courses

Apache NIFI
Big Data

This course covers the main concepts in Apache NiFi and its potential in automated data flow between systems, providing efficient data ingestion, transformation, and routing.

DURATION
3 Days
Apache Storm
Big Data

This course covers the main concepts in Apache Storm and its potential in use cases such as real-time analytics, continuous computation, and event-driven apps.

DURATION
3 Days
Big Data
Big Data

This course covers the usage and applicability of big data technologies and concepts necessary for architecting and building a big data architecture.

DURATION
4 Days
Cassandra
Big Data

This introductory course covers common use cases for Cassandra, its key features, storage architecture, and more.

DURATION
3 Days
+ View all courses
dividerLeft

Do you want to become a trainer?

Ask not what your company can do for you, ask what you can do for your company. Apply for a training position.

Apply Now