Skip to content

Latest commit

 

History

History
27 lines (23 loc) · 1.61 KB

2015-09-12-apache-spark-tutorial.md

File metadata and controls

27 lines (23 loc) · 1.61 KB
layout title date author tags modified_time
post
Apache Spark Tutorial with Hortonworks Data Platform
2015-09-12T12:34:00.001-07:00
Saptak Sen
spark
hadoop
2015-09-12T15:11:18.054-07:00

Apache Spark is a fast, in-memory data processing engine with an elegant development API that allows data workers to efficiently execute algorithms which require iterative access to datasets, like machine learning algorithms. Spark on YARN enables deep integration with Hadoop and other YARN enabled workloads in the enterprise.

Below, we are going to explore the basic concepts of Apache Spark and the first few necessary steps to get started.

Table of Contents