What is Spark?


Apache Spark is an open source parallel processing framework which is designed to run data analytics across clustered computers.

The general engine for large-scale data processing is maintained by the Apache Software Foundation.

Spark is designed to provide programmers with an application programming interface (API) centred on a resilient distributed dataset, this is a multiset of data items distributed over a cluster of machines.

Spark is capable of running on Hadoop, Mesos, standalone, or in the cloud and can access numerous data sources such as HDFS, Cassandra, HBase, and S3.


Why has Spark become so popular?

Name of Technology:

Apache Spark

Year Introduced:


Invented by :

Matei Zaharia

Open Source or Proprietary:

Open Source

Main technology category:

Data analytics

Previous ArticleWhat is a Chromebook?
Next ArticleWhat is Quark?
Comments (0)

Leave a Reply

Your email address will not be published. Required fields are marked *


  • Favorite list is empty.
FavoriteLoadingClear favorites

Your favorite posts saved to your browsers cookies. If you clear cookies also favorite posts will be deleted.