Big Blue unveils initiatives around the Apache Spark open-source software.
IBM has unveiled a set of initiatives related to Apache Spark, the open-source software for processing and analysing vast amounts of data.
Apart from integrating Spark into its analytics and commerce platforms, IBM said it would offer Spark as a cloud service on Bluemix, allowing developers to load data and derive predictive artifacts for use in their apps.
The company plans to open a technology centre in San Francisco to undertake Spark-based innovations.
IBM also plans to educate more than 1 million data scientists and data engineers on Spark, and to have more than 3,500 researchers and developers work on Spark-related projects across the world.
The company will also contribute its IBM SystemML machine learning technology to the Spark open-source ecosystem.
Beth Smith, general manager of Analytics Platform at IBM Analytics, said: "We believe strongly in the power of open source as the basis to build value for clients, and are fully committed to Spark as a foundational technology platform for accelerating innovation and driving analytics across every business in a fundamental way.
"Our clients will benefit as we help them embrace Spark to advance their own data strategies to drive business transformation and competitive differentiation."
Spark is a scalable data analytics platform whose in-memory computing primitives give it performance advantages over Hadoop's disk-based MapReduce approach for some workloads.
It helps organisations more easily integrate big data into applications and drives business transformation for IBM clients.