The four biggest things from Day 2 of the event in Las Vegas and the storage giant moves into some surprising spaces
The main talking points from day two have come from Pivotal with enhancements to its data suite that are designed to help customers to create data driven applications. CBR has compiled a list to keep you up to date.
1. Hadoop Updates
Component updates to the company’s HD enterprise Apache Hadoop distribution have updated the Hadoop stack in order to provide better stability, management, security and data processing.
Pivotal HD is now based on a stanadard Open Data Platform core which consists of Apache Hadoop 2.6 and Apache Ambari.
Updates have been applied to the Hadoop components for scripting and query, otherwise known as Apache Pig and Apache Hive and the non-relational database HBase. Additional updates have been added to improve the basic coordination and workflow orchestration.
The Apache Spark core and machine learning library have been added and components to improve security such as Ranger, Knox and others have been added.
More updates have come with Pivotal’s Greenplum Database which offers a hundred times performance improvement and now comes with the Query Optimiser, which is a cost-based query optimiser for Big Data.
The Greenplum Database and HAWQ will handle large volumes of data at scale with an enhanced data structure and will enable simultaneous workloads on multiple analytics uses.
Zhang Min, Vice President, CITIC Bank, said: "As a long time Greenplum customer, we are excited to see Pivotal’s continued investment in the Greenplum Database."
"While the current product performs well, the new features will shorten the cycles of innovation by enabling complex queries on large data sets and bringing analytics closer to big data."
3. Cloud Ready
The Big Data Suite is designed to allow companies to modernise their data infrastructure and to discover more insights with advanced analytics. In addition to this it is aimed at helping to build applications at scale.
The improvements to the suite have included SQL and NoSQL databases as well as in-memory processing with the goal of reaching customers via virtualised and private cloud instances as well as in public clouds.
4. EMC Open Source
Given that it is EMC’s conference it’s something of a given that the company would make some sort of announcement.
This came in the form of its plans to release an open source project based on its ViPR Controller, Project CoprHD. It will be the first open source project that is based on one of its commercial software products.
CoprHD will be licensed under the Mozilla Public License 2.0 which will enable customers, partners and developers to expand and contribute to the project.
CJ Desai, President, EMC Emerging Technologies Division, said: "ViPR Controller was the perfect candidate for open source contribution. As fully transparent software focusing on storage automation, both the software and its users will benefit from community contributions."
"It’s an exciting time at EMC, and we’re looking forward to seeing the new ways Project CoprHD will evolve in the open source community."