• New engineered solution from EMC Information Infrastructure, Pivotal and VMware accelerates and automates deployment of Data Lakes
  • Enables organizations to deploy Hadoop and real-time analytics capabilities in as little as seven days
  • EMC offers a complete portfolio of Data Lake Services for customers at every stage of the Big Data journey

Connect With Us:

  • Watch the Federation Business Data Lake virtual event:
  • Join EMC’s Ask Me Anything chat with EMC “Dean of Big Data” Bill Schmarzo:
  • Follow @EMCCorp, @EMCbigdata and use #RedefineBigData
  • Read EMC’s perspective on the EMC Pulse blog: Big Data Game-Changer: Federation Business Data Lake
  • Read blogs from Federation Business Data Lake Ecosystem Partners:
    • Brocade – Brocade New IP Technologies for Lightning Fast Big Data Deployments Leveraged in EMC Federation Business Data Lake Solution
    • Cloudera – Bringing EDH to the EMC Business Data Lake
    • Hortonworks – EMC Business Data Lake
    • Pivotal – New Federation Business Data Lake Should Be Your Silver Bullet for Big Data Success
    • SAS – EMC and SAS Redefine Big Data Analytics with the Data Lake

HOPKINTON, MA, MARCH 23, 2015 - EMC Corporation (NYSE:EMC) today announced the Federation Business Data Lake. The fully engineered solution includes leading storage and Big Data analytics technologies from EMC Information Infrastructure, Pivotal, and VMware to help customers leverage the new world of Big Data, thereby clearing the path for new insights and disruptive differentiation.

Implemented in as little as seven days, the Federation Business Data Lake greatly simplifies the massively complex task of building a Data Lake and is designed for speed, self-service and scalability for the enterprise, enabling organizations to begin making better-informed business decisions using Big Data analytics. The Federation Business Data Lake joins the Enterprise Hybrid Cloud Solution as a converged solution from the EMC Federation that will redefine infrastructure to maximize the speed and agility for IT organizations deploying Hybrid Clouds and Data Lakes.

The incredible potential of Big Data is being driven first and foremost by the growth of data from traditional applications, modern applications, sensors and intelligent devices, along with masses of new public data such as social media feeds. Capturing and processing that data is now possible because of the growth of inexpensive storage and limitless compute, along with the invention of new technologies that enable real-time analysis and a direct connection to action through new applications and products. These storage and analytics technologies, along with the massive data sets, comprise the Business Data Lake.

Business Data Lakes are becoming a top corporate priority because they fill a critical gap left by traditional data warehousing. A Business Data Lake contains structured and unstructured data from a wide variety of sources and the analytics are focused on building models to predict the future.  Companies with successful Data Lakes are leveraging the data and predictive models to build new products, applications and business models to redefine their industry, taking or extending the “Market Leader” role.

A highly effective Business Data Lake will provide three critical functions:

  • Store: Stores structured and unstructured data for all types of analytics, from many different sources, blending capacity and performance as needed for the analytics use case.
  • Analyze: Provides modern data management and analytics tools for all types of analytics, including Hadoop-based, in-memory NoSQL and scale-out MPP.
  • Surface & Act: Provides data to users and applications to enable real-time changes in outcomes and to influence critical decisions.

Until now, building an effective Data Lake has been difficult and complex. IT organizations seeking to deploy a Data Lake must deploy and configure the right analytics platform and the right corresponding storage for each analytics use case, from Hadoop to real-time.  Once the environment is created, data must be loaded with all the right access rights and governance applied to the data sets. Deployment of the environment and data sets is a complex and time-consuming task, preventing IT from meeting the needs of business users.

The Federation Business Data Lake Solution makes it easy to deploy a Business Data Lake. Core products from the EMC Federation of companies (EMC Information Infrastructure, Pivotal and VMware) provide the core functionality of the Federation Business Data Lake, meeting the critical functional needs: Store, Analyze, and Surface & Act.

The Federation Business Data Lake is a fully engineered solution that can be rapidly and automatically provisioned, enabling IT organizations to stay ahead of the needs of the business. The analytics layer is completely virtualized with VMware running on Vblocks® with predefined analytics use cases and automated provisioning and configuration. EMC® Isilon® provides the Data Lake Storage Foundation, delivering the ideal balance of capacity and performance.

The analytics layer consists of the Pivotal Big Data Suite, including Pivotal HD, featuring the world’s leading SQL-on-Hadoop engine, HAWQ. Pivotal Big Data Suite provides enterprise-class SQL, which allows for seamless integration and interoperability with top analytics platforms such as SAS, Tableau and others, over data stored in Hadoop. EMC is also delivering two additional Business Data Lakes to enable integration with the customer’s choice of Hadoop distribution, including Cloudera and Hortonworks, along with any future Open Data Platform-based Hadoop distribution.

A full suite of services and education is available with the Federation Business Data Lake to enable customers at varying stages of their Data Lake journey to implement the solution, prove out the value of the solution and quickly identify strategic Big Data use cases, including:

  • EMC Technology Onboarding Service: For customers who are ready to deploy a Data Lake, the EMC Technology Onboarding Service offers full consulting services to install and deploy the Federation Business Data Lake, optimize the analytics environment and configure and customize data requirements.
  • EMC Proof of Value Service: For customers who know the use case they want to address but are looking for help implementing the latest big data analytic and rapid application development tools and techniques, the Proof of Value Service demonstrates the ROI of a targeted use case using real customer data.
  • EMC Big Data Vision Workshop: For customers who are undecided about how to start infusing Big Data into their business strategy, the EMC Big Data Vision Workshop analyzes an organization’s strategy and business goals, then prioritizes a target use case for the start of its Big Data journey.
  • Education Services: In addition to the service offerings above, EMC offers training and certification to develop fundamental as well as advanced Big Data and Data Science understanding and skills required by business leaders and Big Data practitioners.

The Federation Business Data Lake will be offered in Directed Availability in April 2015 with General Availability in select countries.

Dan Cutler, Director, Technical Operations, Adobe Digital Marketing
“A solution like the Federation Business Data Lake to help organizations realize new technical solutions for Big Data management and storage is essential to businesses that are constantly looking for new revenue streams or cost-saving opportunities to stay relevant in the market. Big Data solutions have enabled us to expand our offerings to include Hadoop-as-a-Service, extending the value that Adobe Digital Marketing can provide to our customers.”

Jeffrey Kelly, Principal Analyst, Big Data, Wikibon
“As the number of connected devices continues to soar, the world of digital business is disrupting traditional business models. It’s no longer just about the product itself, but the data and the application of the data. With Big Data, organizations can gather as much data as possible and apply analytic techniques to understand the data, make predictions and take action, resulting in the creation of new business models. Big Data solutions like the Federation Business Data Lake allow organizations to bring data, analytics and applications together to realize new business opportunities.”

Josh Kahn, Senior Vice President, Global Solutions, EMC Corporation
“Nearly every traditional business model faces near-term, lasting disruption. The fast track to competitive advantage will be reserved for those able to quickly embrace and yield value from the massive growth in data, but it will take a new approach. The new Federation Business Data Lake solution makes it easy to harness all types of data to build predictive models that enable new applications, products and business models to redefine industries.”

EMC Corporation is a global leader in enabling businesses and service providers to transform their operations and deliver IT as a service. Fundamental to this transformation is cloud computing. Through innovative products and services, EMC accelerates the journey to cloud computing, helping IT departments to store, manage, protect and analyze their most valuable asset — information — in a more agile, trusted and cost-efficient way. Additional information about EMC can be found at

EMC², EMC, Vblock, Isilon and the EMC logo are registered trademarks or trademarks of EMC Corporation in the United States and other countries. All other trademarks used herein are the property of their respective owners. © Copyright 2015 EMC Corporation. All rights reserved.

Based on a 1PB Business Data Lake. Includes deployment of converged infrastructure, Hadoop, structured data and real-time analytics tools so data can be analyzed.

Jen Sorenson

Source: EMC