Welcome!

Open Source Cloud Authors: William Schmarzo, Liz McMillan, Stackify Blog, Vaibhaw Pandey, Pat Romanski

Related Topics: @CloudExpo, Open Source Cloud

@CloudExpo: Blog Feed Post

Fast Track To Hadoop

The Quickest Way To Deploy A Well Engineered Apache Hadoop Solution To A Production Environment

As a former enterprise CTO and current technology watcher, I was struck at the incredible brilliance of yesterday’s announcement by Dell and Cloudera. In a move that will help enterprises of all sizes serve a very wide range of missions, those two organizations have announced a new relationship that will enable something that has never really existed before. Enterprises can now buy a Hadoop-centric solution that has been validated from end-to-end and is available from a single vendor. And they can do this on systems that have hardware designed for performing Big Data analytics.

Since their formation, Cloudera has been working to bring clarity and unity and support to the entire stack of software around Apache Hadoop. A key benefit of the Cloudera Distribution Including Apache Hadoop (CDH) is that it makes enterprise support in production environments possible. Every enterprise CIO wants supported software and CDH is the way to do that with the Cloudera supported Hadoop stack. You can think of this next move as taking that approach one huge step further.  Now enterprises will not only get a fully supported software stack of all relevant Big Data analytics tools and management tools, but they will also get that in a specially designed hardware system. And the same vendor will provide training, technology support, professional services and other support. In every sense of the word this system has been validated end to end.

Benefits for organizations and their missions include:

  • Much lower risk
  • Faster deployment
  • Increased efficiency/Higher performance
  • Total supportability
  • Much more agility and ability to support the organization’s mission.
For more see the press release below:

Dell and Cloudera Collaborate to Enable Large Scale Data Analysis and Modeling through Open Source Solution

Enterprises looking to process massive application datasets can now utilize the industry’s first complete Apache Hadoop solution

ROUND ROCK, Texas and PALO ALTO, Calif. – August 4, 2011 –Enterprises looking to process massive application datasets can now utilize the industry’s first complete Apache Hadoop solution, resulting from a relationship between Dell and Cloudera Inc. The Dell | Cloudera solution for Apache Hadoop combines Dell servers and networking components with Cloudera’s Distribution Including Apache Hadoop (CDH), as well as management tools, training, technology support and professional services, to give customers a single source to deploy, manage, and scale a comprehensive Apache Hadoop-based stack.

Aimed at financial services, energy, utility and telecom companies, research institutions, retail businesses, and Internet/media groups, The Dell | Cloudera solution for Apache Hadoop can reduce the complexity of deploying, configuring, and managing Hadoop systems. By combining the leading open source Apache Hadoop-based software platform and purpose-built hardware, the integrated Dell | Cloudera solution for Hadoop enables organizations to analyze large amounts of complex, dynamic data at a lower total cost of ownership (TCO) than proprietary solutions.

The Dell PowerEdge C line is designed and optimized to work in large scale-out computing environments which highlight the company’s approach to producing best of breed solutions. This offering brings simplicity and power helping enable more organizations to process big data sets in order to spot business trends and buying patterns, conduct risk modeling, direct customer profiling and retention programs, determine buying patterns or simulate physical, biological, and environmental processes. Enabling the analysis of large amounts of data gives customers the power to derive greater and faster insights that lead to a tangible business advantage.

“Many companies today are recognizing the inherent value in analyzing the vast amount of structured and unstructured data currently being stored across their organization. Apache Hadoop enables companies to perform complex analyses, including detailed special-purpose computation across large collections of mixed data. However, deployment and management complexity, as well as requirements for highly specialized knowledge, have been barriers to adoption,” said Merv Adrian, Research VP, Information Management at Gartner. “Providing complete end-to-end solutions for Apache Hadoop deployments will be key to significantly speeding its adoption within mainstream enterprise environments.”

The Dell | Cloudera solution for Apache Hadoop consists of CDH, Dell Crowbar software, and Cloudera Enterprise combined with a Dell PowerEdge C2100 server (other models to be added shortly) and PowerConnect 6248 48-port Gigabit Ethernet Layer 3 switch. Joint service and support and a deployment guide are also included. Available for purchase directly from Dell within the next 30 days, The Dell | Cloudera solution for Apache Hadoop is designed to offer customers:

  • Streamlined Deployment: Cloudera Enterprise and Dell Crowbar enable enterprises to manage the complete operational lifecycle of their Apache Hadoop systems. The combination simplifies the deployment and management of Apache Hadoop services, including the ability to install and configure Apache Hadoop in minutes from a central dashboard and continue to make configuration updates while the system is running.
  • Increased Efficiency: Designed with inspiration from Dell’s Data Center Solutions (DCS) business, the PowerEdge C series is a high performance data analytics, cloud compute platform and cloud storage server line-up. Feature- and power-optimized to provide lower total cost of ownership in addition to saving on space and energy, it is an ideal platform for high performance computing (HPC), Web 2.0, gaming, social networking, Software-as-a-Service (SaaS) and public and private cloud infrastructures.
  • Low-Risk: Built on CDH, the leading distribution of Apache Hadoop in commercial and non-commercial environments, and Dell’s performance-based PowerEdge C hardware, The Dell | Cloudera solution for Apache Hadoop is tested, validated, and supported by Dell, one of the industry’s most experienced providers to hyperscale computing environments.
  • Complete Support: Led by Dell, customers will have access to comprehensive and collaborative service and support for the entire solution, from installation, configuration, and deployment to optimization and tuning with existing IT infrastructures.

This solution also leverages Dell’s popular Crowbar software, which Dell has released to the community as open source. Crowbar manages the Apache Hadoop deployment from the initial server boot to the configuration of the primary Apache Hadoop components allowing users to complete bare metal deployment of multi-node Hadoop environments in a matter of hours, as opposed to days. Once the initial deployment is complete, Crowbar can be used to maintain, expand, and architect a complete data analytics solution, including BIOS configuration, network discovery, status monitoring, performance data gathering, and alerting. Crowbar provides the necessary tools and automation to manage the complete lifecycle of Hadoop environments.

Dell is committed to providing our customers with the best solutions for expanding IT capabilities without creating additional management requirements. Our PowerEdge C servers are purpose-built for performing Big Data analytics. Through our relationship with Cloudera, companies can now take advantage of the leading distribution of Apache Hadoop along with Dell servers and networking capabilities to easily and quickly deploy an end-to-end solution for Hadoop,” said John Igoe, executive director of Cloud Solutions at Dell. “Organizations can begin solving business challenges by analyzing structured and unstructured data while knowing that they have the backing of Dell service and support and access to Cloudera training and management tools, all via a single point of contact.”

Our relationship with Dell is a natural fit as we continue to expand the footprint of Apache Hadoop across mainstream commercial environments. Enterprises can now quickly and easily deploy an entire solution for Apache Hadoop that is optimized for production level environments,” said Ed Albanese, head of business development at Cloudera. “Cloudera customers have spoken clearly. They want a Hadoop configuration that has been validated from end–to-end and is available from a single vendor. We are excited that Dell is investing to meet this demand.”

Additional Information:

Dell Hadoop Solutions
Dell PowerEdge C Servers
Dell Crowbar Installer Blog
Cloudera Blog

About Dell

Dell Inc. (NASDAQ: DELL) listens to customers and delivers innovative technology and services that give them the power to do more. For more information, visit www.dell.com or email[email protected].

Contact Information

Media Contacts:

Hope Nicora
Cloudera
(831) 227-3660
[email protected]

Andre Fuochi
Dell
(512) 698-6757
[email protected]

Investor Relations Conacts:

Robert Williams
Dell
(512) 728-7570
[email protected]

# # #

Dell, PowerEdge and PowerConnect are trademarks of Dell Inc. Hadoop is trademarked by the Apache Software Foundation. Dell disclaims any proprietary interest in the marks and names of others.

About Cloudera

Cloudera, the leader in Apache Hadoop-based software and services, enables data driven enterprises to easily derive business value from all their structured and unstructured data. Cloudera’s Distribution Including Apache Hadoop (CDH), available to download for free atwww.cloudera.com/downloads, is the most comprehensive, tested, stable and widely deployed distribution of Hadoop in commercial and non-commercial environments. For the fastest path to reliably using this completely open source technology in production for Big Data analytics and answering previously un-addressable big questions, organizations can subscribe to Cloudera Enterprise, comprised of Cloudera Support and a portfolio of software including Cloudera Management Suite. Cloudera also offers consulting services, training and certification on Apache technologies. As the top contributor to the Apache open source community and with tens of thousands of nodes under management across customers in financial services, government, telecommunications, media, web, advertising, retail, energy, bioinformatics, pharma/healthcare, university research, oil and gas and gaming, Cloudera’s depth of experience and commitment to sharing expertise are unrivaled. www.cloudera.com

Connect with Cloudera

Read the blog: http://www.cloudera.com/blog/
Follow on Twitter: http://twitter.com/cloudera
Visit on Facebook: http://www.facebook.com/cloudera

Read the original blog entry...

More Stories By Bob Gourley

Bob Gourley writes on enterprise IT. He is a founder and partner at Cognitio Corp and publsher of CTOvision.com

Comments (0)

Share your thoughts on this story.

Add your comment
You must be signed in to add a comment. Sign-in | Register

In accordance with our Comment Policy, we encourage comments that are on topic, relevant and to-the-point. We will remove comments that include profanity, personal attacks, racial slurs, threats of violence, or other inappropriate material that violates our Terms and Conditions, and will block users who make repeated violations. We ask all readers to expect diversity of opinion and to treat one another with dignity and respect.


@ThingsExpo Stories
In his session at 21st Cloud Expo, Carl J. Levine, Senior Technical Evangelist for NS1, will objectively discuss how DNS is used to solve Digital Transformation challenges in large SaaS applications, CDNs, AdTech platforms, and other demanding use cases. Carl J. Levine is the Senior Technical Evangelist for NS1. A veteran of the Internet Infrastructure space, he has over a decade of experience with startups, networking protocols and Internet infrastructure, combined with the unique ability to it...
22nd International Cloud Expo, taking place June 5-7, 2018, at the Javits Center in New York City, NY, and co-located with the 1st DXWorld Expo will feature technical sessions from a rock star conference faculty and the leading industry players in the world. Cloud computing is now being embraced by a majority of enterprises of all sizes. Yesterday's debate about public vs. private has transformed into the reality of hybrid cloud: a recent survey shows that 74% of enterprises have a hybrid cloud ...
"Cloud Academy is an enterprise training platform for the cloud, specifically public clouds. We offer guided learning experiences on AWS, Azure, Google Cloud and all the surrounding methodologies and technologies that you need to know and your teams need to know in order to leverage the full benefits of the cloud," explained Alex Brower, VP of Marketing at Cloud Academy, in this SYS-CON.tv interview at 21st Cloud Expo, held Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clar...
"IBM is really all in on blockchain. We take a look at sort of the history of blockchain ledger technologies. It started out with bitcoin, Ethereum, and IBM evaluated these particular blockchain technologies and found they were anonymous and permissionless and that many companies were looking for permissioned blockchain," stated René Bostic, Technical VP of the IBM Cloud Unit in North America, in this SYS-CON.tv interview at 21st Cloud Expo, held Oct 31 – Nov 2, 2017, at the Santa Clara Conventi...
Gemini is Yahoo’s native and search advertising platform. To ensure the quality of a complex distributed system that spans multiple products and components and across various desktop websites and mobile app and web experiences – both Yahoo owned and operated and third-party syndication (supply), with complex interaction with more than a billion users and numerous advertisers globally (demand) – it becomes imperative to automate a set of end-to-end tests 24x7 to detect bugs and regression. In th...
Widespread fragmentation is stalling the growth of the IIoT and making it difficult for partners to work together. The number of software platforms, apps, hardware and connectivity standards is creating paralysis among businesses that are afraid of being locked into a solution. EdgeX Foundry is unifying the community around a common IoT edge framework and an ecosystem of interoperable components.
"MobiDev is a software development company and we do complex, custom software development for everybody from entrepreneurs to large enterprises," explained Alan Winters, U.S. Head of Business Development at MobiDev, in this SYS-CON.tv interview at 21st Cloud Expo, held Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA.
Large industrial manufacturing organizations are adopting the agile principles of cloud software companies. The industrial manufacturing development process has not scaled over time. Now that design CAD teams are geographically distributed, centralizing their work is key. With large multi-gigabyte projects, outdated tools have stifled industrial team agility, time-to-market milestones, and impacted P&L stakeholders.
"Space Monkey by Vivent Smart Home is a product that is a distributed cloud-based edge storage network. Vivent Smart Home, our parent company, is a smart home provider that places a lot of hard drives across homes in North America," explained JT Olds, Director of Engineering, and Brandon Crowfeather, Product Manager, at Vivint Smart Home, in this SYS-CON.tv interview at @ThingsExpo, held Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA.
"Akvelon is a software development company and we also provide consultancy services to folks who are looking to scale or accelerate their engineering roadmaps," explained Jeremiah Mothersell, Marketing Manager at Akvelon, in this SYS-CON.tv interview at 21st Cloud Expo, held Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA.
Coca-Cola’s Google powered digital signage system lays the groundwork for a more valuable connection between Coke and its customers. Digital signs pair software with high-resolution displays so that a message can be changed instantly based on what the operator wants to communicate or sell. In their Day 3 Keynote at 21st Cloud Expo, Greg Chambers, Global Group Director, Digital Innovation, Coca-Cola, and Vidya Nagarajan, a Senior Product Manager at Google, discussed how from store operations and ...
"There's plenty of bandwidth out there but it's never in the right place. So what Cedexis does is uses data to work out the best pathways to get data from the origin to the person who wants to get it," explained Simon Jones, Evangelist and Head of Marketing at Cedexis, in this SYS-CON.tv interview at 21st Cloud Expo, held Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA.
SYS-CON Events announced today that CrowdReviews.com has been named “Media Sponsor” of SYS-CON's 22nd International Cloud Expo, which will take place on June 5–7, 2018, at the Javits Center in New York City, NY. CrowdReviews.com is a transparent online platform for determining which products and services are the best based on the opinion of the crowd. The crowd consists of Internet users that have experienced products and services first-hand and have an interest in letting other potential buye...
SYS-CON Events announced today that Telecom Reseller has been named “Media Sponsor” of SYS-CON's 22nd International Cloud Expo, which will take place on June 5-7, 2018, at the Javits Center in New York, NY. Telecom Reseller reports on Unified Communications, UCaaS, BPaaS for enterprise and SMBs. They report extensively on both customer premises based solutions such as IP-PBX as well as cloud based and hosted platforms.
It is of utmost importance for the future success of WebRTC to ensure that interoperability is operational between web browsers and any WebRTC-compliant client. To be guaranteed as operational and effective, interoperability must be tested extensively by establishing WebRTC data and media connections between different web browsers running on different devices and operating systems. In his session at WebRTC Summit at @ThingsExpo, Dr. Alex Gouaillard, CEO and Founder of CoSMo Software, presented ...
WebRTC is great technology to build your own communication tools. It will be even more exciting experience it with advanced devices, such as a 360 Camera, 360 microphone, and a depth sensor camera. In his session at @ThingsExpo, Masashi Ganeko, a manager at INFOCOM Corporation, introduced two experimental projects from his team and what they learned from them. "Shotoku Tamago" uses the robot audition software HARK to track speakers in 360 video of a remote party. "Virtual Teleport" uses a multip...
A strange thing is happening along the way to the Internet of Things, namely far too many devices to work with and manage. It has become clear that we'll need much higher efficiency user experiences that can allow us to more easily and scalably work with the thousands of devices that will soon be in each of our lives. Enter the conversational interface revolution, combining bots we can literally talk with, gesture to, and even direct with our thoughts, with embedded artificial intelligence, whic...
SYS-CON Events announced today that Evatronix will exhibit at SYS-CON's 21st International Cloud Expo®, which will take place on Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA. Evatronix SA offers comprehensive solutions in the design and implementation of electronic systems, in CAD / CAM deployment, and also is a designer and manufacturer of advanced 3D scanners for professional applications.
Leading companies, from the Global Fortune 500 to the smallest companies, are adopting hybrid cloud as the path to business advantage. Hybrid cloud depends on cloud services and on-premises infrastructure working in unison. Successful implementations require new levels of data mobility, enabled by an automated and seamless flow across on-premises and cloud resources. In his general session at 21st Cloud Expo, Greg Tevis, an IBM Storage Software Technical Strategist and Customer Solution Architec...
To get the most out of their data, successful companies are not focusing on queries and data lakes, they are actively integrating analytics into their operations with a data-first application development approach. Real-time adjustments to improve revenues, reduce costs, or mitigate risk rely on applications that minimize latency on a variety of data sources. In his session at @BigDataExpo, Jack Norris, Senior Vice President, Data and Applications at MapR Technologies, reviewed best practices to ...