Welcome!

Open Source Cloud Authors: Elizabeth White, Christopher Keene, Dana Gardner, Craig Lowell, SmartBear Blog

Blog Feed Post

Adoption of R by large Enterprise Software Vendors

by Uday Tennety:  Director, Advanced Analytics Services at Revolution Analytics The R ecosystem has become widely popular lately with large players such as Pivotal, Tibco, Oracle, IBM, Teradata and SAP integrating R into their product suites. All these big players are using value chain integration and platform envelopment strategies to build a network effect in order to gain maximum leverage against their competitors in the Big Data and Analytics space. Big Data movement has gained a lot of traction in the enterprise space, and the ecosystem is rapidly evolving. The end goal for most enterprises is not to collect, store and manage data, but to obtain new business insights through predictive modeling and analytics. With this objective in mind, many enterprise software vendors have embraced R for their analytics story. Below is my analysis on the various distributions of R provided by large Enterprise software vendors along with their integration strategies: Oracle R Enterprise Oracle R Distribution is Oracle's free distribution of open source R.  Oracle R Enterprise integrates Oracle R Distribution/ R, the open source scripting language and environment, with Oracle Database. Oracle R Enterprise primarily introduces a variant to many R data types by overloading them in order to integrate Oracle database with R. But, the names of the Oracle R Enterprise data types are the same as the names of corresponding R data types prefixed by "ore". Oracle’s strategy with Oracle’s R Enterprise is to provide in-database analytics capabilities for its widely adopted enterprise RDBMS, and for its Exadata appliance. Tibco’s TERR Tibco with its acquisition of S+ technology from Insightful in 2008, built its own distribution of R called TERR. TERR has been built from ground up in C++, and their team has redesigned data object representation by implementing those objects using abstract C++ classes.  Also, TERR claims to provide better performance and memory management compared to open source R. According to the company sources, TERR is compatible with open source R, and runs analytics by loading data in memory.  But TERR does not yet support data from disk, streaming and database sources, and the company plans to support them sometime in future. Tibco has recently integrated TERR with their data visualization tool, Spotfire, to make it easy for enterprises choosing Spotfire to run an R based analytics tool. PivotalR Pivotal was officially launched on April 1, 2013, as EMC decided to group together a set of EMC, VMware and Pivotal Lab’s products to offer a differentiated Enterprise grade big data platform. Pivotal’s strategy with R is very similar to Oracle’s strategy with their Oracle R Enterprise. PivotalR is a package that enables users of R to interact with the Pivotal (Greenplum) Database as well as Pivotal HD and HAWQ for Big Data analytics. It does so by providing an interface to the operations on tables and views in the database. PivotalR also claims to provide parallel and distributed computation ability on Pivotal for big data analytics. It also provides a wrapper for MADlib, which is an open-source library for parallel and scalable in-database analytics. SAP SAP has integrated R with their in-memory database, HANA, to allow usage of R for specific statistical functions. But, SAP does not ship the R environment with SAP HANA database, nor does it provide support for R. In order to use the SAP HANA integration with R, one needs to download R from CRAN and configure it. Also, an Rserve configuration is needed for this integration to work. SAP’s strategy for integrating HANA with R is to provide a well-known and robust environment for advanced data analysis, while providing a support mechanism in HANA for specific statistical functions. Teradata Teradata has partnered with Revolution Analytics to provide a platform that brings parallelized analytical algorithms to the data. Revolution R Enterprise 7 for Teradata includes a library of Parallel External Memory Algorithms (PEMAs) that run directly in parallel on the Teradata nodes. This strategy provides a scalable solution to run analytical algorithms in-database, in parallel, by bringing analytics to the data in its true sense. IBM Similar to Teradata, IBM has also partnered with Revolution Analytics to provide advanced data analysis capabilities to its PureData System for Analytics platform (fka Netezza).  Revolution R Enterprise for PureData System for Analytics, enables the execution of advanced R computations for rapid analysis of hundreds of petabyte-class data volumes. Today, businesses are scrambling to build IT infrastructure to extract value from all the data available to them. They are afraid that their competitors might get there first and gain a competitive advantage. In short, enterprises are now in a Big Analytics arms race. With strong partners, a powerful community and with a promise of an easy-to-integrate solution, R is in a great position to capitalize on the Big Data and Analytics revolution.      

Read the original blog entry...

More Stories By David Smith

David Smith is Vice President of Marketing and Community at Revolution Analytics. He has a long history with the R and statistics communities. After graduating with a degree in Statistics from the University of Adelaide, South Australia, he spent four years researching statistical methodology at Lancaster University in the United Kingdom, where he also developed a number of packages for the S-PLUS statistical modeling environment. He continued his association with S-PLUS at Insightful (now TIBCO Spotfire) overseeing the product management of S-PLUS and other statistical and data mining products.<

David smith is the co-author (with Bill Venables) of the popular tutorial manual, An Introduction to R, and one of the originating developers of the ESS: Emacs Speaks Statistics project. Today, he leads marketing for REvolution R, supports R communities worldwide, and is responsible for the Revolutions blog. Prior to joining Revolution Analytics, he served as vice president of product management at Zynchros, Inc. Follow him on twitter at @RevoDavid

@ThingsExpo Stories
The Internet of Things can drive efficiency for airlines and airports. In their session at @ThingsExpo, Shyam Varan Nath, Principal Architect with GE, and Sudip Majumder, senior director of development at Oracle, will discuss the technical details of the connected airline baggage and related social media solutions. These IoT applications will enhance travelers' journey experience and drive efficiency for the airlines and the airports. The session will include a working demo and a technical d...
Although it has gained significant traction in the consumer space, IoT is still in the early stages of adoption in enterprises environments. However, many companies are working on initiatives like Industry 4.0 that includes IoT as one of the key disruptive technologies expected to reshape businesses of tomorrow. The key challenges will be availability, robustness and reliability of networks that connect devices in a business environment. Software Defined Wide Area Network (SD-WAN) is expected to...
Developing software for the Internet of Things (IoT) comes with its own set of challenges. Security, privacy, and unified standards are a few key issues. In addition, each IoT product is comprised of (at least) three separate application components: the software embedded in the device, the back-end service, and the mobile application for the end user’s controls. Each component is developed by a different team, using different technologies and practices, and deployed to a different stack/target –...
Internet of @ThingsExpo, taking place November 1-3, 2016, at the Santa Clara Convention Center in Santa Clara, CA, is co-located with 19th Cloud Expo and will feature technical sessions from a rock star conference faculty and the leading industry players in the world. The Internet of Things (IoT) is the most profound change in personal and enterprise IT since the creation of the Worldwide Web more than 20 years ago. All major researchers estimate there will be tens of billions devices - comp...
SYS-CON Events announced today that Numerex Corp, a leading provider of managed enterprise solutions enabling the Internet of Things (IoT), will exhibit at the 19th International Cloud Expo | @ThingsExpo, which will take place on November 1–3, 2016, at the Santa Clara Convention Center in Santa Clara, CA. Numerex Corp. (NASDAQ:NMRX) is a leading provider of managed enterprise solutions enabling the Internet of Things (IoT). The Company's solutions produce new revenue streams or create operating...
As cloud adoption continues to transform business, today’s global enterprises are challenged with managing a growing amount of information living outside of the data center. The rapid adoption of IoT and increasingly mobile workforce are exacerbating the problem. Ensuring secure data sharing and efficient backup poses capacity and bandwidth considerations as well as policy and regulatory compliance issues.
Why do your mobile transformations need to happen today? Mobile is the strategy that enterprise transformation centers on to drive customer engagement. In his general session at @ThingsExpo, Roger Woods, Director, Mobile Product & Strategy – Adobe Marketing Cloud, covered key IoT and mobile trends that are forcing mobile transformation, key components of a solid mobile strategy and explored how brands are effectively driving mobile change throughout the enterprise.
Data is the fuel that drives the machine learning algorithmic engines and ultimately provides the business value. In his session at Cloud Expo, Ed Featherston, a director and senior enterprise architect at Collaborative Consulting, will discuss the key considerations around quality, volume, timeliness, and pedigree that must be dealt with in order to properly fuel that engine.
19th Cloud Expo, taking place November 1-3, 2016, at the Santa Clara Convention Center in Santa Clara, CA, will feature technical sessions from a rock star conference faculty and the leading industry players in the world. Cloud computing is now being embraced by a majority of enterprises of all sizes. Yesterday's debate about public vs. private has transformed into the reality of hybrid cloud: a recent survey shows that 74% of enterprises have a hybrid cloud strategy. Meanwhile, 94% of enterpri...
Data is an unusual currency; it is not restricted by the same transactional limitations as money or people. In fact, the more that you leverage your data across multiple business use cases, the more valuable it becomes to the organization. And the same can be said about the organization’s analytics. In his session at 19th Cloud Expo, Bill Schmarzo, CTO for the Big Data Practice at EMC, will introduce a methodology for capturing, enriching and sharing data (and analytics) across the organizati...
SYS-CON Events announced today that Pulzze Systems will exhibit at the 19th International Cloud Expo, which will take place on November 1–3, 2016, at the Santa Clara Convention Center in Santa Clara, CA. Pulzze Systems, Inc. provides infrastructure products for the Internet of Things to enable any connected device and system to carry out matched operations without programming. For more information, visit http://www.pulzzesystems.com.
SYS-CON Events announced today Telecom Reseller has been named “Media Sponsor” of SYS-CON's 19th International Cloud Expo, which will take place on November 1–3, 2016, at the Santa Clara Convention Center in Santa Clara, CA. Telecom Reseller reports on Unified Communications, UCaaS, BPaaS for enterprise and SMBs. They report extensively on both customer premises based solutions such as IP-PBX as well as cloud based and hosted platforms.
Almost two-thirds of companies either have or soon will have IoT as the backbone of their business in 2016. However, IoT is far more complex than most firms expected. How can you not get trapped in the pitfalls? In his session at @ThingsExpo, Tony Shan, a renowned visionary and thought leader, will introduce a holistic method of IoTification, which is the process of IoTifying the existing technology and business models to adopt and leverage IoT. He will drill down to the components in this fra...
Pulzze Systems was happy to participate in such a premier event and thankful to be receiving the winning investment and global network support from G-Startup Worldwide. It is an exciting time for Pulzze to showcase the effectiveness of innovative technologies and enable them to make the world smarter and better. The reputable contest is held to identify promising startups around the globe that are assured to change the world through their innovative products and disruptive technologies. There w...
There is growing need for data-driven applications and the need for digital platforms to build these apps. In his session at 19th Cloud Expo, Muddu Sudhakar, VP and GM of Security & IoT at Splunk, will cover different PaaS solutions and Big Data platforms that are available to build applications. In addition, AI and machine learning are creating new requirements that developers need in the building of next-gen apps. The next-generation digital platforms have some of the past platform needs a...
With so much going on in this space you could be forgiven for thinking you were always working with yesterday’s technologies. So much change, so quickly. What do you do if you have to build a solution from the ground up that is expected to live in the field for at least 5-10 years? This is the challenge we faced when we looked to refresh our existing 10-year-old custom hardware stack to measure the fullness of trash cans and compactors.
The emerging Internet of Everything creates tremendous new opportunities for customer engagement and business model innovation. However, enterprises must overcome a number of critical challenges to bring these new solutions to market. In his session at @ThingsExpo, Michael Martin, CTO/CIO at nfrastructure, outlined these key challenges and recommended approaches for overcoming them to achieve speed and agility in the design, development and implementation of Internet of Everything solutions wi...
Today we can collect lots and lots of performance data. We build beautiful dashboards and even have fancy query languages to access and transform the data. Still performance data is a secret language only a couple of people understand. The more business becomes digital the more stakeholders are interested in this data including how it relates to business. Some of these people have never used a monitoring tool before. They have a question on their mind like “How is my application doing” but no id...
Cloud computing is being adopted in one form or another by 94% of enterprises today. Tens of billions of new devices are being connected to The Internet of Things. And Big Data is driving this bus. An exponential increase is expected in the amount of information being processed, managed, analyzed, and acted upon by enterprise IT. This amazing is not part of some distant future - it is happening today. One report shows a 650% increase in enterprise data by 2020. Other estimates are even higher....
Smart Cities are here to stay, but for their promise to be delivered, the data they produce must not be put in new siloes. In his session at @ThingsExpo, Mathias Herberts, Co-founder and CTO of Cityzen Data, will deep dive into best practices that will ensure a successful smart city journey.