| By David Smith | Article Rating: |
|
| November 8, 2012 12:39 PM EST | Reads: |
1,614 |
We're pleased to announce that the latest update to Revolution R Enterprise is available today! Existing subscribers will soon receive an email with update instructions, and the free academic distribution will be updated later today. Version 6.1 adds a frequently-requested big-data statistical modeling algorithm, adds new connectivity option for Hadoop, improves performance, and provides new security and installation options for IT. Here's a summary of the new features:
Decision Trees for Big Data. The new “rxDTree” function is a powerful tool for fitting classification and regression trees, which are among the most frequently used algorithms for data analysis and data mining. The implementation provided in Revolution Analytics’ RevoScaleR package is parallelized, scalable, distributable and designed with big data in mind. Revolution R Enterprise continues to offer a wide range of other big-data analysis algorithms, including summary statistics, crosstabs, regression, generalized linear models and K-means clustering.
Improved performance for ‘Big Data’ files. RevoScaleR’s ‘XDF’ file format provides fast access to big data. With new compression technology the size of XDF files can be reduced, allowing for higher-performance analytics throughput and faster transfers into clusters or cloud processing systems.
Improved Linux installer. The installation process on Linux servers has been streamlined to meet stringent IT requirements, especially for non-root installs.
SiteMinder single-sign for applications: Authorized users of applications built on Revolution R Enterprise deployed via the RevoDeployR Web Services API may authenticate using CA SiteMinder®.
Analyze data from Hadoop Distributed File System (HDFS). With more and more data stored in Hadoop, this new option lets data scientists read data from HDFS and apply big-data statistical models from Revolution R Enterprise.
I'm especially excited about this last feature, which makes it possible to feed structured data files in Hadoop directly to the big-data statistical algorithms in the RevoScaleR package, as demonstrated in the video below. It pairs well with the RHadoop project: if you don't have structured data in Hadoop already, use the rmr package and map-reduce to create a structured data file in HDFS, and then analyze it with RevoScaleR.
You can find more details about the features in Revolution R Enterprise in this table. Read the original blog entry...
Published November 8, 2012 Reads 1,614
Copyright © 2012 SYS-CON Media, Inc. — All Rights Reserved.
Syndicated stories and blog feeds, all rights reserved by the author.
More Stories By David Smith
David Smith is Vice President of Marketing and Community at Revolution Analytics. He has a long history with the R and statistics communities. After graduating with a degree in Statistics from the University of Adelaide, South Australia, he spent four years researching statistical methodology at Lancaster University in the United Kingdom, where he also developed a number of packages for the S-PLUS statistical modeling environment. He continued his association with S-PLUS at Insightful (now TIBCO Spotfire) overseeing the product management of S-PLUS and other statistical and data mining products.< David smith is the co-author (with Bill Venables) of the popular tutorial manual, An Introduction to R, and one of the originating developers of the ESS: Emacs Speaks Statistics project. Today, he leads marketing for REvolution R, supports R communities worldwide, and is responsible for the Revolutions blog. Prior to joining Revolution Analytics, he served as vice president of product management at Zynchros, Inc. Follow him on twitter at @RevoDavid
- Cloud People: A Who's Who of Cloud Computing
- Cloud Expo New York: Cloud Is Changing the Economics of Business
- Windows Azure IaaS Reaches General Availability
- Cloudant to Exhibit at Cloud Expo & Big Data Expo New York
- Learn How To Use Google Apps Script
- Cloud Expo New York: Basics of SSD Technology and Its Use in Cloud
- Cloud Computing Is Simplifying Things
- Session Topics: 12th Cloud Expo / Cloud Expo New York
- CollabNet And UC4 Announce General Availability Of Joint Enterprise DevOps Platform
- Cloud Expo New York: The Big Challenge of Big Data & Hadoop Integration
- Overview of the OpenStack Cloud
- The Flexible Cloud
- Cloud People: A Who's Who of Cloud Computing
- Cloud Expo New York: Cloud Is Changing the Economics of Business
- Cloud Expo New York: How to Use Google Apps Script
- Windows Azure IaaS Reaches General Availability
- Rackspace Hosting Named “Platinum Plus Sponsor” of Cloud Expo New York
- Portable Experimenter’s Platform, Powered by Raspberry Pi
- Small Cancers, Big Data, and a Life Examined
- SUSE Receives Common Criteria Security Certifications
- Cloudant to Exhibit at Cloud Expo & Big Data Expo New York
- Basho Announces Open Source Riak CS and General Availability of Riak CS Enterprise v1.3
- Learn How To Use Google Apps Script
- Cloud Expo New York: Basics of SSD Technology and Its Use in Cloud
- After Ubuntu, Windows Looks Increasingly Bad, Increasingly Archaic, Increasingly Unfriendly
- SCO CEO Posts Open Letter to the Open Source Community
- Simula Labs Launches Hosted Delivery Platform To Enable Enterprise Open Source Adoption
- Where Are RIA Technologies Headed in 2008?
- Source Claims SCO Will Sue Google
- How Open Is "Open"? – Industry Luminaries Join the Debate
- Latest SCO News is Plain Weird
- SCO Claims Linux Lifted ELF
- IBM Tells SCO Court It Can't Find AIX-on-Power Code
- Developing an Application Using the Eclipse BIRT Report Engine API
- Should RIM BlackBerries Be Rented?
- Flashback: Investing in 'Professional Open Source' - Exclusive 2004 Interview with David Skok, Matrix Partners






















