Red Hat to Contribute Red Hat Storage Hadoop Plug-in to Apache Hadoop Community
Red Hat, Inc. (NYSE: RHT), the world’s leading provider of open source solutions, today announced its big data direction and solutions to satisfy enterprise requirements for highly reliable, scalable, and manageable solutions to effectively run their big data analytics workloads. In addition, Red Hat announced that the company will contribute its Red Hat Storage Hadoop plug-in to the ApacheTM Hadoop® open community to transform Red Hat Storage into a fully-supported, Hadoop-compatible file system for big data environments, and that Red Hat is building a robust network of ecosystem and enterprise integration partners to deliver comprehensive big data solutions to enterprise customers. This is another example of Red Hat’s strategic commitment to big data customers and its continuing efforts to provide them with enterprise solutions through community-driven innovation.
Red Hat big data infrastructure and application platforms are ideally suited for enterprises leveraging the open hybrid cloud environment. Red Hat is working with the open cloud community to support big data customers. Many enterprises worldwide use public cloud infrastructure, such as Amazon® Web Services (AWS), for the development, proof-of-concept, and pre-production phases of their big data projects. The workloads are then moved to their private clouds to scale up the analytics with the larger data set. An open hybrid cloud environment enables enterprises to transfer workloads from the public cloud into their private cloud without the need to re-tool their applications. Red Hat is actively engaged in the open cloud community through projects like OpenStack and OpenShift Origin to help meet these enterprise big data expectations both today and in the future.
Today, there are several Red Hat solutions available to effectively manage enterprise big data workloads. Focused on three primary areas, Red Hat’s big data direction includes extending its product portfolio to deliver enhanced enterprise-class infrastructure solutions and application platforms, and partnering with leading big data analytics vendors and integrators.
Red Hat’s Big Data Infrastructure Solutions
- Red Hat Enterprise Linux – According to the Jan. 2012 The Linux Foundation Enterprise Linux User Report, the majority of big data implementations run on Linux and as the leading provider of commercial Linux1, Red Hat Enterprise Linux is a leading platform for big data deployments. Red Hat Enterprise Linux excels in distributed architectures and includes features that address critical big data needs. Managing tremendous data volumes and intensive analytic processing requires an infrastructure designed for high performance, reliability, fine-grained resource management, and scale-out storage. Red Hat Enterprise Linux addresses these challenges while adding the ability to develop, integrate, and secure big data applications reliably and scale easily to keep up with the pace that data is generated, analyzed, or transferred. This can be accomplished in the cloud, making it easier to store, aggregate, normalize, and integrate data from sources across multiple platforms, whether they are deployed as physical, virtual, or cloud-based resources.
- Red Hat Storage – Built on the trusted Red Hat Enterprise Linux operating system and the proven GlusterFS distributed file system, Red Hat Storage Servers can be used to pool inexpensive commodity servers to provide a cost-effective, scalable, and reliable storage solution for big data.
Announced today, Red Hat intends to make its Hadoop plug-in for Red Hat Storage available to the Hadoop community later this year. Currently in technology preview, the Red Hat Storage Apache Hadoop plug-in provides a new storage option for enterprise Hadoop deployments that delivers enterprise storage features while maintaining the API compatibility and local data access the Hadoop community expects. Red Hat Storage brings enterprise-class features to big data environments, such as Geo replication, High Availability, POSIX compliance, disaster recovery, and management, without compromising API compatibility and data locality. Customers now have a unified data and scale out storage software platform to accommodate files and objects deployed across physical, virtual, public and hybrid cloud resources.
- Red Hat Enterprise Virtualization – Announced in Dec. 2012, Red Hat Enterprise Virtualization 3.1 is integrated with Red Hat Storage, enabling it to access the secure, shared storage pool managed by Red Hat Storage. This integration also offers enterprises reduced operational costs, expanded portability, choice of infrastructure, scalability, availability and the power of community-driven innovation with the contributions of the open source oVirt and Gluster projects. The combination of these platforms furthers Red Hat’s open hybrid cloud vision of an integrated and converged Red Hat Storage and Red Hat Enterprise Virtualization node that serves both compute and storage resources.
Red Hat’s Big Data Application and Integration Platforms
- Red Hat JBoss Middleware – Red Hat JBoss Middleware provides enterprises with powerful technologies for creating and integrating big data-driven applications that are able to interact with new and emerging technologies like Hadoop or MongoDB. Big data is only valuable when businesses can extract information and respond intelligently. Red Hat JBoss Middleware solutions can populate large volumes and varieties of data quickly and reliably into Hadoop with high speed messaging technologies; simplify working with MongoDB through Hibernate OGM; process large volumes of data quickly and easily with Red Hat JBoss Data Grid; access Hadoop along with your traditional data sources with JBoss Enterprise Data Services Platform; and identify opportunities and threats through pattern recognition with JBoss Enterprise BRMS. Red Hat’s middleware portfolio is well-suited to help enterprises seize the opportunities of big data.
Big Data Partnerships
- Big Data Ecosystem Partners – To provide a comprehensive big data solution set to enterprises, Red Hat plans to partner with leading big data software and hardware providers to offer interoperability. Development of certified and documented reference architectures are expected to allow users to integrate and install comprehension enterprise big data solutions.
- Enterprise Partners – Red Hat anticipates enabling the delivery of a comprehensive big data solution to its customers through leading enterprise integration partners utilizing the reference architectures developed by Red Hat and its big data ecosystem partners.
Ranga Rangachari, vice president and general manager, Storage, Red Hat, “With today’s announcement, Red Hat demonstrates its strong commitment to continue to provide enterprise infrastructure and platforms to effectively run big data applications today and in the growing open hybrid cloud environment. With true enterprise-class offerings, Red Hat leverages the power of the open source community to give our big data customers a choice in technology, deployment environments, and partners.”
Ashish Nadkarni, research director, Storage Systems and co-lead, Big Data Global Overview, IDC, “Red Hat is uniquely positioned to excel in enterprise big data solutions, a market that IDC expects to grow from $6 billion in 2011 to $23.8 billion in 2016.2 Red Hat is one of the very few infrastructure providers that can deliver a comprehensive big data solution because of the breadth of its infrastructure solutions and application platforms for on-premises or cloud delivery models. As a leading contributor to open source communities developing essential technologies for the big data IT stack – from Linux to OpenStack Origin and Gluster – Red Hat will continue to play a pivotal role in Big Data.”
1 IDC Worldwide Linux Client & Server Operating Environments 2012 – 2016 Forecast and 2011 Vendor Shares, July 2012
2 Big Data: Global Overview: Market Analysis, December 2012, IDC #238746, Volume: 1