What the Evolving Apache Hadoop Ecosystem Will Mean for Storage Developers

webinar

Author(s)/Presenter(s):

Sanjay Radia

Library Content Type

Presentation

Library Release Date

Focus Areas

Abstract

During the last 12 months, the Apache Hadoop ecosystem has experienced tremendous growth and empowered enterprises to better handle large volumes of data. In many ways, this data explosion is outpacing current storage, management, and processing approaches. As a result of the growing ecosystem, HDFS, the storage engine for Apache Hadoop that allows data to be processed in parallel, has evolved, enabling better isolation, faster startups and upgrades, and better scalability. In this presentation, Sanjay Radia, one of the founders of Hortonworks, will discuss the latest advancements in HDFS, describe the improvements currently in the pipeline, and explain how these changes will drive the future of storage in the enterprise.

Learning Objectives

How HDFS provides developers with performance improvements for local access
Why HDFS does not use disk RAID, and what impact that has on recovery, reliability, and operational management
What enterprise support improvements are in the pipeline, including support for snapshots and greater storage efficiency?
Should cold archival data sit on separate clusters? What are some of the options?