Apache Hadoop

Global View Distributed File System with Mount Points

Apache Hadoop Distributed File System (HDFS) is the most popular file system in the big data world. The Apache Hadoop File System interface provides integration with many other popular storage systems like Apache Ozone, S3, Azure Data Lake Storage…

Dancing with Elephants in 5 Easy Steps

The Corner Office is pressing their direct reports across the company to “Move To The Cloud” to increase agility and reduce costs. And next to those legacy ERP, HCM, SCM and CRM systems, that mysterious elephant in the room –…

Industry Transformation: the new business as usual

The Industry Transformation category at the Data Impact Awards has never been more timely. While the business world is mostly focused on digital transformation, Cloudera and our customers know that true, data-driven change is reshaping whole enterprises and entire industries….

Fair Scheduler to Capacity Scheduler conversion tool

In Apache Hadoop YARN 3.x (YARN for short), switching to Capacity Scheduler has considerable benefits and only a few drawbacks. To bring these features to users who are currently using Fair Scheduler, we created a tool with the upstream…