Extracting XML Data from HDFS Sequence Files July 21, 2015 Jul 21, 2015 07/21/15 Beat Schwegler In this post, you will learn how to implement a MapReduce job to efficiently extract XML data from HDFS sequence files using a custom RecordReader.
Leveraging HDInsight from On-Premises Hadoop July 21, 2015 Jul 21, 2015 07/21/15 Beat Schwegler How to take advantage of Microsoft Azure HDInsight to run a remote Hadoop MapReduce job using Apache Templeton.