Extracting XML Data from HDFS Sequence Files July 21, 2015 Jul 21, 2015 07/21/15 Beat Schwegler In this post, you will learn how to implement a MapReduce job to efficiently extract XML data from HDFS sequence files using a custom RecordReader.