XML input and Hadoop – custom InputFormat