Custom Binary Input - Hadoop
问题 I am developing a demo application in Hadoop and my input is .mrc image files. I want to load them to hadoop and do some image processing over them. These are binary files that contain a large header with metadata followed by the data of a set of images. The information on how to read the images is also contained in the header (eg. number_of_images, number_of_pixels_x, number_of_pixels_y, bytes_per_pixel, so after the header bytes, the first [number_of_pixels_x*number_of_pixels_y*bytes_per