Read uncompressed Thrift files in Spark
Question

I'm trying to get Spark to read uncompressed Thrift files from S3. So far it has not been working.

The data is loaded into S3 as uncompressed Thrift files. The source is AWS Kinesis Firehose. I have a tool that deserializes the files with no problem, so I know that Thrift serialization/deserialization works.

In Spark, using newAPIHadoopFile with Elephant Bird's LzoThriftBlockInputFormat, I am able to successfully read LZO-compressed Thrift files. I can't figure out which InputFormat I should use to