I have a hacky way of achieving this using boto3 (1.4.4), pyarrow (0.4.1) and pandas (0.20.3).
First, I can read a single parq
You can use s3fs, a library from the Dask project that implements a filesystem interface for S3. Pass an S3FileSystem instance as the filesystem argument of ParquetDataset, like so:
    import pyarrow.parquet as pq
    import s3fs

    # Uses whatever AWS credentials are configured in the environment
    s3 = s3fs.S3FileSystem()
    dataset = pq.ParquetDataset('s3n://dsn/to/my/bucket', filesystem=s3)