A build of Apache PySpark that uses the hadoop-cloud Maven profile to bundle hadoop-aws 3.x, which contains the S3A connector. The pyspark distribution on PyPI ships with Hadoop 2.7 and no cloud jars (i.e. hadoop-aws ...
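
As a rough illustration of what bundling hadoop-aws enables, the sketch below passes S3A settings to a Spark session. It assumes static access-key credentials; the helper names `s3a_spark_conf` and `build_session` are hypothetical and not part of this project. The `fs.s3a.*` keys are standard Hadoop S3A options, passed through Spark's `spark.hadoop.` prefix.

```python
def s3a_spark_conf(access_key, secret_key, endpoint=None):
    """Return Spark config entries for the S3A connector (hypothetical helper)."""
    conf = {
        # Spark copies any "spark.hadoop.*" entry into the Hadoop configuration
        "spark.hadoop.fs.s3a.access.key": access_key,
        "spark.hadoop.fs.s3a.secret.key": secret_key,
    }
    if endpoint is not None:
        # Only needed for non-default endpoints, e.g. an S3-compatible store
        conf["spark.hadoop.fs.s3a.endpoint"] = endpoint
    return conf


def build_session(conf, app_name="s3a-example"):
    """Create a SparkSession with the given config entries applied."""
    # Imported lazily so s3a_spark_conf can be used without pyspark installed
    from pyspark.sql import SparkSession

    builder = SparkSession.builder.appName(app_name)
    for key, value in conf.items():
        builder = builder.config(key, value)
    return builder.getOrCreate()
```

With a build that includes hadoop-aws, a session created by `build_session(s3a_spark_conf(key, secret))` should be able to read `s3a://` paths, for example `spark.read.text("s3a://some-bucket/some-prefix/")`, where the bucket and prefix are placeholders.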