SQL Server Big data clusters has its own local HDFS built-in data lake to enable the storing of unstructured data and high volume data. This data virtualization capability has a feature called HDFS tiering. It is a major new contribution to the Apache Hadoop project.
With HDFS tiering you can access other data lakes by mounting the remote HDFS/S3 compatible data source to your local HDFS data lake. Access is seamlessly available from SQL Server or Apache Spark. Currently you can mount the following storage: Azure Data Lake Storage Gen2, AWS S3, Isilon, StorageGRID and Flashblase.