Blob Code Download Much Slower Than MS Azure Storage Explorer
Answer :
You should specify which version of MS Azure Storage explorer your're using.
If you're using some newer versions of 1.9.0 / 1.8.1 / 1.8.0 etc.(please find more details in this link), then Azure Storage Explorer is integrated with azcopy which is using simple commands designed for optimal performance. So you can have a good-performance for downloading / uploading etc.
When using code for downloading / uploading blobs, you can take use of this Microsoft Azure Storage Data Movement Library. This library is based on the core data movement framework that powers AzCopy, which also provides you high-performance uploading, downloading.
I eventually tried 2 solutions proposed by @Ivan and @mjwills:
- DownloadToFileParallelAsync resulted in 10min 12secs
- Microsoft Azure Storage Data Movement Library resulted in 9min 35secs
Both solutions much faster than the original DownloadToFileAsync. DownloadToFileParallelAsync is only available in later versions of the library and hence was not available in the one I had installed.
I'd suggest using DownloadToFileParallelAsync
.
As per the docs:
Initiates an asynchronous operation to download the contents of a blob to a file by making parallel requests.
and:
The parallelIOCount and rangeSizeInBytes should be adjusted depending on the CPU, memory, and bandwidth.
This API should only be used for larger downloads as a HEAD request is made prior to downloading the data.
For smaller blobs, please use DownloadToFileAsync().
To get the best performance, it is recommended to try several values, and measure throughput.
One place to start would be to set the parallelIOCount to the number of CPUs.
Then adjust the rangeSizeInBytes so that parallelIOCount times rangeSizeInBytes equals the amount of memory you want the process to consume.
The benefit of this method vs DownloadToFileAsync
is that multiple 'slices' of the file are downloaded in parallel (at the same time). This can be beneficial for large files over fast internet connections (in most cases, I'd expect it to be 4-8 times faster).
Comments
Post a Comment