Direct download from Google Drive using Google Drive API

Question

My desktop application  written in java  tries to download public files from Google Drive  As i found out  it can be implemented by using file s webContentLink  it s for ability to download public files without user authorization    So  the code below works with small files   String webContentLink   aFile getWebContentLink    InputStream in   new URL webContentLink  openStream      But it doesn t work on big files  because in this case file can t be downloaded directly via webContentLink without user confirmation with google virus scan warning  See an example  web content link   So my question is how to get content of a public file from Google Drive without user authorization

User · Answer

Update as of August 2020  This is what worked for me recently - Upload your file and get a shareable link which anyone can see Change permission from  quot Restricted quot  to  quot Anyone with the Link quot  in the share link options  Then run   SHAREABLE LINK  lt google drive shareable link gt   curl -L https   drive google com uc  id    echo  SHAREABLE LINK   cut -f6 -d quot   quot

User · Answer

This seems to be updated again as of May 19  2015   How I got it to work   As in jmbertucci s recently updated answer  make your folder public to everyone  This is a bit more complicated than before  you have to click Advanced to change the folder to  On - Public on the web     Find your folder UUID as before--just go into the folder and find your UUID in the address bar   https   drive google com drive folders  lt folder UUID gt    Then head to   https   googledrive com host  lt folder UUID gt    It will redirect you to an index type page with a giant subdomain  but you should be able to see the files in your folder  Then you can right click to save the link to the file you want  I noticed that this direct link also has this big subdomain for googledrive com   Worked great for me with wget   This also seems to work with others  shared folders   e g     https   drive google com folderview id 0B7l10Bj LprhQnpSRkpGMGV2eE0 amp usp sharing  maps to  https   googledrive com host 0B7l10Bj LprhQnpSRkpGMGV2eE0  And a right click can save a direct link to any of those files

User · Answer

If you face the  This file cannot be checked for viruses  intermezzo page  the download is not that easy    You essentially need to first download the normal download link  which however redirects you to the  Download anyway  page  You need to store cookies from this first request  find out the link pointed to by the  Download anyway  button  and then use this link to download the file  but reusing the cookies you got from the first request   Here s a bash variant of the download process using CURL   curl -c  tmp cookies  https   drive google com uc export download amp id DOCUMENT ID   gt   tmp intermezzo html curl -L -b  tmp cookies  https   drive google com  cat  tmp intermezzo html   grep -Po  uc-download-link     gt    href   K         sed  s   amp amp    amp  g     gt  FINAL DOWNLOADED FILENAME   Notes    this procedure will probably stop working after some Google changes the grep command uses Perl syntax  -P  and the  K  operator  which essentially means  do not include anything preceding  K to the matched result  I don t know which version of grep introduced these options  but ancient or non-Ubuntu versions probably don t have it a Java solution would be more or less the same  just take a HTTPS library which can handle cookies  and some nice text-parsing library

User · Answer

I faced an issue in direct download because I was logged in using multiple Google accounts  Solution is append authUser 0 parameter  Sample request URL to download  https   drive google com uc id FILEID amp authuser 0 amp export download

User · Answer

I would consider downloading from the link  scraping the page that you get to grab the confirmation link  and then downloading that   If you look at the  download anyway  URL it has an extra confirm query parameter with a seemingly randomly generated token  Since it s random   and you probably don t want to figure out how to generate it yourself  scraping might be the easiest way without knowing anything about how the site works   You may need to consider various scenarios

User · Answer

I simply create a javascript so that it automatically capture the link and download and close the tab  with the help of tampermonkey        UserScript       name         Bypass Google drive virus scan     namespace    SmartManoj     version      0 1     description  Quickly get the download link     author       SmartManoj     match        https   drive google com uc id   amp export download      grant        none       UserScript        function sleep ms          return new Promise resolve   gt  setTimeout resolve  ms               async function demo             await sleep 5000           window close                function             location replace document getElementById  uc-download-link   href           demo                Similarly you can get the html source of the url and download in java

User · Answer

https   drive google com uc export download amp id FILE ID replace the FILE ID with file id    if you don t know were is file id then check this article Article LINK

User · Answer

https   github com google skicka  I used this command line tool to download files from Google Drive  Just follow the instructions in Getting Started section and you should download files from Google Drive in minutes

User · Answer

I know this is an old question but I could not find a solution to this problem after some research  so I am sharing what worked for me   I have written this C  code for one of my projects  It can bypass the scan virus warning programmatically  The code can probably be converted to Java   using System  using System Collections Generic  using System ComponentModel  using System IO  using System Net  using System Text   public class FileDownloader   IDisposable       private const string GOOGLE DRIVE DOMAIN    drive google com       private const string GOOGLE DRIVE DOMAIN2    https   drive google com           In the worst case  it is necessary to send 3 download requests to the Drive address          1  an NID cookie is returned instead of a download warning cookie          2  download warning cookie returned          3  the actual file is downloaded     private const int GOOGLE DRIVE MAX DOWNLOAD ATTEMPT   3       public delegate void DownloadProgressChangedEventHandler  object sender  DownloadProgress progress            Custom download progress reporting  needed for Google Drive      public class DownloadProgress               public long BytesReceived  TotalBytesToReceive          public object UserState           public int ProgressPercentage                       get                               if  TotalBytesToReceive  gt  0L                       return  int       double  BytesReceived   TotalBytesToReceive     100                     return 0                                        Web client that preserves cookies  needed for Google Drive      private class CookieAwareWebClient   WebClient               private class CookieContainer                       private readonly Dictionary lt string  string gt  cookies   new Dictionary lt string  string gt                  public string this Uri address                                get                                       string cookie                      if  cookies TryGetValue  address Host  out cookie                             return cookie                       return null                                    set                                       cookies address Host    value                                                     private readonly CookieContainer cookies   new CookieContainer            public DownloadProgress ContentRangeTarget           protected override WebRequest GetWebRequest  Uri address                         WebRequest request   base GetWebRequest  address                if  request is HttpWebRequest                                 string cookie   cookies address                   if  cookie    null                          HttpWebRequest  request   Headers Set   cookie   cookie                     if  ContentRangeTarget    null                          HttpWebRequest  request   AddRange  0                               return request                     protected override WebResponse GetWebResponse  WebRequest request  IAsyncResult result                         return ProcessResponse  base GetWebResponse  request  result                         protected override WebResponse GetWebResponse  WebRequest request                         return ProcessResponse  base GetWebResponse  request                         private WebResponse ProcessResponse  WebResponse response                         string   cookies   response Headers GetValues   Set-Cookie                 if  cookies    null  amp  amp  cookies Length  gt  0                                 int length   0                  for  int i   0  i  lt  cookies Length  i                         length    cookies i  Length                   StringBuilder cookie   new StringBuilder  length                    for  int i   0  i  lt  cookies Length  i                         cookie Append  cookies i                      this cookies response ResponseUri    cookie ToString                               if  ContentRangeTarget    null                                 string   rangeLengthHeader   response Headers GetValues   Content-Range                     if  rangeLengthHeader    null  amp  amp  rangeLengthHeader Length  gt  0                                         int splitIndex   rangeLengthHeader 0  LastIndexOf                             if  splitIndex  gt   0  amp  amp  splitIndex  lt  rangeLengthHeader 0  Length - 1                                                 long length                          if  long TryParse  rangeLengthHeader 0  Substring  splitIndex   1    out length                                 ContentRangeTarget TotalBytesToReceive   length                                                                     return response                       private readonly CookieAwareWebClient webClient      private readonly DownloadProgress downloadProgress       private Uri downloadAddress      private string downloadPath       private bool asyncDownload      private object userToken       private bool downloadingDriveFile      private int driveDownloadAttempt       public event DownloadProgressChangedEventHandler DownloadProgressChanged      public event AsyncCompletedEventHandler DownloadFileCompleted       public FileDownloader                 webClient   new CookieAwareWebClient            webClient DownloadProgressChanged    DownloadProgressChangedCallback          webClient DownloadFileCompleted    DownloadFileCompletedCallback           downloadProgress   new DownloadProgress               public void DownloadFile  string address  string fileName                 DownloadFile  address  fileName  false  null               public void DownloadFileAsync  string address  string fileName  object userToken   null                 DownloadFile  address  fileName  true  userToken               private void DownloadFile  string address  string fileName  bool asyncDownload  object userToken                 downloadingDriveFile   address StartsWith  GOOGLE DRIVE DOMAIN      address StartsWith  GOOGLE DRIVE DOMAIN2            if  downloadingDriveFile                         address   GetGoogleDriveDownloadAddress  address                driveDownloadAttempt   1               webClient ContentRangeTarget   downloadProgress                    else             webClient ContentRangeTarget   null           downloadAddress   new Uri  address            downloadPath   fileName           downloadProgress TotalBytesToReceive   -1L          downloadProgress UserState   userToken           this asyncDownload   asyncDownload          this userToken   userToken           DownloadFileInternal               private void DownloadFileInternal                 if   asyncDownload                         webClient DownloadFile  downloadAddress  downloadPath                    This callback isn t triggered for synchronous downloads  manually trigger it             DownloadFileCompletedCallback  webClient  new AsyncCompletedEventArgs  null  false  null                        else if  userToken    null               webClient DownloadFileAsync  downloadAddress  downloadPath            else             webClient DownloadFileAsync  downloadAddress  downloadPath  userToken               private void DownloadProgressChangedCallback  object sender  DownloadProgressChangedEventArgs e                 if  DownloadProgressChanged    null                         downloadProgress BytesReceived   e BytesReceived              if  e TotalBytesToReceive  gt  0L                   downloadProgress TotalBytesToReceive   e TotalBytesToReceive               DownloadProgressChanged  this  downloadProgress                         private void DownloadFileCompletedCallback  object sender  AsyncCompletedEventArgs e                 if   downloadingDriveFile                         if  DownloadFileCompleted    null                   DownloadFileCompleted  this  e                      else                       if  driveDownloadAttempt  lt  GOOGLE DRIVE MAX DOWNLOAD ATTEMPT  amp  amp   ProcessDriveDownload                                      Try downloading the Drive file again                 driveDownloadAttempt                    DownloadFileInternal                              else if  DownloadFileCompleted    null                   DownloadFileCompleted  this  e                            Downloading large files from Google Drive prompts a warning screen and requires manual confirmation        Consider that case and try to confirm the download automatically if warning prompt occurs        Returns true  if no more download requests are necessary     private bool ProcessDriveDownload                 FileInfo downloadedFile   new FileInfo  downloadPath            if  downloadedFile    null               return true              Confirmation page is around 50KB  shouldn t be larger than 60KB         if  downloadedFile Length  gt  60000L               return true              Downloaded file might be the confirmation page  check it         string content          using  var reader   downloadedFile OpenText                              Confirmation page starts with  lt  DOCTYPE html gt   which can be preceeded by a newline             char   header   new char 20               int readCount   reader ReadBlock  header  0  20                if  readCount  lt  20       new string  header   Contains    lt  DOCTYPE html gt                         return true               content   reader ReadToEnd                       int linkIndex   content LastIndexOf   href    uc              if  linkIndex  lt  0               return true           linkIndex    6          int linkEnd   content IndexOf       linkIndex            if  linkEnd  lt  0               return true           downloadAddress   new Uri   https   drive google com    content Substring  linkIndex  linkEnd - linkIndex   Replace    amp amp      amp                return false                Handles the following formats  links can be preceeded by https             - drive google com open id FILEID        - drive google com file d FILEID view usp sharing        - drive google com uc id FILEID amp export download     private string GetGoogleDriveDownloadAddress  string address                 int index   address IndexOf   id              int closingIndex          if  index  gt  0                         index    3              closingIndex   address IndexOf    amp    index                if  closingIndex  lt  0                   closingIndex   address Length                    else                       index   address IndexOf   file d                  if  index  lt  0      address is not in any of the supported forms                 return string Empty               index    7               closingIndex   address IndexOf       index                if  closingIndex  lt  0                                 closingIndex   address IndexOf       index                    if  closingIndex  lt  0                       closingIndex   address Length                                   return string Concat   https   drive google com uc id    address Substring  index  closingIndex - index      amp export download                public void Dispose                 webClient Dispose              And here s how you can use it      NOTE  FileDownloader is IDisposable  FileDownloader fileDownloader   new FileDownloader        This callback is triggered for DownloadFileAsync only fileDownloader DownloadProgressChanged      sender  e     gt  Console WriteLine   Progress changed     e BytesReceived         e TotalBytesToReceive       This callback is triggered for both DownloadFile and DownloadFileAsync fileDownloader DownloadFileCompleted      sender  e     gt  Console WriteLine   Download completed      fileDownloader DownloadFileAsync   https   INSERT DOWNLOAD LINK HERE     C  downloadedFile txt

User · Answer

Case 1  download file with small size    You can use url with format https   drive google com uc export download amp id FILE ID and then inputstream of file can be obtained directly     Case 2  download file with large size    You stuck a wall of a virus scan alert page returned  By parsing html dom element  I tried to get link with confirm code under button  Download anyway  but it didn t work  Its may required cookie or session info  enter image description here   SOLUTION    Finally I found solution for two above cases  Just need to put httpConnection setDoOutput true  in connection step to get a Json           disposition   SCAN CLEAN    downloadUrl   http www       fileName   exam list json txt    scanResult   OK    sizeBytes  2392    Then  you can use any Json parser to read downloadUrl  fileName and sizeBytes    You can refer follow snippet  hope it help   private InputStream gConnect String remoteFile  throws IOException      URL  url   new URL remoteFile       URLConnection connection   url openConnection        if connection instanceof HttpURLConnection           HttpURLConnection httpConnection    HttpURLConnection  connection          connection setAllowUserInteraction false           httpConnection setInstanceFollowRedirects true           httpConnection setRequestProperty  User-Agent    Mozilla 4 0  compatible  MSIE 6 0  Windows 2000             httpConnection setDoOutput true                     httpConnection setRequestMethod  GET            httpConnection connect             int reqCode   httpConnection getResponseCode              if reqCode    HttpURLConnection HTTP OK               InputStream is   httpConnection getInputStream                Map lt String  List lt String gt  gt  map   httpConnection getHeaderFields                List lt String gt  values   map get  content-type                if values    null  amp  amp   values isEmpty                     String type   values get 0                    if type contains  text html                         String cookie   httpConnection getHeaderField  Set-Cookie                        String temp   Constants getPath mContext  Constants PATH TEMP      temp html                       if saveGHtmlFile is  temp                            String href   getRealUrl temp                           if href    null                               return parseUrl href  cookie                                                                       else if type contains  application json                         String temp   Constants getPath mContext  Constants PATH TEMP      temp txt                       if saveGJsonFile is  temp                            FileDataSet data   JsonReaderHelper readFileDataset new File temp                            if data getPath      null                               return parseUrl data getPath                                                                                                 return is                      return null       And     public static FileDataSet readFileDataset File file  throws IOException          FileInputStream is   new FileInputStream file           JsonReader reader   new JsonReader new InputStreamReader is   UTF-8              reader beginObject            FileDataSet rs   new FileDataSet            while reader hasNext                 String name   reader nextName                if name equals  downloadUrl                     rs setPath reader nextString                   else if name equals  fileName                     rs setName reader nextString                   else if name equals  sizeBytes                     rs setSize reader nextLong                   else                   reader skipValue                                    reader endObject            return rs

User · Answer

Update December 8th  2015 According to Google Support using the   googledrive com host ID   method will be turned off on Aug 31st  2016     I just ran into this issue   The trick is to treat your Google Drive folder like a web host   Update April 1st  2015  Google Drive has changed and there s a simple way to direct link to your drive   I left my previous answers below for reference but to here s an updated answer    Create a Public folder in Google Drive  Share this drive publicly    Get your Folder UUID from the address bar when you re in that folder  Put that UUID in this URL https   googledrive com host  lt folder UUID gt   Add the file name to where your file is located  https   googledrive com host  lt folder UUID gt   lt file name gt      Which is intended functionality by Google  new Google Drive Link   All you have to do is simple get the host URL for a publicly shared drive folder   To do this  you can upload a plain HTML file and preview it in Google Drive to find your host URL   Here are the steps    Create a folder in Google Drive  Share this drive publicly    Upload a simple HTML file  Add any additional files  subfolders ok    Open and  preview  the HTML file in Google Drive   Get the URL address for this folder   Create a direct link URL from your URL folder base   This URL should allow direct downloads of your large files     edit   I forgot to add  If you use subfolders to organize your files  you simple use the folder name as you would expect in a URL hierarchy   https   googledrive com host  lt your public folders id string gt  images my-image png    What I was looking to do  I created a custom Debian image with Virtual Box for Vagrant   I wanted to share this   box  file with colleagues so they could put the direct link into their Vagrantfile   In the end  I needed a direct link to the actual file   Google Drive problem  If you set the file permissions to be publicly available and create generate a direct access link by using something like the gdocs2direct tool or just crafting the link yourself   https   docs google com uc export download amp id  lt your file id gt   You will get a cookie based verification code and prompt  Google could not scan this file  prompt  which won t work for things such as wget or Vagrantfile configs   The code that it generates is a simple code that appends GET query variable     amp confirm     to the string  but it s per user specific  so it s not like you can copy paste that query variable for others   But if you use the above  Web page hosting  method  you can get around that prompt   I hope that helps

User · Answer

Using a Service Account might work for you

User · Answer

Check this out   wget https   raw githubusercontent com circulosmeos gdown pl master gdown pl chmod  x gdown pl   gdown pl https   drive google com file d FILE ID view TARGET PATH

User · Answer

If you just want to programmatically  as oppossed to giving the user a link to open in a browser  download a file through the Google Drive API  I would suggest using the downloadUrl of the file instead of the webContentLink  as documented here  https   developers google com drive web manage-downloads

[java] Direct download from Google Drive using Google Drive API

Examples related to java

Examples related to google-drive-api