2012-05-31 123 views
14

我需要从某些Linux服务器备份各种文件类型到GDrive(不只是可转换为GDocs格式)。如何使用Python脚本将文件上传到Google Drive?

什么是最简单,最优雅的方式来做到这一点与python脚本?有关GDoc的解决方案是否适用?

+3

什么是2016上传到g盘的方式? –

回答

10

您可以使用文档列表API编写写入到驱动器的脚本:

https://developers.google.com/google-apps/documents-list/

无论是文档列表API和驱动API使用相同的资源(即相同的文档和文件)交互。

此样品中的Python客户端库展示了如何上传未转换的文件到驱动器:

http://code.google.com/p/gdata-python-client/source/browse/samples/docs/docs_v3_example.py#180

+4

文档列表API已于2012年9月14日弃用,应使用Google Drive API代替https://developers.google.com/drive/ –

0

用于保存文件使用Python Google云端硬盘当前的文档可以在这里找到: https://developers.google.com/drive/v3/web/manage-uploads

但是,谷歌驱动器API处理文档存储和检索的方式并不遵循与POSIX文件系统相同的体系结构。因此,如果您希望在Linux文件系统上保留嵌套文件的分层体系结构,则需要编写大量自定义代码,以便将父目录保存在Google Drive上。

最重要的是,谷歌很难获得正常的驱动器帐户的写入权限。您的权限范围必须包含以下链接:https://www.googleapis.com/auth/drive并且要获取令牌以访问用户的正常帐户,该用户必须首先提供join a group以提供对未审核应用的访问权限。而且任何创建的oauth标记都具有有限的保质期。

但是,如果您获得访问令牌,则以下脚本应允许您将本地计算机上的任何文件保存到谷歌驱动器上的相同(相对)路径。

def migrate(file_path, access_token, drive_space='drive'): 

    ''' 
     a method to save a posix file architecture to google drive 

    NOTE: to write to a google drive account using a non-approved app, 
      the oauth2 grantee account must also join this google group 
      https://groups.google.com/forum/#!forum/risky-access-by-unreviewed-apps 

    :param file_path: string with path to local file 
    :param access_token: string with oauth2 access token grant to write to google drive 
    :param drive_space: string with name of space to write to (drive, appDataFolder, photos) 
    :return: string with id of file on google drive 
    ''' 

# construct drive client 
    import httplib2 
    from googleapiclient import discovery 
    from oauth2client.client import AccessTokenCredentials 
    google_credentials = AccessTokenCredentials(access_token, 'my-user-agent/1.0') 
    google_http = httplib2.Http() 
    google_http = google_credentials.authorize(google_http) 
    google_drive = discovery.build('drive', 'v3', http=google_http) 
    drive_client = google_drive.files() 

# prepare file body 
    from googleapiclient.http import MediaFileUpload 
    media_body = MediaFileUpload(filename=file_path, resumable=True) 

# determine file modified time 
    import os 
    from datetime import datetime 
    modified_epoch = os.path.getmtime(file_path) 
    modified_time = datetime.utcfromtimestamp(modified_epoch).isoformat() 

# determine path segments 
    path_segments = file_path.split(os.sep) 

# construct upload kwargs 
    create_kwargs = { 
     'body': { 
      'name': path_segments.pop(), 
      'modifiedTime': modified_time 
     }, 
     'media_body': media_body, 
     'fields': 'id' 
    } 

# walk through parent directories 
    parent_id = '' 
    if path_segments: 

    # construct query and creation arguments 
     walk_folders = True 
     folder_kwargs = { 
      'body': { 
       'name': '', 
       'mimeType' : 'application/vnd.google-apps.folder' 
      }, 
      'fields': 'id' 
     } 
     query_kwargs = { 
      'spaces': drive_space, 
      'fields': 'files(id, parents)' 
     } 
     while path_segments: 
      folder_name = path_segments.pop(0) 
      folder_kwargs['body']['name'] = folder_name 

    # search for folder id in existing hierarchy 
      if walk_folders: 
       walk_query = "name = '%s'" % folder_name 
       if parent_id: 
        walk_query += "and '%s' in parents" % parent_id 
       query_kwargs['q'] = walk_query 
       response = drive_client.list(**query_kwargs).execute() 
       file_list = response.get('files', []) 
      else: 
       file_list = [] 
      if file_list: 
       parent_id = file_list[0].get('id') 

    # or create folder 
    # https://developers.google.com/drive/v3/web/folder 
      else: 
       if not parent_id: 
        if drive_space == 'appDataFolder': 
         folder_kwargs['body']['parents'] = [ drive_space ] 
        else: 
         del folder_kwargs['body']['parents'] 
       else: 
        folder_kwargs['body']['parents'] = [parent_id] 
       response = drive_client.create(**folder_kwargs).execute() 
       parent_id = response.get('id') 
       walk_folders = False 

# add parent id to file creation kwargs 
    if parent_id: 
     create_kwargs['body']['parents'] = [parent_id] 
    elif drive_space == 'appDataFolder': 
     create_kwargs['body']['parents'] = [drive_space] 

# send create request 
    file = drive_client.create(**create_kwargs).execute() 
    file_id = file.get('id') 

    return file_id 

PS。我已经从labpack python模块修改了这个脚本。在该模块中有一个名为driveClient的类,该类由rcj1492编写,它以保留POSIX文件系统的方式处理保存,加载,搜索和删除谷歌驱动器上的文件。

from labpack.storage.google.drive import driveClient 
相关问题