使用openpyxl从内存中读取文件

我下载了一个谷歌电子表格作为对象在Python中。

我怎样才能使用openpyxl使用工作簿,而不必先保存到磁盘?

我知道xlrd可以做到这一点:

book = xlrd.open_workbook(file_contents=downloaded_spreadsheet.read()) 

将“downloaded_spreadsheet”作为我下载的xlsx文件作为对象。

而不是xlrd,我想使用openpyxl,因为更好的xlsx-support(我读过)。

我使用这个到目前为止…

 #!/usr/bin/python import openpyxl import xlrd # which to use..? import re, urllib, urllib2 class Spreadsheet(object): def __init__(self, key): super(Spreadsheet, self).__init__() self.key = key class Client(object): def __init__(self, email, password): super(Client, self).__init__() self.email = email self.password = password def _get_auth_token(self, email, password, source, service): url = "https://www.google.com/accounts/ClientLogin" params = { "Email": email, "Passwd": password, "service": service, "accountType": "HOSTED_OR_GOOGLE", "source": source } req = urllib2.Request(url, urllib.urlencode(params)) return re.findall(r"Auth=(.*)", urllib2.urlopen(req).read())[0] def get_auth_token(self): source = type(self).__name__ return self._get_auth_token(self.email, self.password, source, service="wise") def download(self, spreadsheet, gid=0, format="xls"): url_format = "https://spreadsheets.google.com/feeds/download/spreadsheets/Export?key=%s&exportFormat=%s&gid=%i" headers = { "Authorization": "GoogleLogin auth=" + self.get_auth_token(), "GData-Version": "3.0" } req = urllib2.Request(url_format % (spreadsheet.key, format, gid), headers=headers) return urllib2.urlopen(req) if __name__ == "__main__": email = "........@gmail.com" # (your email here) password = '.....' spreadsheet_id = "......" # (spreadsheet id here) # Create client and spreadsheet objects gs = Client(email, password) ss = Spreadsheet(spreadsheet_id) # Request a file-like object containing the spreadsheet's contents downloaded_spreadsheet = gs.download(ss) # book = xlrd.open_workbook(file_contents=downloaded_spreadsheet.read(), formatting_info=True) #It works.. alas xlrd doesn't support the xlsx-funcionality that i want... #ie being able to read the cell-colordata.. 

我希望任何人都可以提供帮助,因为我一直在努力从谷歌电子表格中获得给定单元格的颜色数据。 (我知道谷歌的API不支持它..)

load_workbook文档中说:

 #:param filename: the path to open or a file-like object 

所以它一直都是有能力的。 它读取一个path或采取类似文件的对象。 我只需要将由urlopen返回的类文件对象转换为具有以下内容的bytestream

 from io import BytesIO wb = load_workbook(filename=BytesIO(input_excel.read())) 

我可以阅读Google电子表格中的每一条数据。

实际上就是:

 file = open('path/to/file.xlsx', 'rb') wb = openpyxl.load_workbook(filename=file) 

它会工作。 不需要BytesIO和东西。