xlrd xls XLRDError: Unsupported format, or corrupt file: Expected BOF record; found '\r\n<html>'

Here is the code:

from xlrd import open_workbook
xls = open_workbook('data.xls')

It raises:

  File "/home/woles/P2/fin/fin/apps/data_container/importer.py", line 16, in import_data
    xls = open_workbook('data.xlsx')
  File "/home/woles/P2/fin/local/lib/python2.7/site-packages/xlrd/__init__.py", line 435, in open_workbook
    ragged_rows=ragged_rows,
  File "/home/woles/P2/fin/local/lib/python2.7/site-packages/xlrd/book.py", line 91, in open_workbook_xls
    biff_version = bk.getbof(XL_WORKBOOK_GLOBALS)
  File "/home/woles/P2/fin/local/lib/python2.7/site-packages/xlrd/book.py", line 1230, in getbof
    bof_error('Expected BOF record; found %r' % self.mem[savpos:savpos+8])
  File "/home/woles/P2/fin/local/lib/python2.7/site-packages/xlrd/book.py", line 1224, in bof_error
    raise XLRDError('Unsupported format, or corrupt file: ' + msg)
XLRDError: Unsupported format, or corrupt file: Expected BOF record; found '\r\n<html>'

The file is not corrupted; I can open it with both Excel and LibreOffice.

Try opening it with pandas:

 import pandas as pd
 # read_html returns a list of DataFrames, one per <table> found in the file
 data = pd.read_html('filename.xls')

Or try any other Python HTML parser.
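As one illustration of the "any other HTML parser" route, here is a minimal sketch that uses only the standard library's html.parser, assuming Python 3 and a file that holds a single, non-nested table. The names TableParser, parse_html_table and sample are made up for this example, and the sample markup is invented; it is not the content of the file from the question.

```python
from html.parser import HTMLParser


class TableParser(HTMLParser):
    """Collect the text of every <td>/<th> cell, grouped by <tr> row."""

    def __init__(self):
        super().__init__()
        self.rows = []      # completed rows, each a list of cell strings
        self._row = None    # row currently being built, or None
        self._cell = None   # text fragments of the current cell, or None

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        # Only keep text that appears inside a table cell.
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None
            self._cell = None  # drop any cell left open by malformed markup


def parse_html_table(markup):
    """Return the table cells in `markup` as a list of rows."""
    parser = TableParser()
    parser.feed(markup)
    parser.close()
    return parser.rows


# Invented sample that starts the same way as the bytes xlrd complained about.
sample = (
    "\r\n<html><body><table>"
    "<tr><th>a</th><th>b</th></tr>"
    "<tr><td>1</td><td>2</td></tr>"
    "</table></body></html>"
)
print(parse_html_table(sample))
```

For a real file you would read its contents with open() and pass the decoded text to parse_html_table; the right encoding depends on whatever produced the file, so that part is left out here.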

This is not a real Excel file; it is an HTML file that Excel is able to read.
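To check what a file with an .xls extension really contains before choosing a reader, a rough sketch like the following can help. It assumes Python 3 and looks only at the first bytes of the file. The names sniff_spreadsheet and sniff_file are hypothetical helpers, not part of xlrd or pandas, and the HTML check is a simple heuristic rather than a full format detector.

```python
# Signature of an OLE2 compound document, the container used by real .xls files.
OLE2_MAGIC = b"\xd0\xcf\x11\xe0\xa1\xb1\x1a\xe1"
# Signature of a zip archive, the container used by .xlsx files.
ZIP_MAGIC = b"PK\x03\x04"


def sniff_spreadsheet(head):
    """Guess the real format from the leading bytes of a file."""
    if head.startswith(OLE2_MAGIC):
        return "xls"
    if head.startswith(ZIP_MAGIC):
        return "xlsx"
    lowered = head.lower()
    if b"<html" in lowered or b"<table" in lowered:
        return "html"
    return "unknown"


def sniff_file(path, size=512):
    """Read the first `size` bytes of `path` and guess its format."""
    with open(path, "rb") as handle:
        return sniff_spreadsheet(handle.read(size))


# The eight bytes quoted in the XLRDError message from the question:
print(sniff_spreadsheet(b"\r\n<html>"))
```

When the result is "html", pd.read_html or an HTML parser is the appropriate tool; when it is "xls", xlrd's open_workbook should work as expected.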