使用OpenPyXL将Excel数据写入Python中的错误

对不起，打了一堆深奥的代码，但我遇到了一个错误，我不知道如何解决。

基本上，我想读取电子表格列中的单个单元格，并将其数据写入相应的字典（称为dataSet）。

我创build了一个function来做到这一点：

def loopCol(col, start_offset, write_list): ''' Loop through 1 column (col) until it ends. Skip header by start_offset. Write data to list within DataSet dict ''' from openpyxl.utils import column_index_from_string # Create list and capture str of its name list_string = str(write_list) print(list_string) if list_string not in dataSet: raise KeyError('List name not within DataSet Keys') write_list = [] # Loop through column, capturing each cell's value # Use start_offset to skip header cells for i in range(column_index_from_string(col) + start_offset, sheet.max_row + 1): listItem = sheet[col + str(i)].value print(listItem) if listItem != None: if isinstance(listItem, datetime.datetime): listItem = listItem.strftime('%d/%m/%Y') write_list.append(listItem) else: write_list.append(listItem) # Write data to dataSet for list_index in write_list: dataSet[list_string] = [list_index for list_index in write_list] loopCol('A', 0, 'dates') loopCol('B', 0, 'ph') loopCol('C', 0, 'water_level') loopCol('D', 0, 'salinity') loopCol('E', 1, 'conductivity') loopCol('F', 0, 'tds')

所以从理论上说，这应该是通过一列中的所有单元格，如果它们中有一些值，就把这个值写到这个字典中对应的地方：

 dataSet = { 'dates': [], 'ph': [], 'water_level': [], 'salinity': [], 'conductivity': [], 'tds': [] }

但是，有一个问题。当所有事情都说完后，字典看起来像：

 {'ph': [3.4, 2.1, 7], 'salinity': [2.2, 1.2], 'conductivity': [5.3], 'water_level': ['2m', '3m', '1m'], 'tds': [], 'dates': ['Date', '21/01/2016', '28/01/2012', '06/03/2012']}

现在我知道在每列中都有3个单元格。但有些人并没有把它写进字典。 “盐度”只有2个值，“电导率”只有一个，“tds”是空的。这些恰好是dataSet字典中的最后一个条目，所以也许这是原因的一部分。但我无法弄清楚逻辑中的错误在哪里。

这是上下文文件的屏幕

有人可以帮忙吗？我真的很想给我的老板留下深刻的印象;）（我不是在IT部门工作，所以任何让人们生活更轻松的计算机技术都会被惊叹和敬畏）。

如果我没有做好足够的准确解释代码是什么让我知道，我会尽力澄清。

你可以尝试这样的事情：

 def colValues(sheet, keys, offsets=None): if offsets is None or not len(offsets): # Set offsets default to 0 offsets = {k: 0 for k in keys} if len(offsets) != len(keys): # If offsets given, fail if length mismatch raise AttributeError() res = {} for column in sheet.columns: # Store current column header field (ie its name) ch = column[0].value if ch not in keys: # Fail early: No need for any tests if this column's data # is not desired in result set. continue # Append all row values to the result dict with respect to the # given column offset. Note: Lowest possible row index is 1, # because here we assume that header fields are present. res[ch] = [c.value for c in column[offsets[keys.index(ch)] + 1:]] return res if __name__ == '__main__': xlsx = 'test.xlsx' ws = load_workbook(xlsx)['Sheet1'] ds = colValues(ws, ['foo', 'bar'], [0, 1]) print(ds)

对于我的小testing，这会产生每列正确数量的项目。注意，键'bar'在这里只有一个项目，因为在上面的函数调用中它的偏移量更高。

 {u'foo': [2.3, 3.5, 5.6, 7.9], u'bar': [6.2, 3.6, 9L]}

另外，代码更轻。

使用OpenPyXL将Excel数据写入Python中的错误

VBA Excel计数特定值

如何创buildExcel电子表格内容的字典？

在一个字典中find一个值为列表的项目

Mac上的VBA（Excel）字典？

EXCEL – VBA。获取单元值作为键值对

excel映射来匹配列

Python 3 – 从构造的Dictionary中使用pandas写入excel

VBA类中的Dictionary属性

将字典保存到.XLSX中

正则expression式从CSV中删除加倍的双引号

使用OpenPyXL将Excel数据写入Python中的错误

VBA Excel计数特定值

如何创buildExcel电子表格内容的字典？

在一个字典中find一个值为列表的项目

Mac上的VBA（Excel）字典？

EXCEL – VBA。 获取单元值作为键值对

excel映射来匹配列

Python 3 – 从构造的Dictionary中使用pandas写入excel

VBA类中的Dictionary属性

将字典保存到.XLSX中

正则expression式从CSV中删除加倍的双引号

EXCEL – VBA。获取单元值作为键值对