Convert a text file to an Excel table

I have a text file in the format below (it uses --- and |||| as separators so that it looks like a table):


    st
    ---------------------------------------------------------------------------------------------------------
    Server : hh site: Date : 2012-03-10 Run Time :00.00.00
    ---------------------------------------------------------------------------------------------------------
    AA       |dd                     |condition          |another                    |condition        |Ref.
    yy       |sa33                   |true               |OK: 4tu                    |true             |yt.4.5
             |                       |                   |                           |                 |.3
    ---------|-----------------------|-------------------|---------------------------|-----------------|-----
    BB       |tr dd                  |2                  |dhfdk                      |                 |yt.5.1
             |verson                 |                   | t3hd                      | true            |.1
             |and above)             |                   |                           |                 |
    ---------|-----------------------|-------------------|---------------------------|-----------------|-----

The cell contents are all values, not headers. Thanks.

I don't have the programming skills to read the file and parse it. How can I strip out the ---- and ||||| and import the data into Excel as rows and columns?

As an alternative to using Pandas, you can parse the file yourself and use a Python Excel library such as xlsxwriter to create the .xlsx file:

    import csv
    from itertools import islice

    import xlsxwriter

    wb = xlsxwriter.Workbook("output.xlsx")
    ws = wb.add_worksheet()

    # Wrap text and top-align so that multi-line cells display properly
    cell_format = wb.add_format()
    cell_format.set_text_wrap()
    cell_format.set_align('top')

    # Python 3: open in text mode with newline='' for the csv module
    with open('input.txt', newline='') as f_input:
        csv_input = csv.reader(f_input, delimiter='|')
        cells = []
        row_output = 1

        # The third of the first four lines holds the "Server : ..." title
        header = [row.strip() for row in islice(f_input, 0, 4)][2]
        ws.merge_range('A1:G1', header)

        for row_input in csv_input:
            if not row_input:
                continue  # skip blank lines
            if row_input[0].startswith('---'):
                # A separator line ends a block: join each column's lines into one cell
                for col, cell in enumerate(zip(*cells)):
                    ws.write(row_output, col, '\n'.join(cell), cell_format)
                row_output += 1
                cells = []
            else:
                cells.append(row_input)

    wb.close()

This creates an Excel file in the same format as your data, i.e. each cell contains multiple lines:

(Screenshot of the resulting Excel file)
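To check the grouping logic without installing xlsxwriter, the parsing step can be tried on its own. The following is only a minimal sketch using the standard library: the function name `parse_blocks` and the shortened three-column `SAMPLE` are made up for illustration, and it assumes, like the answer above, that the title is the third of the first four lines and that every block ends with a `---` separator line.

```python
import csv
import io

# Shortened, hypothetical sample in the same layout as the question's file
SAMPLE = """st
-----
Server : hh site: Date : 2012-03-10 Run Time :00.00.00
-----
AA |dd |Ref.
yy |sa33 |yt.4.5
   |     |.3
---|-----|-----
BB |tr dd |yt.5.1
   |verson |.1
---|-----|-----
"""


def parse_blocks(lines):
    """Return (header, records); each record is a list of multi-line cell strings."""
    lines = iter(lines)
    # Assumption: the title is the third of the first four lines
    preamble = [next(lines).strip() for _ in range(4)]
    header = preamble[2]

    records, block = [], []
    for row in csv.reader(lines, delimiter='|'):
        if not row:
            continue  # skip blank lines
        if row[0].startswith('---'):
            # Separator line: merge the collected lines column by column
            if block:
                records.append(['\n'.join(col).rstrip('\n') for col in zip(*block)])
            block = []
        else:
            block.append([cell.strip() for cell in row])
    return header, records


header, records = parse_blocks(io.StringIO(SAMPLE))
for record in records:
    print(record)
```

Each entry in `records` corresponds to one spreadsheet row, so the list can be passed to `ws.write` cell by cell as in the answer above. Rows that still sit after the last separator are dropped by this sketch, which mirrors the behaviour of the original loop.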

The pandas library should do everything you need!

Code in an IPython environment:

    from io import StringIO  # Python 3 replacement for cStringIO

    import pandas as pd

    text_file = '''
    st
    ---------------------------------------------------------------------------------------------------------
    Server : hh site: Date : 2012-03-10 Run Time :00.00.00
    ---------------------------------------------------------------------------------------------------------
    AA       |dd                     |condition          |another                    |condition        |Ref.
    yy       |sa33                   |true               |OK: 4tu                    |true             |yt.4.5
             |                       |                   |                           |                 |.3
    ---------|-----------------------|-------------------|---------------------------|-----------------|-----
    BB       |tr dd                  |2                  |dhfdk                      |                 |yt.5.1
             |verson                 |                   | t3hd                      | true            |.1
             |and above)             |                   |                           |                 |
    ---------|-----------------------|-------------------|---------------------------|-----------------|-----
    '''

    # Read in tabular data, skipping the first header rows
    # StringIO(text_file) is for example only
    # Normally, you would use pd.read_csv('/path/to/file.csv', ...)
    # A regex separator needs a raw string and the Python parsing engine
    top = pd.read_table(StringIO(text_file), sep=r'\s{2,}', header=None,
                        skiprows=3, nrows=1, engine='python')
    df = pd.read_table(StringIO(text_file), sep='|', header=None, skiprows=5)

    # Remove '-' lines
    df = df[~df[0].str.contains('-')]

    # Reset the index
    df = df.reset_index(drop=True)

    # Combine top line
    df = pd.concat([top, df], ignore_index=True)

    df


Do whatever you need to do to clean up the data, then write it to Excel:

    # Write to excel file (recent pandas versions no longer write legacy .xls, so use .xlsx)
    df.to_excel('/path/to/file.xlsx')
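As one possible cleanup step, the padded cells can be stripped and the lines between two separator rows merged into a single row of multi-line cells. The sketch below is an assumption about what "cleaning up" might mean here, not part of the original answer: the helper name `merge_blocks` and the small demo frame are hypothetical, and it expects the separator rows to still be present and the columns to be labelled 0, 1, 2, ... as produced by `header=None`.

```python
import pandas as pd


def merge_blocks(raw):
    """Join the lines between '---' separator rows into one row per block."""
    # Treat missing values as empty strings and strip the column padding
    text = raw.fillna("").astype(str).apply(lambda col: col.str.strip())

    # Separator rows start with dashes in the first column (label 0)
    is_sep = text[0].str.startswith("---")

    # The running count of separators gives every data row a block number
    block_id = is_sep.cumsum()

    body = text[~is_sep]
    merged = body.groupby(block_id[~is_sep]).agg(
        lambda cell: "\n".join(value for value in cell if value)
    )
    return merged.reset_index(drop=True)


# Hypothetical three-column frame, shaped like the output of read_table(..., sep='|')
raw = pd.DataFrame([
    ["AA  ", "dd    ", "Ref."],
    ["yy  ", "sa33  ", "yt.4.5"],
    ["    ", "      ", ".3"],
    ["----", "------", "-----"],
    ["BB  ", "tr dd ", "yt.5.1"],
    ["    ", "verson", ".1"],
    ["----", "------", "-----"],
])

merged = merge_blocks(raw)
print(merged)
```

With the answer's code, this would have to run before the `'-'` lines are filtered out, since the separators are what mark the block boundaries. The result can then be written with `to_excel` as shown above.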
