删除一行中具有重复项的重复项的dynamic数据

我正在尝试删除重复的列Ddynamic数据没有标题或标识function。 我正在尝试删除所有的行D列重复的行。我正在转换为一个数据框的Excel,删除重复,然后把它放回到Excel中。 但是,我不断收到各种错误或删除重复。 我来自VBA的背景,但我们正在迁移到Python

尝试:

df.drop_duplicates(["C"]) df = pd.DataFrame({"C"}) df.groupby(["C"]).filter(lambda df:df.shape[0] == 1) As well an assortment of other variations. I was able to do this in VBA with one line. Any ideas why this keeps causing this issue. \\ import pandas as pd df = pd.DataFrame({"C"]}) df.drop_duplicates(subset=[''C'], keep=False) DG=df.groupby([''C']) print pd.concat([DG.get_group(item) for item, value in DG.groups.items() if len(value)==1]) 

我能够用一行在VBA中做到这一点。 任何想法,为什么这一直造成这个问题。

代码本身Template-

 df = pd.read_excel("C:/wadwa.xlsx", sheetname=0) columns_to_drop = ['d.1'] #columns_to_drop = ['d.1', 'b.1', 'e.1', 'f.1', 'g.1'] import pandas as pd Df = df[[col for col in df.columns if col not in columns_to_drop]] print(df) writer = pd.ExcelWriter('C:/dadwa/dwad.xlsx') df.to_excel(writer,'Sheet1') writer.save() print(df) 

码:

 import pandas as pd df = pd.read_excel("C:/Users/Documents/Book1.xlsx", sheetname=0) import pandas as pd df = df.drop_duplicates(subset=[df.columns[3]], keep=False) writer = pd.ExcelWriter('C:/Users//Documents/Book2.xlsx') df.to_excel(writer,'Sheet1') writer.save() print(df) 

我觉得你需要分配回来,按位置select第四列:

 df = df.drop_duplicates(subset=[df.columns[3]], keep=False)