df.drop_duplicates()返回刪除重複行的DataFrame

阿新 • • 發佈：2020-09-21

drop_duplicates()

可以刪除重複的行，返回的是刪除重複行後的df

DataFrame.drop_duplicates(subset=None, keep='first', inplace=False, ignore_index=False)

引數

subset：column label or sequence of labels, optional，需要刪除的列，預設是全部的列
keep：{‘first’, ‘last’, False}, default ‘first’，確定要保留的重複項（如果有），first和last分別是第一次和最後一次，false則是刪除所有的重複項
inplace：bool, default False，是否覆蓋原來的df
ignore_index：bool, default False

如果inplace=Ture，則返回刪除重複項的df

官網例子

df = pd.DataFrame({
    'brand': ['Yum Yum', 'Yum Yum', 'Indomie', 'Indomie', 'Indomie'],
    'style': ['cup', 'cup', 'cup', 'pack', 'pack'],
    'rating': [4, 4, 3.5, 15, 5]
})
df
'''
    brand style  rating
0  Yum Yum   cup     4.0
1  Yum Yum   cup     4.0
2  Indomie   cup     3.5
3  Indomie  pack    15.0
4  Indomie  pack     5.0
 
'''

預設情況下，它將基於所有列刪除重複的行

df.drop_duplicates()
'''
    brand style  rating
0  Yum Yum   cup     4.0
2  Indomie   cup     3.5
3  Indomie  pack    15.0
4  Indomie  pack     5.0
'''

要刪除特定列上的重複項，請使用`subset`

df.drop_duplicates(subset=['brand'])
'''
    brand style  rating
0  Yum Yum   cup     4.0
2  Indomie   cup     3.5
 
'''

要刪除重複項並保持最後一次出現，請使用`keep`

df.drop_duplicates(subset=['brand', 'style'], keep='last')
'''
    brand style  rating
1  Yum Yum   cup     4.0
2  Indomie   cup     3.5
4  Indomie  pack     5.0
'''

df.drop_duplicates()返回刪除重複行的DataFrame

drop_duplicates() 可以刪除重複的行，返回的是刪除重複行後的df DataFrame.drop_duplicates(subset=None, keep=\'first\', inplace=False, ignore_index=False)

mysql刪除重複行的實現方法

表relation create table relation( id int primary key auto_increment,userId int not null,fanId int not null

mysql 刪除重複行

1、根據單行判斷重複（1）查詢重複項 SELECT * FROM graph_disease_corresponding WHERE diag_pingan IN (

MySQL 如何查詢刪除重複行？

第一步是定義什麼樣的行才是重複行。多數情況下很簡單：它們某一列具有相同的值。本文采用這一定義，或許你對“重複”的定義比這複雜，你需要對sql做些修改。本文要用到的資料樣本：

面試官：MySQL 如何查詢刪除重複行？我竟然寫不出來。。

本文講述如何查詢資料庫裡重複的行。這是初學者十分普遍遇到的問題。方法也很簡單。這個問題還可以有其他演變，例如，如何查詢“兩欄位重複的行”（#mysql IRC 頻道問到的問題）

必備技能，MySQL 查詢並刪除重複行

pandas duplicated() 重複行標記與drop_duplicates()刪除

技術標籤：pythonpythonpandas pandas.DataFrame.duplicated DataFrame.duplicated(subset=None,keep=\'first\')

python 刪除excel表格重複行,資料預處理操作

使用python刪除excel表格重複行。 # 匯入pandas包並重命名為pd import pandas as pd # 讀取Excel中Sheet1中的資料

excel刪除重複的行_如何在Excel中刪除重複的行

excel刪除重複的行 When you are working with spreadsheets in Microsoft Excel and accidentally copy rows, or if you are making a composite spreadsheet of several others, you will encount