Basic pandas operations

阿新 • Published: 2019-01-17

1. Reindexing with reindex

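pandas provides a reindex method that creates a new object conforming to a new index. Calling reindex on a Series rearranges its values in the order of the new index; any label in the new index that does not exist in the original index is filled with NaN.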

    import numpy as np
    from pandas import Series, DataFrame

    obj = Series([1,2,3],index=["c","b","a"])
    obj1 = obj.reindex(["a","b","c","d"])
    print(obj1)
    '''
    a    3.0
    b    2.0
    c    1.0
    d    NaN
    '''

a. Specifying the fill value with fill_value

    obj2 = obj.reindex(["a","b","c","d"],fill_value=0)
    print(obj2)
    '''
    a    3
    b    2
    c    1
    d    0
    '''

b. Interpolation (filling)

ffill (or pad) performs a forward fill: each missing label takes the value of the label that precedes it.

    obj = Series([1,2,3],index=["a","c","e"])
    obj1 = obj.reindex(["a","b","c","d","e","f"],method="ffill")
    print(obj1)
    '''
    a    1
    b    1
    c    2
    d    2
    e    3
    f    3
    '''

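Besides "ffill" and "pad", method also accepts "bfill" or "backfill" for a backward fill. When there is no preceding value (forward fill) or no following value (backward fill) to copy, the position is filled with the default NaN. These methods require a monotonically increasing or decreasing index.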
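
The backward-fill case can be sketched as follows. This is a minimal example that reuses the data from the forward-fill example; the variable name filled is only illustrative.

```python
import pandas as pd

# Same data as the forward-fill example.
obj = pd.Series([1, 2, 3], index=["a", "c", "e"])

# Backward fill: each new label takes the value of the next existing label.
filled = obj.reindex(["a", "b", "c", "d", "e", "f"], method="bfill")
print(filled)
# "f" has no following label to copy from, so it stays NaN.
# The presence of NaN also turns the result into float64.
```

Here "b" takes the value of "c" and "d" takes the value of "e", while "f" is left as NaN.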

c. Reordering the row and column indexes of a DataFrame with reindex

    # Create a two-dimensional array with 3 rows and 3 columns
    a = np.arange(9).reshape(3,3)
    frame = DataFrame(a,index=["a","b","c"],columns=["one","two","three"])
    print(frame)
    '''
        one  two  three
    a    0    1      2
    b    3    4      5
    c    6    7      8
    '''
    # Reorder the row index
    frame2 = frame.reindex(["c","b","a"])
    print(frame2)
    '''
        one  two  three
    c    6    7      8
    b    3    4      5
    a    0    1      2
    '''
    # Reorder the column index
    frame3 = frame.reindex(columns=["three","two","one"])
    print(frame3)
    '''
         three  two  one
    a      2    1    0
    b      5    4    3
    c      8    7    6
    '''

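The rows and columns of a DataFrame can be reindexed at the same time. Note that, in the pandas release this article was written for, interpolation is applied only along the rows and has no effect on the columns; newer releases may behave differently.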
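
Reindexing both axes in one call might look like the sketch below. The labels "d" and "four" are made up for illustration; they are not in the original frame, so the new row and column come back as NaN.

```python
import numpy as np
import pandas as pd

frame = pd.DataFrame(np.arange(9).reshape(3, 3),
                     index=["a", "b", "c"],
                     columns=["one", "two", "three"])

# Reindex rows and columns together; "d" and "four" do not exist in frame.
both = frame.reindex(index=["c", "b", "a", "d"],
                     columns=["three", "two", "one", "four"])
print(both)
```

Passing fill_value=0 to the same call would replace those NaN entries with 0.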

2. Reindexing by label with ix (loc in pandas 1.0 and later)

    # .ix was deprecated in pandas 0.20 and removed in 1.0; .loc is the label-based equivalent
    frame2 = frame.loc[["c","b","a"],["three","two","one"]]
    print(frame2)
    '''
       three  two  one
    c      8    7    6
    b      5    4    3
    a      2    1    0
    '''

Note that the first list gives the row labels and the second list gives the column labels.

3. Deleting specified rows or columns

a. Deleting Series entries by index label

    a = np.arange(3)
    series = Series(a,index=["a","b","c"])
    series1 = series.drop("b")
    print(series)
    '''
    a    0
    b    1
    c    2
    '''
    print(series1)
    '''
    a    0
    c    2
    '''

As the output shows, drop does not delete anything from the original Series; it returns a new Series. To delete from the original Series instead, pass inplace=True.

    a = np.arange(3)
    series = Series(a,index=["a","b","c"])
    series.drop("b",inplace=True)
    print(series)
    '''
    a    0
    c    2
    '''
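
One detail worth checking when using inplace=True is the return value. The sketch below (the variable name result is only illustrative) shows that the call returns None, so its result should not be assigned back to the Series variable.

```python
import numpy as np
import pandas as pd

series = pd.Series(np.arange(3), index=["a", "b", "c"])

# With inplace=True the Series itself is modified and the call returns None.
result = series.drop("b", inplace=True)
print(result)
print(series.index.tolist())
```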

b. Deleting rows and columns from a DataFrame

    a = np.arange(9).reshape(3,3)
    dataFrame = DataFrame(a,index=["a","b","c"],columns=["one","two","three"])
    # Delete rows
    dataFrame1 = dataFrame.drop(["a","c"],axis=0)
    print(dataFrame1)
    '''
        one  two  three
    b    3    4      5
    '''
    # Delete columns
    dataFrame2 = dataFrame.drop(["one","two"],axis=1)
    print(dataFrame2)
    '''
         three
    a      2
    b      5
    c      8
    '''

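When deleting rows, axis=0 can be omitted because deleting rows is the default. When deleting columns, the axis must be specified; otherwise pandas looks for the labels in the row index and raises ValueError: labels ['one' 'two'] not contained in axis (newer releases report a KeyError instead).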
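
The following sketch shows both points. It assumes pandas 0.21 or later, where drop also accepts a columns keyword as an alternative to axis=1; the names only_three and error_name are illustrative.

```python
import numpy as np
import pandas as pd

df = pd.DataFrame(np.arange(9).reshape(3, 3),
                  index=["a", "b", "c"],
                  columns=["one", "two", "three"])

# Equivalent to df.drop(["one", "two"], axis=1).
only_three = df.drop(columns=["one", "two"])

# Without an axis, drop looks for the labels in the row index and fails.
# The exception type depends on the pandas release, so both are caught here.
try:
    df.drop(["one", "two"])
    error_name = None
except (KeyError, ValueError) as exc:
    error_name = type(exc).__name__

print(only_three.columns.tolist(), error_name)
```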

4. Getting values at specified positions by index

a. Getting values from a Series

    a = np.arange(3)
    series = Series(a,index=["a","b","c"])
    # Select by index label
    print(series[["b","c"]])
    '''
    b    1
    c    2
    '''
    # Select by position
    print(series[1:3])
    '''
    b    1
    c    2
    '''

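When selecting by label, you can pass a single label or a list of labels. When slicing by position, note that positions start at 0 and the right endpoint is excluded. You can also slice by label in the same way, as in series["a":"c"]; the difference is that a label slice includes the right endpoint.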
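
The difference between the two kinds of slice can be sketched side by side. The positional slice is written with iloc here to make the intent explicit; that choice is an assumption of this example, not something the article prescribes.

```python
import numpy as np
import pandas as pd

series = pd.Series(np.arange(3), index=["a", "b", "c"])

# Positional slice: positions 0 and 1, the end position is excluded.
by_position = series.iloc[0:2]

# Label slice: labels "a" through "c", the end label is included.
by_label = series["a":"c"]

print(by_position.index.tolist())
print(by_label.index.tolist())
```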

b. Getting values from a DataFrame

    a = np.arange(9).reshape(3,3)
    dataFrame = DataFrame(a,index=["a","b","c"],columns=["one","two","three"])
    # Select columns
    print(dataFrame[["one","two"]])
    '''
        one  two
    a    0    1
    b    3    4
    c    6    7
    '''
    # Select rows
    print(dataFrame[0:2])
    '''
        one  two  three
    a    0    1      2
    b    3    4      5
    '''
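
In the example above, square brackets with a list of names select columns, while square brackets with a slice select rows. A small sketch of how the result type changes with the form of the key (the variable names are illustrative):

```python
import numpy as np
import pandas as pd

df = pd.DataFrame(np.arange(9).reshape(3, 3),
                  index=["a", "b", "c"],
                  columns=["one", "two", "three"])

col = df["one"]           # a single column name returns a Series
sub = df[["one"]]         # a list of names returns a DataFrame
rows = df.loc["a":"b"]    # a label slice on rows includes the end label

print(type(col).__name__, type(sub).__name__, rows.index.tolist())
```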

5. Filtering by index

    a = np.arange(9).reshape(3,3)
    dataFrame = DataFrame(a,index=["a","b","c"],columns=["one","two","three"])
    # Select the rows whose value in column "two" is greater than 5
    print(dataFrame[dataFrame["two"] > 5])
    '''
       one  two  three
    c    6    7      8
    '''
    # Compare every element with 5; returns a boolean DataFrame of the same shape
    print(dataFrame > 5)
    '''
          one    two  three
    a   False  False  False
    b   False  False  False
    c   True   True   True
    '''
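
The boolean results above can be used directly as masks. The sketch below shows two common uses; the second condition (three < 8) is made up for this example to show how conditions combine.

```python
import numpy as np
import pandas as pd

df = pd.DataFrame(np.arange(9).reshape(3, 3),
                  index=["a", "b", "c"],
                  columns=["one", "two", "three"])

# A boolean DataFrame used as a mask keeps matching values
# and replaces everything else with NaN.
masked = df[df > 5]

# Row conditions are combined with & (and) or | (or);
# each condition needs its own parentheses.
rows = df[(df["two"] > 3) & (df["three"] < 8)]

print(masked)
print(rows)
```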

Row-wise selection on a DataFrame can use ix (loc in pandas 1.0 and later)

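    a = np.arange(9).reshape(3,3)
    dataFrame = DataFrame(a,index=["a","b","c"],columns=["one","two","three"])
    # Select the first and third columns of the second row
    # (.ix was removed in pandas 1.0; .loc is the label-based equivalent)
    print(dataFrame.loc["b",["one","three"]])
    '''
    one      3
    three    5
    '''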
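
Because .ix was removed in pandas 1.0, the same selection can be sketched with the two accessors that replaced it: loc for labels and iloc for integer positions. The variable names below are illustrative.

```python
import numpy as np
import pandas as pd

df = pd.DataFrame(np.arange(9).reshape(3, 3),
                  index=["a", "b", "c"],
                  columns=["one", "two", "three"])

# Label-based: row "b", columns "one" and "three".
by_label = df.loc["b", ["one", "three"]]

# Position-based: second row (position 1), first and third columns.
by_position = df.iloc[1, [0, 2]]

print(by_label.tolist(), by_position.tolist())
```

Both calls return the same two values; which one to use depends on whether the code knows the labels or the positions.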
