pandas Study Notes 13 - Function Application: Applying Custom or Library Functions to pandas Objects (tcy)

阿新 • Published: 2018-12-15

pandas function application: applying custom or other library functions to pandas objects (pipe, apply, applymap, map, agg) 2018/12/5

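1. Functions:

# Table-wise function application:
df.pipe(func, *args, **kwargs) # func receives the whole Series, DataFrame, or GroupBy object
s.pipe(func, *args, **kwargs)

# Row-wise or column-wise function application:
df.apply(func, axis=0, raw=False, result_type=None, args=(), **kwds)
s.apply(func, convert_dtype=True, args=(), **kwds)
# Parameters:
# raw=False passes each row or column as a Series; raw=True passes it as an ndarray
# result_type: {'expand', 'reduce', 'broadcast', None}, only takes effect when axis=1
# 'expand' turns list-like results into columns, 'reduce' returns a Series where possible,
# 'broadcast' keeps the original shape, None infers the result from what func returns

# Element-wise function application:
df.applymap(func)
s.map(arg, na_action=None) # maps each value through arg (a dict, a Series, or a function)

# Aggregation:
s.agg(func, axis=0, *args, **kwargs) # aggregate using one or more operations over the specified axis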
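The overview lists agg, but none of the worked examples in this post exercise it. The following is a minimal sketch of the three common call styles, assuming a small illustrative Series and DataFrame (the variable names total, stats, and per_col are made up for this example):

```python
import pandas as pd

# Illustrative data (an assumption for this sketch)
s = pd.Series([1, 2, 3, 4])
df = pd.DataFrame([[1, 2, 3], [4, 5, 6]], columns=['col1', 'col2', 'col3'])

# A single function name reduces a Series to a scalar.
total = s.agg('sum')

# A list of function names returns a Series indexed by those names.
stats = s.agg(['min', 'max', 'mean'])

# A dict on a DataFrame picks a different aggregation for each column.
per_col = df.agg({'col1': 'sum', 'col2': 'max'})

print(total)
print(stats)
print(per_col)
```

Passing names as strings lets pandas dispatch to its own implementations; a callable such as np.sum can be passed in the same positions.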

2. The pipe function

Example 1:

import pandas as pd
import numpy as np

def adder(x1, x2):
    return x1 + x2

s = pd.Series([1, 2, 3, 4])
df = pd.DataFrame([[1, 2, 3], [4, 5, 6]], columns=['col1', 'col2', 'col3'])

s.pipe(adder, 2)  # same as adder(s, 2)
# 0    3
# 1    4
# 2    5
# 3    6
# dtype: int64

df.pipe(adder, 2)  # same as adder(df, 2)
#    col1  col2  col3
# 0     3     4     5
# 1     6     7     8
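The main reason to reach for pipe over a plain function call is method chaining. The sketch below assumes two hypothetical helpers, add_constant and scale, and shows that a chain of pipe calls is equivalent to nesting the calls by hand. It also shows the tuple form of pipe, which covers functions that do not take the data as their first argument (subtract_from is likewise a made-up helper):

```python
import pandas as pd

def add_constant(data, value):
    """Hypothetical helper: add a constant to every element."""
    return data + value

def scale(data, factor):
    """Hypothetical helper: multiply every element by a factor."""
    return data * factor

def subtract_from(value, data):
    """Hypothetical helper whose data argument is NOT the first parameter."""
    return value - data

df = pd.DataFrame([[1, 2, 3], [4, 5, 6]], columns=['col1', 'col2', 'col3'])

# Chained form reads left to right ...
chained = df.pipe(add_constant, 2).pipe(scale, factor=10)

# ... and is equivalent to the nested form, which reads inside out.
nested = scale(add_constant(df, 2), factor=10)

# Tuple form: (callable, name of the keyword that should receive the data).
flipped = df.pipe((subtract_from, 'data'), 10)

print(chained)
print(flipped)
```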

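3. The apply function

# Example 1: applying to a Series

def adder(x, y):
    return np.add(x, y)

s.apply(lambda x: x + 2)   # lambda function
s.apply(adder, args=(2,))  # custom function with an extra positional argument

def add_kwargs(x, **kwargs):
    # add every keyword value to x
    for key in kwargs:
        x += kwargs[key]
    return x

s.apply(add_kwargs, a1=1, a2=2, a3=3)  # multiple keyword arguments

s.apply(np.log)  # library function
# 0    0.000000
# 1    0.693147
# 2    1.098612
# 3    1.386294
# dtype: float64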

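Example 2: applying to a DataFrame:

# apply() applies an arbitrary function along an axis of a DataFrame
# (Panel, which existed when this was written, has since been removed from pandas)

df.apply(np.mean)  # default axis=0: computed per column
# col1    2.5
# col2    3.5
# col3    4.5
# dtype: float64

df.apply(np.mean, axis=1)  # computed per row
# 0    2.0
# 1    5.0
# dtype: float64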
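The result_type parameter from the function overview only matters when axis=1 and the applied function returns something other than a plain scalar per row. A small sketch, assuming a hypothetical min_max helper that returns a two-element list for each row:

```python
import pandas as pd

df = pd.DataFrame([[1, 2, 3], [4, 5, 6]], columns=['col1', 'col2', 'col3'])

def min_max(row):
    """Hypothetical helper: return the row minimum and maximum as a list."""
    return [row.min(), row.max()]

# Default (result_type=None): each list is kept as one object, giving a Series of lists.
as_lists = df.apply(min_max, axis=1)

# 'expand': each list element becomes its own column, giving a DataFrame.
expanded = df.apply(min_max, axis=1, result_type='expand')

# 'broadcast': the per-row result is spread back over the original shape and columns.
broadcast = df.apply(lambda row: row.sum(), axis=1, result_type='broadcast')

print(as_lists)
print(expanded)
print(broadcast)
```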

4. Mapping

# Example 1: Series

# Not every function can be vectorized; map applies one element by element
df['col1'].map(lambda x: x * 2)
s.map(lambda x: x * 2)

x = pd.Series([1, 2, 3], index=['a1', 'a2', 'a3'])
y = pd.Series(['v1', 'v2', 'v3'], index=[1, 2, 3])

x.map(y)  # map through a Series: the values of x are looked up in the index of y
# a1    v1
# a2    v2
# a3    v3
# dtype: object

z = {1: 'A', 2: 'B', 3: 'C'}
x.map(z)  # map through a dict
# a1    A
# a2    B
# a3    C
# dtype: object

s = pd.Series([1, 2, 3, np.nan])
s2 = s.map('str = {}'.format, na_action=None)  # NaN values are passed to the function
# 0    str = 1.0
# 1    str = 2.0
# 2    str = 3.0
# 3    str = nan
# dtype: object

s3 = s.map('str = {}'.format, na_action='ignore')  # NaN values are skipped and stay NaN
# 0    str = 1.0
# 1    str = 2.0
# 2    str = 3.0
# 3          NaN
# dtype: object
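One behaviour of dict-based mapping that the Series example does not show: a value with no matching key maps to NaN instead of raising an error. A minimal sketch, assuming an extra value 4 that is absent from the dict and an arbitrary 'unknown' placeholder:

```python
import pandas as pd

x = pd.Series([1, 2, 3, 4], index=['a1', 'a2', 'a3', 'a4'])
z = {1: 'A', 2: 'B', 3: 'C'}  # note: no entry for 4

# Values missing from the dict become NaN.
mapped = x.map(z)

# Option 1: fill the gaps after mapping.
filled = x.map(z).fillna('unknown')

# Option 2: supply the default during the lookup itself.
fallback = x.map(lambda v: z.get(v, 'unknown'))

print(mapped)
print(filled)
```

The first option stays vectorized for the lookup, while the second calls a Python function per element, so it is usually the slower of the two on large data.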

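Example 2: DataFrame

df = pd.DataFrame(np.random.randn(5, 3), columns=['col1', 'col2', 'col3'])
df.applymap(lambda x: x * 2)  # doubles every element of the random frame

fmt = lambda x: "%.2f" % x
# The output shown is for the 2x3 integer frame from section 2, not the random frame above
df2 = pd.DataFrame([[1, 2, 3], [4, 5, 6]], columns=['col1', 'col2', 'col3'])
df2.applymap(fmt)
#    col1  col2  col3
# 0  1.00  2.00  3.00
# 1  4.00  5.00  6.00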
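This post dates from 2018. From pandas 2.1 onward, DataFrame.applymap is deprecated in favour of DataFrame.map, which does the same element-wise job. A sketch of a version-tolerant wrapper follows; elementwise is a hypothetical helper name, not part of pandas:

```python
import pandas as pd

def elementwise(frame, func):
    """Hypothetical helper: apply func to every element of a DataFrame.

    Uses DataFrame.map where it exists (pandas >= 2.1) and falls back
    to DataFrame.applymap on older versions.
    """
    if hasattr(frame, 'map'):
        return frame.map(func)
    return frame.applymap(func)

df = pd.DataFrame([[1, 2, 3], [4, 5, 6]], columns=['col1', 'col2', 'col3'])

# Format every element as a string with two decimal places.
formatted = elementwise(df, lambda x: "%.2f" % x)
print(formatted)
```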
