statistic—偏度，峰度，卡方分佈，t分佈，f分佈

阿新 • • 發佈：2018-12-18

from __future__ import print_function, division
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
from scipy.stats import chi2, t, f

DJIA = pd.read_csv('^DJI.csv')
DJIA.index = pd.to_datetime(DJIA.Date)
# plt.plot(DJIA.Close)

# More profesional plot but can be even more professional 

SP500 = pd.read_csv('^GSPC.csv')
SP500.index = pd.to_datetime(SP500.Date)
# plt.plot(SP500.Close)

LD, LS = np.log(DJIA.Close), np.log(SP500.Close)
rD, rS = np.diff(LD), np.diff(LS)
lr = pd.DataFrame(data = {'DJIA': rD, 'SP500': rS})
lr.index = DJIA.index[1:]
plt.plot(lr.DJIA)
plt.show()

a = lr.mean()
s = lr. 
std(ddof=1)#ddof=0 sample std;ddof=1 population std

ydays = 252

A, V = a*ydays,  s*np.sqrt(ydays)

print('Annualized Log Return on %s = %0.2f%%' % (A.index[0], A.iloc[0]*100))
print('Annualized Log Return on %s = %0.2f%%' % (A.index[1], A.iloc[1]*100))
print('Annualized Volatility of %s = %0.2f%%' % (V.index[ 
0], V.iloc[0]*100))
print('Annualized Volatility of %s = %0.2f%%' % (V.index[1], V.iloc[1]*100))

lr_demeaned = lr - a

T, alpha = len(lr), 0.05

sigma = s*np.sqrt((T-1)/T)

skewness0 = sum(lr_demeaned.DJIA**3)/(T* sigma.DJIA**3)
skewness1 = sum(lr_demeaned.SP500**3)/(T* sigma.SP500**3)

kurtosis0 = sum(lr_demeaned.DJIA**4)/(T* sigma.DJIA**4) 
kurtosis1 = sum(lr_demeaned.SP500**4)/(T* sigma.SP500**4)  

JB0 = T*(skewness0**2/6 + (kurtosis0-3)**2/24)
JB1 = T*(skewness1**2/6 + (kurtosis1-3)**2/24) 
cvalue = chi2.ppf(1-alpha, 2)   #percent point function
print('\nJB of DJIA = %0.0f, JB of SP500 = %0.0f, Critical Value = %0.2f' 
     %  (JB0, JB1,  cvalue))
print('The null hypothesis of normality must be rejected.')

###############################################################################
correlation = np.corrcoef(lr.DJIA, lr.SP500)
print('\nCorrelation between DJIA and S&P500 log returns = %0.2f%%' 
      %  (correlation[0][1]*100))

T_test_stat = (a.DJIA - a.SP500)/np.sqrt(s.DJIA**2/T + s.SP500**2/T )
sT0, sT1 = s.DJIA**2/T, s.SP500**2/T

ddof = (sT0 + sT1)**2/(sT0**2/(T-1) + sT1**2/(T-1))
t_cvalue = t.ppf(1-alpha/2, ddof)    #ddof自由度
print('\nTwo sample t test stat = %0.4f, Critical Value = %0.2f'
      % (T_test_stat, t_cvalue))
print('The null hypothesis of same mean cannot be rejected.')

F = s.DJIA**2/s.SP500**2

f_cvalue_lower = f.ppf(alpha/2, T-1, T-1)
f_cvalue_upper = f.ppf(1-alpha/2, T-1, T-1)

print('\nF test stat = %0.4f, Lower crit val = %0.4f, Upper crit val = %0.4f'
      % (F, f_cvalue_lower, f_cvalue_upper))
print('The null hypothesis of equal variance must be rejected.')

結果：
在這裡插入圖片描述
Annualized Log Return on DJIA = 8.86%
Annualized Log Return on SP500 = 8.16%
Annualized Volatility of DJIA = 17.45%
Annualized Volatility of SP500 = 17.86%

JB of DJIA = 640600, JB of SP500 = 288868, Critical Value = 5.99
The null hypothesis of normality must be rejected.

Correlation between DJIA and S&P500 log returns = 96.51%

Two sample t test stat = 0.1640, Critical Value = 1.96
The null hypothesis of same mean cannot be rejected.

F test stat = 0.9549, Lower crit val = 0.9584, Upper crit val = 1.0434
The null hypothesis of equal variance must be rejected.

statistic—偏度，峰度，卡方分佈，t分佈，f分佈

statistic—偏度，峰度，卡方分佈，t分佈，f分佈

多階矩在影象中的含義（方差，偏度，峰度）

偏度與峰度的正態性分佈判斷

偏度與峰度（附python程式碼）

機器學習數學|偏度與峰度及其python實現

統計分析：偏度和峰度

python模擬概率論中偏度和峰度計算

數據的偏度和峰度

這屆網際網路公司月餅：阿里卡哇伊，百度酷炫風，京東乾隆審美……

抵禦WannaCry勒索病毒，瑞度吹起進攻號角！

SQLSERVER線程，調度器，工作任務

【立貼為證】二十年後2027，百度眼必然成人眼一個

JStorm與Storm源碼分析（三）--Scheduler，調度器

React antd嵌入百度編輯器（css加載不到等問題，'offsetWidth' of null）

phpcms的後臺網站直接訪問正常，百度快照收錄鏈接訪問跳轉到非法網站

從邂逅到共生：關於AI落地，百度與小米的新碰撞

達成戰略合作後，百度DuerOS和海爾U＋平臺將如何加速AI+IoT行業？

“百度杯”CTF比賽九月場_123（文件備份，爆破，上傳）

分布式計算標準差，信度

一鍵式搭建私人網絡硬盤、個人網盤，百度網盤——owncloud安裝指南

statistic—偏度，峰度，卡方分佈，t分佈，f分佈

相關推薦