檢測異常點並過濾

阿新 • • 發佈：2020-07-13

1、檢測通過區域性相關跟蹤方法測量的異常，不同方法對應不同的閾值。

 1  def detect_anomaly_lcs(self, lcs_scores):
 2         """
 3         It detects the anomalies which are measured by local correlation tracking method.
 4         - gauss: threshold = 0.0 + self.sigma * std
 5         - threshold: the given threshold variable
 6         - proportion: threshold = sort_scores[threshold_index]
 
 7         :param lcs_scores: list<float> | the list of local correlation scores
 8         :return:
 9         """
10         if self.rule == "gauss":
11             mean = 0.0
12             std = np.std(lcs_scores)
13             threshold = mean + self.sigma * std
14             change_labels = []
 
15             for lcs in range(len(lcs_scores)):
16                 if lcs > threshold:
17                     change_labels.append(True)
18                 else:
19                     change_labels.append(False)
20             return change_labels, lcs_scores
21         if self.rule == "threshold 
":
22             threshold = self.threshold
23             change_labels = []
24             for lcs in range(len(lcs_scores)):
25                 if lcs > threshold:
26                     change_labels.append(True)
27                 else:
28                     change_labels.append(False)
29             return change_labels, lcs_scores
30         if self.rule == "proportion":
31             sort_scores = sorted(np.array(lcs_scores))
32             threshold_index = int(len(lcs_scores) * (1.0 - self.proportion))
33             threshold = sort_scores[threshold_index]
34             change_labels = []
35             for lcs in range(len(lcs_scores)):
36                 if lcs > threshold:
37                     change_labels.append(True)
38                 else:
39                     change_labels.append(False)
40             return change_labels, lcs_scores

2、通過比較預測值和實際值來計算每個點的掉落率。執行filter_anomaly（）函式以通過引數“ rule”過濾掉異常。

 1     def detect_anomaly_regression(self, predicted_series1, practical_series1, predicted_series2, practical_series2):
 2         """
 3         It calculates the drop ratio of each point by comparing the predicted value and practical value.
 4         Then it runs filter_anomaly() function to filter out the anomalies by the parameter "rule".
 5         :param predicted_series1: list<float> | the predicted values of the KPI series 1.
 6         :param practical_series1: list<float> | the practical values of the KPI series 1.
 7         :param predicted_series2: list<float> | the predicted values of the KPI series 2.
 8         :param practical_series2: list<float> | the practical values of the KPI series 2.
 9         :return:
10         """
11         change_ratios1 = []
12         change_ratios2 = []
13         change_scores = []
14         for i in range(len(practical_series1)):
15             c1 = (practical_series1[i] - predicted_series1[i]) / (predicted_series1[i] + 1e-7)
16             c2 = (practical_series2[i] - predicted_series2[i]) / (predicted_series2[i] + 1e-7)
17             change_ratios1.append(c1)
18             change_ratios2.append(c2)
19             s = (abs(c1) + abs(c2)) / 2.0
20             change_scores.append(s)
21 
22         change_labels = self.filter_anomaly(change_ratios1, change_ratios2, change_scores)
23         return change_ratios1, change_ratios2, change_labels, change_scores

3、檢測迴歸方法的異常

 1     def filter_anomaly(self, change_ratios1, change_ratios2, change_scores):
 2         """
 3         It detects the anomalies which are measured by regression method.
 4         - gauss: threshold1 = mean - self.sigma * std, threshold2 = mean + self.sigma * std
 5         - threshold: the given threshold variable
 6         - proportion: threshold = sort_scores[threshold_index]
 7         :param change_ratios1: list<float> | the change ratios of the KPI1.
 8         :param change_ratios2: list<float> | the change ratios of the KPI2.
 9         :param change_scores: list<float> | the average of the change anomaly degree of the two change ratios.
10         :return: list<bool> | the list of the labels where "True" stands for an anomaly.
11         """
12         if self.rule == 'gauss':
13             mean = np.mean(change_ratios1)
14             std = np.std(change_ratios1)
15             threshold1 = mean - self.sigma * std
16             threshold2 = mean + self.sigma * std
17             change_labels1 = self.filter_by_threshold(change_ratios1, threshold1, threshold2)
18             mean = np.mean(change_ratios2)
19             std = np.std(change_ratios2)
20             threshold1 = mean - self.sigma * std
21             threshold2 = mean + self.sigma * std
22             change_labels2 = self.filter_by_threshold(change_ratios2, threshold1, threshold2)
23             change_labels = list(np.array(change_labels1) + np.array(change_labels2))
24             return change_labels
25 
26         if self.rule == "threshold":
27             threshold = self.threshold
28             change_labels1 = self.filter_by_threshold(change_ratios1, -threshold, threshold)
29             change_labels2 = self.filter_by_threshold(change_ratios2, -threshold, threshold)
30             change_labels = list(np.array(change_labels1) + np.array(change_labels2))
31             return change_labels
32 
33         if self.rule == "proportion":
34             sort_scores = sorted(np.array(change_scores))
35             threshold_index = int(len(change_scores) * (1.0 - self.proportion))
36             threshold = sort_scores[threshold_index]
37             change_labels = []
38             for i in range(len(change_scores)):
39                 if change_scores[i] > threshold:
40                     change_labels.append(True)
41                 else:
42                     change_labels.append(False)
43             return change_labels

4、將過於偏離的點過濾為異常。

 1     def filter_by_threshold(self, change_ratios, threshold1, threshold2):
 2         """
 3         It filter out the too deviated points as anomalies.
 4         :param change_ratios: list<float> | the change ratios.
 5         :param threshold1: float | the negative threshold standing for a drop deviation.
 6         :param threshold2: float | the positive threshold standing for a rise deviation.
 7         :return: list<bool> | the list of the labels where "True" stands for an anomaly.
 8         """
 9         change_labels = []
10         for r in change_ratios:
11             if r < threshold1 or r > threshold2:
12                 change_labels.append(True)
13             else:
14                 change_labels.append(False)
15         return change_labels

檢測異常點並過濾

1、檢測通過區域性相關跟蹤方法測量的異常，不同方法對應不同的閾值。 1def detect_anomaly_lcs(self, lcs_scores):

Scikit-learn實戰之 SVM迴歸分析、密度估計、異常點檢測

Scikit-learn實戰之 SVM迴歸分析、密度估計異常點檢測 1. SVM迴歸 SVM的支援向量的方法能夠被擴充套件以解決迴歸問題。這種方法被稱之為SVR（Support Vector Regression 支援向量迴歸）。該模型是由SVC（支援向量分

python opencv 檢測移動物體並截圖儲存例項

最近在老家找工作，無奈老家工作真心太少，也沒什麼面試機會，不過之前面試一家公司，提了一個有意思的需求，檢測河面沒有有什麼船隻之類的物體，我當時第一反應是用opencv做識別，不過回家想想，河面相對的東西比較

ROC曲線評估和異常點去除

1、詳細連結見https://www.cnblogs.com/mdevelopment/p/9456486.html 複習ROC曲線： ROC曲線是一個突出ADS分辨能力的曲線，用來區分正常點和異常點。ROC曲線將TPR召回率描繪為FPR假陽性率的函式。

PHP + TP5 自定義異常機制並記錄日誌

技術標籤：TP5php 1.首先在lib目錄下建立Exception資料夾，並在該資料夾建立一個ApiHandleException.php （名稱可自定義）檔案，重寫render方法，作為異常輸出。

k8s --etcd 叢集檢測異常( could not connect: x509)

技術標籤：K8S運維etcd etcd叢集單節點檢測正常 [[email protected] etcd]# supervisorctl status

異常點分析

import matplotlib.pyplot as plt import plotly.express as px import plotly plotly.offline.init_notebook_mode(connected=True)

php抓取網頁body內容，並過濾網頁標籤

php只抓取網頁文字內容，並過濾其標籤，說幹就幹，開始！ <?php function curl_request ( $url , $post = \'\' , $cookie = \'\' ,$returnCookie = 0 ) {

OPPO 公開活體檢測相關專利：可降低眼部檢測成本，並提高準確性

8 月 10 日訊息 OPPO 廣東行動通訊有限公司在今日公開了“活體檢測方法及裝置、計算機可讀儲存介質和電子裝置”專利，公開號為 CN113239887A。

扎克伯格：有種新型類膚材料，可以檢測現實觸覺並反饋到虛擬世界

北京時間 11 月 2 日早間訊息，據報道，最近 Facebook 正式更名為 Meta，該公司聯合創始人兼執行長馬克・扎克伯格（Mark Zuckerberg）在週一表示，一種新的觸控感測器和一種塑料材料可以配合在一起工作，並有可能為“

inno setup打包多個exe、msi 自動檢測.net framework並安裝

新建一個空白指令碼 #define MyAppName \"傳奇霸業\" #define MyAppVersion \"1.8.8.8\" #define MyAppPublisher \"霸業科技\"

uni-app檢測版本升級並顯示下載進度

uni-app檢測版本升級並顯示下載進度一、檢測版本 1、自動檢測即開啟應用是就檢測應用版本，檢測方法需要寫在app.vue檔案中，程式碼如下

Python多執行緒捕獲子執行緒的異常，並退出主程序。

自己在專案的開發中，一般能避免在單個程序中使用多執行緒就儘量把每個執行緒包裝成獨立的程序執行，通過socket或者一些中介軟體比如redis進行通訊，工作，協調。

SpringBoot定義全域性異常類並列印錯誤堆疊資訊

目錄一、註解含義二、定義全域性異常類 SpringBoot中可以定義全域性異常類，不用在每一個介面使用try catch捕獲返回異常

高德地圖杭州上線一鍵導航核酸檢測取樣點

感謝網友肖戰割割的線索投遞！

油猴指令碼：自動檢測元素並點選、休眠、順序執行、單頁面也適用

油猴指令碼-目標：自動檢測元素並點選、休眠、順序執行、填充表單、單頁面也適用

springboot攔截器過濾token,並返回結果及異常處理操作

1.springboot 攔截器處理過濾token，並且返回結果 import org.apache.commons.lang3.StringUtils;

無需看原始碼瞭解並解決一個事務常見的異常

無需看原始碼瞭解並解決一個事務常見的異常在觀看此篇文章之前需要了解什麼是事務的傳播屬性

Python 過濾錯誤log並匯出的例項

前言：測試過程中獲取App相關log後，如何快速找出crash的部分，並匯出到新的檔案呢？

Python實現非正太分佈的異常值檢測方式

工作中，我們經常會遇到資料異常，比如說瀏覽量突增猛降，交易量突增猛降，但是這些資料又不是符合正太分佈的，如果用幾倍西格瑪就不合適，那麼我們如何來判斷這些變化是否在合理的範圍呢？

檢測異常點並過濾

相關推薦