XORing Elephants: Novel Erasure Codes for Big Data

阿新 • Published: 2021-10-10

0. ABSTRACT

The paper improves on Reed-Solomon (RS) codes: it adds some storage redundancy in exchange for better repair performance.

1. INTRODUCTION

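Introduces replication and erasure codes.

The main problem with erasure-code repair is bandwidth overhead: with a (10, 4) RS code, repairing a single block requires transferring 10 times the block size.

Proposes the concept of Locally Repairable Codes (LRCs).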
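As a rough illustration of the repair cost above, the sketch below compares the bytes read to rebuild one lost block under a (10, 4) RS code and under an LRC whose local repair needs 5 blocks (the figure quoted in the decoding section of these notes). The 64 MB block size is borrowed from the EC2 experiments, and the function name is hypothetical.

```python
MB = 1024 * 1024

def repair_read_bytes(blocks_needed: int, block_size: int) -> int:
    """Bytes that must be read to rebuild a single lost block."""
    return blocks_needed * block_size

block_size = 64 * MB  # assumption: block size taken from the EC2 experiments
rs_cost = repair_read_bytes(10, block_size)   # RS (10, 4): 10 surviving blocks
lrc_cost = repair_read_bytes(5, block_size)   # LRC: 5 blocks from the local group

print(rs_cost // MB, "MB read for RS,", lrc_cost // MB, "MB read for LRC")
```

Under these assumptions the local repair reads half as much data, which matches the roughly 2x bandwidth reduction reported in the conclusions.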

1.1. Importance of Repair

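Analysis of data from Facebook's clusters leads to the conclusion that reducing the network bandwidth cost of repair is important.

Reasons why efficient repairability matters: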

degraded reads

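Many transient errors cause no permanent data loss but do make data temporarily unavailable; reads of an encoded stripe during that time become degraded reads.

In that case the block can be reconstructed through the repair process, not for fault tolerance but for higher data availability. The reconstructed block does not need to be written to disk.

So efficient and fast repair improves data availability.

efficient node decommissioning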

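Hadoop can decommission a faulty node.

Functional data must be copied off the node before it is decommissioned, which is a complicated and time-consuming process.

With fast repairs, node decommissioning can be treated as a scheduled repair: the blocks are simply re-created, without generating very large network traffic.

repair influences the performance of other concurrent MapReduce jobs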

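Repair consumes network bandwidth, and storage capacity is growing disproportionately fast compared with data center network bandwidth. The problem will therefore keep getting worse, which makes local repairs increasingly important.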
local repair would be a key in facilitating geographically distributed file systems across data centers

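RS is impractical across geographic regions because of its high bandwidth requirements across wide area networks.

Local repair makes this feasible.

Replication handles all of the issues above well but has a high storage overhead. MDS codes have a low storage overhead but run into the issues above.

This paper can be seen as sacrificing some storage efficiency in exchange for these other properties.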

2. THEORETICAL CONTRIBUTIONS

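Maximum distance separable (MDS) codes are widely used in communication and storage.

MDS codes need the least redundancy for a given level of fault tolerance.

Two definitions: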

Minimum Code Distance
Block Locality

Locality and good distance are conflicting goals.

LRC(k, n−k, r)

2.1. LRC implemented in Xorbas

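The coefficients c_i must be chosen so that the equations are linearly independent, and they must be nonzero.

The paper gives both a randomized and a deterministic algorithm for generating the coefficients.

One optimization: the local parity S3 does not need to be stored, because the coefficients are constructed so that S1 + S2 + S3 = 0.

Construction of the coefficients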
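The local-parity idea can be sketched with plain XOR. This is a simplification, assuming all coefficients c_i equal 1 so that addition in the underlying binary field is bytewise XOR; the block contents, sizes, and helper name are hypothetical. The sketch does not compute the RS parities, so it only shows how S3 follows from S1 and S2 once S1 + S2 + S3 = 0 holds, not that the construction achieves it.

```python
import os
from functools import reduce

def xor_blocks(blocks):
    """Bytewise XOR of equally sized blocks."""
    return bytes(reduce(lambda a, b: a ^ b, column) for column in zip(*blocks))

block_size = 16
data = [os.urandom(block_size) for _ in range(10)]  # stand-ins for 10 data blocks

s1 = xor_blocks(data[:5])   # local parity over the first five data blocks
s2 = xor_blocks(data[5:])   # local parity over the last five data blocks

# Implied parity: if S1 + S2 + S3 = 0 and addition is XOR, then S3 = S1 + S2,
# so S3 can be derived on demand instead of being stored.
s3 = xor_blocks([s1, s2])

# Local repair of one lost data block: read the 4 other blocks of its group
# plus the group's local parity, 5 blocks in total.
lost = 2
survivors = [block for i, block in enumerate(data[:5]) if i != lost]
rebuilt = xor_blocks(survivors + [s1])
print(rebuilt == data[lost])
```

With general nonzero coefficients the same repair works, except that each block is scaled by a field element before being summed.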

3. SYSTEM DESCRIPTION

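Implemented on top of HDFS-RAID.

RaidNode

Creates and maintains the parity blocks.

BlockFixer

Repairs blocks.

ErasureCode

Implements encoding and decoding; both components above depend on it.

HDFS-Xorbas adds LRC on top of HDFS-RAID.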

3.1. HDFS-Xorbas

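3.1.1. Encoding

RaidNode splits a file into stripes of 10 data blocks and encodes 4 parity blocks for each stripe. A stripe may have fewer than 10 blocks; by default the missing blocks are filled with zeros, so the stripe still has 10.

LRC computes two additional local parity blocks, as shown in the paper's figure.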
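The stripe layout described above can be sketched as follows. This is a minimal illustration, not the HDFS-Xorbas code: the function names are hypothetical, and the labels X and P for data blocks and RS parities are naming assumptions (only S1 and S2 appear in these notes).

```python
DATA_BLOCKS, RS_PARITIES, LOCAL_PARITIES = 10, 4, 2

def pad_stripe(blocks, block_size):
    """Pad a short stripe with all-zero blocks up to DATA_BLOCKS entries."""
    if len(blocks) > DATA_BLOCKS:
        raise ValueError("a stripe holds at most 10 data blocks")
    zero_block = bytes(block_size)
    return list(blocks) + [zero_block] * (DATA_BLOCKS - len(blocks))

def stripe_layout():
    """Labels of the blocks in one full stripe: data, RS parities, local parities."""
    data = [f"X{i}" for i in range(1, DATA_BLOCKS + 1)]
    rs = [f"P{i}" for i in range(1, RS_PARITIES + 1)]
    local = [f"S{i}" for i in range(1, LOCAL_PARITIES + 1)]
    return data + rs + local

short_file = [b"\x01" * 8, b"\x02" * 8, b"\x03" * 8]  # a 3-block file
stripe = pad_stripe(short_file, block_size=8)
print(len(stripe), len(stripe_layout()))
```

The encoder always sees 10 data blocks, and a full stripe ends up with 16 blocks, compared with 14 for plain RS.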

3.1.2. Decoding and Repair

RaidNode

light-decoder

Handles a single block failure per stripe.

heavy-decoder

Used when the light-decoder fails.

BlockFixer

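On detecting a failed block, determines the 5 blocks that LRC needs to recover it.

Attempts recovery with the light-decoder.

With multiple failures, the 5 required blocks may not all be available; the light-decoder then fails and the heavy-decoder takes over.

The heavy-decoder follows the same process as RS recovery; the result is sent to and stored on a DataNode.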
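The light/heavy fallback above can be sketched as a small selection routine. Everything here is an assumption for illustration: the block indices, the grouping of blocks into local groups, and the function names are hypothetical and are not taken from the HDFS-Xorbas source. Repairing an RS parity from 5 blocks relies on the implied parity S3 = S1 + S2 noted earlier.

```python
# Hypothetical block indices within one stripe (an assumption of this sketch):
# 0-9 are data blocks, 10-13 are RS parities, 14 and 15 are local parities S1, S2.
DATA_GROUPS = [(range(0, 5), 14), (range(5, 10), 15)]
RS_PARITIES = {10, 11, 12, 13}
LOCAL_PARITIES = {14, 15}

def light_repair_set(lost):
    """Return the 5 blocks the light-decoder would read to rebuild `lost`."""
    if lost not in range(16):
        raise ValueError("block index must be in 0..15")
    for group, local_parity in DATA_GROUPS:
        members = set(group) | {local_parity}
        if lost in members:
            return members - {lost}
    # An RS parity: read the other three RS parities plus S1 and S2,
    # using the implied parity S3 = S1 + S2.
    return (RS_PARITIES - {lost}) | LOCAL_PARITIES

def choose_decoder(lost, failed):
    """Pick a decoder for `lost`, given the set of all currently failed blocks."""
    needed = light_repair_set(lost)
    if needed.isdisjoint(failed):
        return "light", needed
    return "heavy", None  # fall back to RS-style decoding

print(choose_decoder(2, {2}))      # single failure in the stripe
print(choose_decoder(2, {2, 3}))   # a second failure in the same local group
```

The sketch only decides which decoder to try; whether the heavy-decoder can succeed depends on how many blocks of the stripe are lost, which is not modeled here.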

4. RELIABILITY ANALYSIS

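Mean time to data loss (MTTDL) is the basis of the reliability analysis. It depends on two factors:

The number of failures that can be tolerated

The speed of repair

MTTDL rises as resilience increases and as repair time decreases.

Result: LRC and RS both achieve a much higher MTTDL than replication, although replication still offers better data availability than LRC and RS.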

5. EVALUATION

Performance is evaluated in two environments:

Amazon’s Elastic Compute Cloud

a test cluster in Facebook

5.1 Evaluation Metrics

HDFS Bytes Read

The total amount of data read by the jobs launched for repair.

collected from the statistics-reports of the jobs spawned following a failure event

Network Traffic

the total amount of data communicated from nodes in the cluster

Measured in GB.

Monitored with the following tool:

Amazon’s AWS CloudWatch monitoring tools

Repair Duration

the time interval between the starting time of the first repair job and the ending time of the last repair job.

5.2 EC2

two Hadoop clusters

HDFS-Xorbas

HDFS-RS

Each cluster

51 instances of type m1.small

1 master hosting Hadoop’s NameNode, JobTracker and RaidNode daemons

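50 slaves, each running a DataNode and a TaskTracker daemon

file size 640 MB

block size 64 MB

Each file yields 14 blocks in the HDFS-RS cluster and 16 blocks in the HDFS-Xorbas cluster.

A failure event is the termination of one or more DataNodes.

Of the failure events, four are single-node failures, two are three-node failures, and two are two-node failures.

Number of files: 20, 100, and 200.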

5.2.1 HDFS Bytes Read

HDFS-Xorbas reads 41%-52% fewer bytes than HDFS-RS.

The average number of blocks read drops from 11.5 to 5.8.

5.2.2 Network Traffic

Network traffic tracks the HDFS bytes read closely, at roughly twice the amount.

5.2.3 Repair Time

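Xorbas repairs 25% to 45% faster than HDFS-RS.

Bandwidth was not saturated in these experiments; in a real environment, where it may be, the repair-time advantage could be even larger.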

5.2.4 Repair under Workload

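This experiment shows how repair performance affects the cluster's workload.

Two clusters were set up.

Each cluster has 15 slave nodes.

When blocks are unavailable because of failures, jobs are delayed less under LRC than under RS.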

5.3 Facebook’s cluster

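The difference here is that the experiment uses a dataset that already exists in the cluster.

Block size is 256 MB.

94% of the files have 3 blocks and the rest have 10 blocks, for an average of 3.4 blocks per file.

Because most files are small (far fewer than the 10 data blocks of a full stripe), Xorbas has a 27% higher storage overhead than HDFS-RS here (ideally it would be 13%).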
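The 27% figure can be reproduced with a short calculation. This is a sketch under stated assumptions: every stripe stores all 4 RS parities and both local parities no matter how many real data blocks it has, and zero-padding blocks are not stored. The file mix is the one quoted above; the variable and function names are hypothetical.

```python
RS_PARITIES = 4
LOCAL_PARITIES = 2

# (fraction of files, data blocks per file), as quoted in the notes above
file_mix = [(0.94, 3), (0.06, 10)]

def avg_stored_blocks(parities_per_stripe):
    """Average blocks stored per file, assuming one stripe per file."""
    return sum(frac * (blocks + parities_per_stripe) for frac, blocks in file_mix)

avg_data = avg_stored_blocks(0)
avg_rs = avg_stored_blocks(RS_PARITIES)
avg_xorbas = avg_stored_blocks(RS_PARITIES + LOCAL_PARITIES)

small_file_overhead = avg_xorbas / avg_rs - 1        # this dataset
full_stripe_overhead = (10 + 4 + 2) / (10 + 4) - 1   # every stripe full

print(f"{avg_data:.2f} data blocks per file on average")
print(f"{small_file_overhead:.1%} extra storage with this file mix")
print(f"{full_stripe_overhead:.1%} extra storage with full stripes")
```

Under these assumptions the mix gives about 3.4 data blocks per file and about 27% extra storage, while full stripes give about 14%, which agrees with the figure in the conclusions and is close to the 13% quoted above.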

6. RELATED WORK

functional repair

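The block can be recovered, but until then it is not available for use; recovering it requires k other blocks.

exact repair

Repairing at a lower network cost is possible.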

low rate

high rate

This section covers quite a lot of prior work; the details are worth reading when time allows.

7. CONCLUSIONS

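LRC cuts repair bandwidth by about 2x at the cost of 14% more storage.

The paper also proposes applying the idea to wide stripes, where RS is impractical because its repair bandwidth grows with the stripe size.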
