RMQ(Range minimum query) based LCA solution

何為RMQ

在文章《Tarjan’s off-line lowest common ancestors algorithm》我們用圖形化的方式展示了Tarjan’s off-line LCA的求解過程，但是該文章有很多遺漏，例如下面的這些問題。在本篇文章中，我會介紹另外一種求解LCA的方法，然後嘗試順帶回答列出的這些問題。

既然有了Leetcode 236中標準解法，為什麼還需要Tarjan這種比較重的方法？
在那篇文章中，提到Tarjan方法的本質是並查集，但是那種說法並不嚴謹，並查集只是Tarjan實現途徑

現階段比較好的求解LCA的方式是基於RMQ的求解方法，本文章會著重介紹什麼是RMQ以及常見的幾種求解RMQ的方法， 注：本篇文章完全按照TopCoder中的

此篇文章展開的，所以如果看過那篇文章就不用浪費時間本篇文章了

RMQ，全稱Range minimum query，用於查詢一個數組中子陣列的最值，這樣一個看似簡單的問題，卻有很多值得玩味的地方。

In computer science, a range minimum query (RMQ) solves the problem of finding the minimal value in a sub-array of an array of comparable objects. 《Range minimum query》

樸素解法

例如，給定包含 N 個數的陣列 data[N] 和 Q 個查詢。每個查詢的輸入 (a, b) 都是一對整數，要求打印出 data[a] 到 data[b] 之間的最大值和最小值之差。例如，N = 6，Q = 3，一個輸入樣例是 $d$

ata:17342569data\ :\ 1\ 7\ 3\ 4\ 2\ 5\ 6\ 9

d a t a : 17342569

query: 0\ 4

\qquad \quad \ \ 3\ 5

\qquad \quad \ \ 1\ 1

比較直觀的方法是，然後得到Max和Min，然後求差值，在陣列沒有發生變化的情況下，這種方法有很多資源的浪費，存在很多重複計算。例如我們在查詢（ $1$ ， $5$ ）之間Max和Min，可以順手將子區間的最大值和最小值記錄下來，使用一種Record Table來儲存計算後的結果。

Record Table

顯而易見將所有可能的查詢下標對（ $a$

a

，

b

）記錄下來，需要

O(N^2)

的空間複雜度。求子陣列最小值的記錄表格如下所示：

0	1	2	3	4	5	6	7
0	1	1	1	1	1	1	1	1
1	7	3	3	2	2	2	2
2	3	3	2	2	2	2
3	4	2	2	2	2
4	2	2	2	2
5	5	5	5
6	6	6
7	9

注：這種半矩陣肯定有更好的儲存方式

該方法的複雜度如下所示：

前期準備工作，亦即計算該表格的時間複雜度為 $O(N^2)$
查詢複雜度為 $O(1)$
空間複雜度為 $O(N^2)$ ，

該方法的查詢複雜度雖然很低，但是空間複雜度卻比較高，那麼是否可以對儲存的表格進行精簡？

可以使用動態規劃來求解該表格，注意對Table[i][i]的賦值移動到雙層loop中，但是那種做法沒有下面這種形式高效，經過我在quick-bench上的測試，下面的這種形式比另外一種形式快1.6倍，測試結果見http://quick-bench.com/oTprU_S6yaNvqI2xpjtBemq9O4w。下面這種方式比較快的原因可能是C++中的not pay for what you don’t use，類似於copy-and-swap idiom相較於傳統方式的優勢。

#include <iostream>
#include <vector>

using TableType = std::vector<std::vector<int>>;

void solution(std::vector<int> &Array, TableType &Table) {
    size_t size = Array.size();

    for (int i = 0; i < size; ++i)
        Table[i][i] = Array[i];
    
    for (size_t i = 0; i < size; ++i) {
        for (size_t j = i; j < size; ++j) {
            if (Table[i][j-1] < Array[j])
                Table[i][j] = Table[i][j-1];
            else
                Table[i][j] = Array[j];
        }
    }
}

int main() {
    std::vector<int> Vec{1, 7, 3, 4, 2, 5, 6, 9};
    TableType Table{Vec.size(), std::vector<int>(Vec.size(), 0)};
    solution(Vec, Table);
    return 0;
}

block-based Table

Sqrt-based Table

我們可以犧牲查詢操作的效率，來得到更小的表格需要的儲存空間。我們可以將 $Array$ 分成幾個chunks，儲存各chunk的最小值，然後將某次查詢經由這些chunk的最小值組合而成，由於我們至多可以將 $Array$ 分割成 $N$ 個chunk，所以儲存空間至多為 $O(N)$ 。TopCoder直接將Array分割成了 $sqrt(N)$ 個chunk，並沒有解釋緣由，GeekforGeeks中有一篇很好關於為什麼常常將 $Array$ 分割成 $sqrt(N)$ 的講解，見Sqrt (or Square Root) Decomposition Technique | Set 1 (Introduction)。

The key concept of this technique is to decompose given array into small chunks specifically of size sqrt(n).

我們以開頭陣列為例，選擇將 $Array$ 分割成不同的chrunk，如下圖所示： chunks

所以每次查詢都可分為下面兩種情況，查詢的複雜度就可以通過下面兩種情況中較大的複雜度決定。

查詢跨越多個chunk
查詢只侷限在一個chunk中那麼分成幾個chunk，才能使查詢的最壞複雜度最小呢？答案是將長度為 $N$ 的 $Array$ 分為 $sqrt(N)$ 個chunk時，worst case complexity最小。

Why sqrt is perfect?

假如我們將 $WC(N, x)$ 定義為將長度為 $N$ 的陣列分割為 $x$ 個chunk的複雜度，那麼該函式如下所示：

$WC(N, x) = \begin{cases} N/x, \qquad {\rm if\ } N/x > x \\ x, \qquad \quad \ \ {\rm otherwise} \end{cases}$ 當 $x$ 取 $sqrt(N)$ 時， $WC(N, x)$ 達到最小值，如下圖所示，也就是 $8/x$ 與 $x$ 交點的位置。

此時空間複雜度為 $O(sqrt(N))$ ，查詢複雜度為 $O(sqrt(N))$ ，構建Table的複雜度是 $O(N)$ 。

但是這種方式有個問題，就是雖然我們有了一個額外的table，但是還得必須訪問原有的陣列。

泛華形式

Sparse Table

注：該小節的標題其實不是很合適，sparse table是一個很寬泛的概念，上一小節中的block-based table其實也可以算作這一小節中 現如今針對RMQ中的sparse table就特指文章《The LCA Problem Revisited》中提出的sparse table的方法（注：也是該篇文章首次將LCA問題轉換成RMQ問題求解的）。該方法首先也是基於預先處理原陣列，然後使用一個額外的Table儲存指定query的值得方式。

首先我們定義 $M{_i}{_,}{_j}$ ，來表示子陣列 $A[i...i + 2^j-1]$ 的最小值的index（從這裡可以看到當 $j=0$ 時，表示就是A[i]這一個陣列單元），如下圖所示。任何關於子陣列的query都可以通過兩個 $M{_i}{_,}{_j}$ 覆蓋。例如 $A[2...8]$ 就可以由 $A[2...5]$ ，亦即 $M{_2}{_,}{_2}$ ，和 $A[5...8]$ ，亦即 $M{_5}{_,}{_2}$ 覆蓋。所以求一個子陣列最小值的問題就轉化成為求兩個預先儲存好兩個值的最小值的問題。 ST solution 注：該圖摘於《Faster range minimum queries》

由於對於陣列中的元素 $i{_i}{_h}$ 而言，都有 $log_2n$ 個值要儲存，所以

空間複雜度為 $O(n * log_2n)$
查詢時間複雜度為 $O(1)$
預處理複雜度，使用動態規劃來計算的話，複雜度也是 $O(n * log_2n)$

下面我們給出，這個 $O(n * log_2n)$ 空間複雜度的動態規劃演算法。根據上面 $M{_i}{_,}{_j}$ 的定義，轉移方程如下： $M{_i}{_,}{_j} = \begin{cases} M_{i, j-1} \qquad if A[M_{i,j}] <= A[M_{i + 2^{j-1}, j-1}] \\ M_{i+2^{j-1}, j-1} \end{cases}$ 注：示例陣列是從下標1開始的 例如我們要計算 $M_{2, 2}$ ，首先計算得到 $M_{2, 1} = 3$ ， $M_{4,1} = 5$ ，然後 $A[3] < A[5]$ ，所以 $M_{2,2} = M_{2,1} = 3$ 。

然後介紹一下，給定一個查詢 $RMQ_A(l, r)$ ，如何計算出能夠覆蓋該子陣列 $A[r...l]$ 的兩個已經儲存好的區間。例如我們想要求出 $A[2...8]$ 的最小值，那麼首先這個區間長度為 $i_{th}...j_{th} = 6$

RMQ(Range minimum query) based LCA solution

何為RMQ 在文章《Tarjan’s off-line lowest common ancestors algorithm》我們用圖形化的方式展示了Tarjan’s off-line LCA的求解過程，但是該文章有很多遺漏，例如下面的這些問題。在本篇文章中，我會

RMQ (Range Minimum/Maximum Query)演算法

RMQ演算法是一種查詢一個區間最值的演算法，當然是有Q次詢問，如果只詢問一次，當然直接遍歷就好了，如果是詢問很多次，這時就需要RMQ演算法了。 RMQ演算法 RMQ演算法用的是DP求解，預處理是nlogn的，查詢是O(1)。 A[i]表示要查詢的數列，F[i,j]

RMQ (Range Minimum/Maximum Query)問題的ST（Sparse Table）解法

RMQ (Range Minimum/Maximum Query)問題，就是要求：數字序列區間最值。如果直接遍歷查詢，複雜度為O(n). 對於比較大的資料和需要多次查詢的場景，都是很不理想的。常見的方法有線段樹和Sparse Tabel兩種方法。複雜度：兩種演算法都

Segment Tree Range Minimum Query.

int rangeMinQuery(int segTree[], int qlow, int qhigh, int low, int high, int pos) { if (qlow <= low && qhigh >= high) retur

（leetcode題解）Range Sum Query - Immutable

int 之間 push man color 留下 mut () ack Given an integer array nums, find the sum of the elements between indices i and j (i ≤ j), inclusive.

303. Range Sum Query - Immutable 數組範圍求和 - 不變

family elements ger mon integer ack man gin 不變 Given an integer array nums, find the sum of the elements between indices i and j (i ≤ j),

LeetCode - 307. Range Sum Query - Mutable

arr right fin 解決 dice div integer distrib anti Given an integer array nums, find the sum of the elements between indices i and j (i ≤ j),

303. Range Sum Query - Immutable

integer chang bsp mut tween elements 註意 between pub Given an integer array nums, find the sum of the elements between indices i and j (i

Range Sum Query - Immutable

clas fun chang bsp all mutable object length 可能 Given an integer array nums, find the sum of the elements between indices i and j (i ≤ j)

307. Range Sum Query - Mutable

indices mutable counter num index 二維 mod mar bit Given an integer array nums, find the sum of the elements between indices i and j (i ≤ j

Binary Indexed Tree-307. Range Sum Query - Mutable

indices odi nbsp func elements index date ott -h Given an integer array nums, find the sum of the elements between indices i and j (i ≤

307 Range Sum Query - Mutable

between brush size update com tor number cnblogs tps Given an integer array nums, find the sum of the elements between indices i and j (i

[leetcode] Range Sum Query - Immutable

cti lin [] arr change interview nts 總結 fin Given an integer array nums, find the sum of the elements between indices i and j (i ≤ j), in

CF893F Subtree Minimum Query 主席樹

mxd pen update getchar algo char insert ios 最小值如果是求和就很好做了... 不是求和也無傷大雅.... 一維太難限制條件了，考慮二維限制一維$dfs$序，一維$dep$序詢問$(x, k)$對應著在$dfs$上

[leetcode]304. Range Sum Query 2D - Immutable二維區間求和 - 不變

圖片 rectangle 元素 borde ive mat element 技術分享 red Given a 2D matrix matrix, find the sum of the elements inside the rectangle defined by its

308. Range Sum Query 2D - Mutable

Given a 2D matrix matrix, find the sum of the elements inside the rectangle defined by its upper left corner (row1, col1) and lower right corn

LeetCode : 303. 區域和檢索 - 陣列不可變（Range Sum Query - Immutable）解答

303. 區域和檢索 - 陣列不可變給定一個整數陣列 nums，求出陣列從索引 i 到 j (i ≤ j) 範圍內元素的總和，包含 i, j 兩點。示例：給定 nums = [-2, 0, 3, -5, 2, -1]，求和函式為 sumRange() su

[CF893F]Subtree Minimum Query (主席樹)

題面：傳送門：http://codeforces.com/problemset/problem/893/F 題目大意：給你一顆有根樹，點有權值，問你每個節點的子樹中距離其不超過k的點的權值的最小值。（邊權均為1，強制線上） Solution 這題很有意思。我們一般看到這種距離

[LeetCode] 304. Range Sum Query 2D - Immutable 二維區域和檢索 - 不可變 303. Range Sum Query - Immutable [LeetCode] 303. Range Sum Query - Immutable 區域和檢索 - 不可變

Given a 2D matrix matrix, find the sum of the elements inside the rectangle defined by its upper left corner (row1, col1) and lower right corner

[CF 893F] Subtree Minimum Query

Description 給定一棵有根樹，點 $x$ 有點權 $a[x]$，多組詢問，每次詢問以 $x$ 為根的子樹中的所有滿足 $dep[y]-dep[xi]<=ki$ 的 $y$ 中，最小的 $a[y]$。$n\leq10^5,q\leq10^6$。強制線上。 Solu

RMQ(Range minimum query) based LCA solution

何為RMQ

樸素解法

Record Table

block-based Table

Sqrt-based Table

Why sqrt is perfect?

泛華形式

Sparse Table

相關推薦