The Hash Linked List (hlist) and Its Variant

阿新 • Published: 2019-01-31

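Preface

First, a side-by-side look at the ordinary linked list and the hash linked list:

Ordinary linked list

In an ordinary linked list, the list head and the nodes share the same type.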

struct list_head {
    struct list_head *next, *prev;
};

Hash linked list

List head of a hash linked list

struct hlist_head {
    struct hlist_node *first;
};

Node of a hash linked list

struct hlist_node {
    struct hlist_node *next, **pprev;
};

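Design rationale

The designers of the Linux linked list considered a circular doubly linked list with a two-pointer head too wasteful for hash tables, so they designed a separate data structure for hash tables, hlist: a doubly linked list with a single-pointer head, terminated by NULL rather than circular. An hlist head holds only a pointer to the first node and no pointer to the tail node, so in a hash table with a huge number of buckets the stored heads take half the space.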
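The space saving can be checked with sizeof. The following is a minimal sketch, assuming the three layouts shown above; the helper names list_table_bytes and hlist_table_bytes are hypothetical and exist only for this illustration.

```c
#include <stddef.h>

/* Same layouts as the definitions above. */
struct list_head  { struct list_head *next, *prev; };
struct hlist_node { struct hlist_node *next, **pprev; };
struct hlist_head { struct hlist_node *first; };

/* Hypothetical helper: bytes used by a bucket array of two-pointer heads. */
static inline size_t list_table_bytes(size_t nbuckets)
{
    return nbuckets * sizeof(struct list_head);
}

/* Hypothetical helper: bytes used by a bucket array of one-pointer heads. */
static inline size_t hlist_table_bytes(size_t nbuckets)
{
    return nbuckets * sizeof(struct hlist_head);
}
```

On common platforms an hlist_head is exactly one pointer wide, so a table of hlist_head buckets needs half the memory of the same table built from list_head. The nodes themselves still carry two pointers each.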

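One more point deserves attention: struct hlist_node **pprev. That is, pprev is a pointer to the next pointer of the previous node (or to the first pointer of the list head).

Q: Why not use struct hlist_node *prev and let prev point to the previous node?

A: Because the list head (hlist_head) and the node (hlist_node) are different types. A struct hlist_node *prev only works when the previous element is a node, not when it is the list head. Casting pointer types on every operation would be tedious.

What is needed is a uniform operation that does not care whether the previous element is a node or the list head.

With struct hlist_node **pprev, pprev points to the previous element's next pointer (or, for the first node, to the head's first pointer), whichever kind of element that is.

Whenever the previous element (node or head) has to be updated, *(node->pprev) reads or modifies its next (or first) pointer in the same way.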
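The uniformity of pprev can be seen in a small sketch. hlist_add_head below is modelled on the kernel function of that name; unlink_node and pprev_demo are hypothetical names used only for this illustration.

```c
#include <stddef.h>

struct hlist_node { struct hlist_node *next, **pprev; };
struct hlist_head { struct hlist_node *first; };

/* Insert n at the front of the list (modelled on the kernel's hlist_add_head). */
static inline void hlist_add_head(struct hlist_node *n, struct hlist_head *h)
{
    struct hlist_node *first = h->first;

    n->next = first;
    if (first)
        first->pprev = &n->next;
    h->first = n;
    n->pprev = &h->first;
}

/* Unlink n: the same stores work whether n follows the head or a node. */
static inline void unlink_node(struct hlist_node *n)
{
    *n->pprev = n->next;
    if (n->next)
        n->next->pprev = n->pprev;
}

/* Hypothetical demo: returns 1 if pprev behaves as described in the text. */
static inline int pprev_demo(void)
{
    struct hlist_head head = { NULL };
    struct hlist_node a, b;

    hlist_add_head(&b, &head);   /* head -> b */
    hlist_add_head(&a, &head);   /* head -> a -> b */

    /* a follows the head, b follows a node; both are reached via pprev. */
    if (a.pprev != &head.first || b.pprev != &a.next)
        return 0;

    unlink_node(&a);             /* previous element is the head itself */
    return head.first == &b && b.pprev == &head.first;
}
```

No branch in unlink_node asks whether the previous element is a head or a node, which is the point of storing a pointer to a pointer.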

[Figure: schematic of the hlist structure]

Common operations

(1) Initialization

/*
 * Double linked lists with a single pointer list head.
 * Mostly useful for hash tables where the two pointer list head is too wasteful.
 * You lose the ability to access the tail in O(1).
 */
#define HLIST_HEAD_INIT { .first = NULL }
#define HLIST_HEAD(name) struct hlist_head name = { .first = NULL }
#define INIT_HLIST_HEAD(ptr) ((ptr)->first = NULL)

(2) Insertion

/* next must be != NULL */
static inline void hlist_add_before(struct hlist_node *n, struct hlist_node *next)
{
    n->pprev = next->pprev;
    n->next = next;
    next->pprev = &n->next;
    *(n->pprev) = n;
}
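A short usage sketch of hlist_add_before, assuming the definitions above. The list is seeded by hand with one node so that the example depends on nothing else; add_before_demo is a hypothetical name.

```c
#include <stddef.h>

struct hlist_node { struct hlist_node *next, **pprev; };
struct hlist_head { struct hlist_node *first; };

/* Same body as the function above: insert n before next (next != NULL). */
static inline void hlist_add_before(struct hlist_node *n, struct hlist_node *next)
{
    n->pprev = next->pprev;
    n->next = next;
    next->pprev = &n->next;
    *(n->pprev) = n;
}

/* Hypothetical demo: returns 1 if the links are as expected afterwards. */
static inline int add_before_demo(void)
{
    struct hlist_head head = { NULL };
    struct hlist_node first, second;

    /* Seed the list by hand: head -> second */
    second.next = NULL;
    second.pprev = &head.first;
    head.first = &second;

    hlist_add_before(&first, &second);   /* head -> first -> second */

    return head.first == &first
        && first.next == &second
        && first.pprev == &head.first
        && second.pprev == &first.next
        && second.next == NULL;
}
```

Because the last store goes through n->pprev, inserting before the first node updates head.first without any special case.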

(3) Deletion

static inline void __hlist_del(struct hlist_node *n)
{
    struct hlist_node *next = n->next;
    struct hlist_node **pprev = n->pprev;
    *pprev = next;
    if (next)
        next->pprev = pprev;
}

(4) Traversal

#define offsetof(TYPE, MEMBER) ((size_t) &((TYPE *) 0)->MEMBER)

/*
 * container_of - cast a member of a structure out to the containing structure
 * @ptr: the pointer to the member.
 * @type: the type of the container struct this is embedded in.
 * @member: the name of the member within the struct.
 */
#define container_of(ptr, type, member) ({    \
    const typeof(((type *) 0)->member) * __mptr = (ptr);    \
    (type *) ((char *) __mptr - offsetof(type, member)); })

#define hlist_entry(ptr, type, member) container_of(ptr, type, member)

#define hlist_for_each(pos, head) \
    for (pos = (head)->first; pos; pos = pos->next)
 
/**
 * hlist_for_each_entry - iterate over list of given type
 * @tpos: the type * to use as a loop cursor.
 * @pos: the &struct hlist_node to use as a loop cursor.
 * @head: the head for your list.
 * @member: the name of the hlist_node within the struct.
 */
#define hlist_for_each_entry(tpos, pos, head, member)    \
    for (pos = (head)->first;    \
           pos && ({ tpos = hlist_entry(pos, typeof(*tpos), member); 1;});    \
           pos = pos->next)
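To show how the traversal macros are used in practice, here is a minimal sketch of a tiny hash table in standard C. It replaces the GNU-specific typeof with a plain cast; struct item, NBUCKETS, table, item_insert and item_lookup are all hypothetical names introduced for the example.

```c
#include <stddef.h>

struct hlist_node { struct hlist_node *next, **pprev; };
struct hlist_head { struct hlist_node *first; };

/* Standard-C variant of hlist_entry (no typeof type check). */
#define hlist_entry(ptr, type, member) \
    ((type *)((char *)(ptr) - offsetof(type, member)))

#define hlist_for_each(pos, head) \
    for (pos = (head)->first; pos; pos = pos->next)

#define NBUCKETS 8            /* hypothetical table size */

struct item {                 /* hypothetical payload type */
    int key;
    int value;
    struct hlist_node link;   /* embedded list node */
};

static struct hlist_head table[NBUCKETS];   /* zero-initialised: all empty */

/* Put an item at the front of its bucket's chain. */
static inline void item_insert(struct item *it)
{
    struct hlist_head *h = &table[(unsigned)it->key % NBUCKETS];

    it->link.next = h->first;
    if (h->first)
        h->first->pprev = &it->link.next;
    h->first = &it->link;
    it->link.pprev = &h->first;
}

/* Walk one chain and map each node back to its containing item. */
static inline struct item *item_lookup(int key)
{
    struct hlist_node *pos;

    hlist_for_each(pos, &table[(unsigned)key % NBUCKETS]) {
        struct item *it = hlist_entry(pos, struct item, link);
        if (it->key == key)
            return it;
    }
    return NULL;
}
```

The list node is embedded in the payload, so the list code never allocates memory; hlist_entry recovers the payload from the node's address by subtracting the member offset.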

Hash linked list variant

The author's own description:

Special version of lists, where end of list is not a NULL pointer,
but a 'nulls' marker, which can have many different values.
(up to 2^31 different values guaranteed on all platforms)

In the standard hlist, termination of a list is the NULL pointer.
In this special 'nulls' variant, we use the fact the objects stored in
a list are aligned on a word (4 or 8 bytes alignment).
We therefore use the least significant bit of 'ptr':
Set to 1: This is a 'nulls' end-of-list marker (ptr >> 1)
Set to 0: This is a pointer to some object (ptr)

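Design rationale

When traversing a standard hash linked list, a NULL node pointer means the end of the list has been reached.

The variant differs from the standard hash linked list in that the list does not end with NULL. If the lowest bit of a first or next pointer is 1, the traversal has reached the end of the list.

Q: Why can the lowest bit of a node pointer tell us whether the list has ended?

A: Because a node is made of pointers, it is aligned on at least 4 bytes (32-bit machines) or 8 bytes (64-bit machines), so the lowest bit of a valid node pointer is always 0. A value whose lowest bit is set to 1 can therefore never be a real node pointer and can serve as the end marker.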

/* List head */
struct hlist_nulls_head {
    struct hlist_nulls_node *first;
};

/* Node */
struct hlist_nulls_node {
    struct hlist_nulls_node *next, **pprev;
};
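The alignment argument can be checked directly. The following is a rough sketch, assuming the node layout above; low_bit and nodes_have_clear_low_bit are hypothetical helpers, and the check only samples a few stack and heap addresses rather than proving the property.

```c
#include <stdint.h>
#include <stdlib.h>

struct hlist_nulls_node { struct hlist_nulls_node *next, **pprev; };

/* Lowest bit of an address: 0 for any properly aligned node. */
static inline int low_bit(const void *p)
{
    return (int)((uintptr_t)p & 1);
}

/* Hypothetical check: returns 1 if every sampled node address has bit 0 clear. */
static inline int nodes_have_clear_low_bit(void)
{
    struct hlist_nulls_node on_stack[3];
    struct hlist_nulls_node *on_heap = malloc(sizeof(*on_heap));
    int ok = (on_heap != NULL);
    size_t i;

    for (i = 0; i < 3; i++)
        ok = ok && !low_bit(&on_stack[i]);
    ok = ok && !low_bit(on_heap);

    free(on_heap);
    return ok;
}
```

Since bit 0 is never set in a real node address, it is free to carry the "this is an end marker" flag.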

[Figure: schematic of the hlist_nulls structure]

Common operations

(1) Initialization

#define INIT_HLIST_NULLS_HEAD(ptr, nulls)    \
    ((ptr)->first = (struct hlist_nulls_node *) (1UL | (((long) nulls) << 1)))

(2) Testing for the end marker

/*
 * is_a_nulls - Test if a ptr is a nulls
 * @ptr: ptr to be tested
 */
static inline int is_a_nulls(const struct hlist_nulls_node *ptr)
{
    return ((unsigned long) ptr & 1);
}

(3) Getting the end marker value

/*
 * get_nulls_value - Get the 'nulls' value of the end of chain
 * @ptr: end of chain
 * Should be called only if is_a_nulls(ptr);
 */
static inline unsigned long get_nulls_value(const struct hlist_nulls_node *ptr)
{
    return ((unsigned long)ptr) >> 1;
}
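The three helpers so far form a round trip: a value is encoded into the head at initialization and decoded again at the end of the chain. A minimal sketch, assuming the definitions above; nulls_roundtrip is a hypothetical name.

```c
struct hlist_nulls_node { struct hlist_nulls_node *next, **pprev; };
struct hlist_nulls_head { struct hlist_nulls_node *first; };

#define INIT_HLIST_NULLS_HEAD(ptr, nulls) \
    ((ptr)->first = (struct hlist_nulls_node *) (1UL | (((long) nulls) << 1)))

static inline int is_a_nulls(const struct hlist_nulls_node *ptr)
{
    return ((unsigned long) ptr & 1);
}

static inline unsigned long get_nulls_value(const struct hlist_nulls_node *ptr)
{
    return ((unsigned long) ptr) >> 1;
}

/*
 * Hypothetical helper: initialise an empty head whose end marker carries
 * the value i, then read the value back from the marker.
 */
static inline unsigned long nulls_roundtrip(unsigned long i)
{
    struct hlist_nulls_head h;

    INIT_HLIST_NULLS_HEAD(&h, i);
    if (!is_a_nulls(h.first))
        return (unsigned long) -1;   /* would mean the encoding is broken */
    return get_nulls_value(h.first);
}
```

A typical choice, assumed here rather than stated in the text, is to use the bucket index as the value, so that each chain of a hash table ends in a marker that identifies its own bucket.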

(4) Insertion

Insert node n as the first node of the list.

static inline void hlist_nulls_add_head(struct hlist_nulls_node *n, struct hlist_nulls_head *h)
{
    struct hlist_nulls_node *first = h->first;
    n->next = first;
    n->pprev = &h->first;
    h->first = n;
    if (! is_a_nulls(first))
        first->pprev = &n->next;
}

(5) Deletion

/*
 * These are non-NULL pointers that will result in page faults
 * under normal circumstances, used to verify that nobody uses
 * non-initialized list entries.
 */
#define LIST_POISON1 ((void *) 0x00100100 + POISON_POINTER_DELTA)
#define LIST_POISON2 ((void *) 0x00200200 + POISON_POINTER_DELTA)

static inline void __hlist_nulls_del(struct hlist_nulls_node *n)
{
    struct hlist_nulls_node *next = n->next;
    struct hlist_nulls_node **pprev = n->pprev;
    *pprev = next;
    if (! is_a_nulls(next))
        next->pprev = pprev;
}

static inline void hlist_nulls_del(struct hlist_nulls_node *n)
{
    __hlist_nulls_del(n);
    n->pprev = LIST_POISON2; /* prevent further access to the list through n */
}
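Insertion and deletion can be exercised together in a small sketch, assuming the functions above. The marker value 7 and the name nulls_del_demo are arbitrary choices made for this illustration.

```c
struct hlist_nulls_node { struct hlist_nulls_node *next, **pprev; };
struct hlist_nulls_head { struct hlist_nulls_node *first; };

static inline int is_a_nulls(const struct hlist_nulls_node *ptr)
{
    return ((unsigned long) ptr & 1);
}

static inline void hlist_nulls_add_head(struct hlist_nulls_node *n,
                                        struct hlist_nulls_head *h)
{
    struct hlist_nulls_node *first = h->first;

    n->next = first;
    n->pprev = &h->first;
    h->first = n;
    if (!is_a_nulls(first))
        first->pprev = &n->next;
}

static inline void __hlist_nulls_del(struct hlist_nulls_node *n)
{
    struct hlist_nulls_node *next = n->next;
    struct hlist_nulls_node **pprev = n->pprev;

    *pprev = next;
    if (!is_a_nulls(next))
        next->pprev = pprev;
}

/* Hypothetical demo: returns 1 if the end marker survives all operations. */
static inline int nulls_del_demo(void)
{
    struct hlist_nulls_head h;
    struct hlist_nulls_node a, b;

    h.first = (struct hlist_nulls_node *) (1UL | (7UL << 1)); /* empty, marker 7 */
    hlist_nulls_add_head(&b, &h);   /* h -> b -> nulls(7) */
    hlist_nulls_add_head(&a, &h);   /* h -> a -> b -> nulls(7) */

    __hlist_nulls_del(&b);          /* b's next is the marker: no pprev fix-up */
    if (h.first != &a || !is_a_nulls(a.next))
        return 0;

    __hlist_nulls_del(&a);          /* list is empty again */
    return is_a_nulls(h.first) && ((unsigned long) h.first >> 1) == 7;
}
```

The marker is never dereferenced: both functions test is_a_nulls before touching first->pprev or next->pprev, exactly where the standard hlist tests for NULL.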

(6) Traversal

Essentially the same as for the standard hash linked list.

hlist_nulls_for_each_entry(tpos, pos, head, member)

hlist_nulls_for_each_entry_from(tpos, pos, member)
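The text does not show the bodies of these macros, so the following is only a simplified sketch of the idea in standard C: the loop stops when it meets a marker instead of NULL, and the marker value is available afterwards. nulls_for_each, struct item and sum_keys are hypothetical names, not the kernel's definitions.

```c
#include <stddef.h>

struct hlist_nulls_node { struct hlist_nulls_node *next, **pprev; };
struct hlist_nulls_head { struct hlist_nulls_node *first; };

static inline int is_a_nulls(const struct hlist_nulls_node *ptr)
{
    return ((unsigned long) ptr & 1);
}

/* Simplified, hypothetical node-level loop: ends at the marker, not at NULL. */
#define nulls_for_each(pos, head) \
    for (pos = (head)->first; !is_a_nulls(pos); pos = pos->next)

struct item {                        /* hypothetical payload type */
    int key;
    struct hlist_nulls_node link;
};

/* Sum all keys on the chain and report the end marker value in *marker. */
static inline int sum_keys(struct hlist_nulls_head *h, unsigned long *marker)
{
    struct hlist_nulls_node *pos;
    int sum = 0;

    nulls_for_each(pos, h) {
        struct item *it =
            (struct item *) ((char *) pos - offsetof(struct item, link));
        sum += it->key;
    }
    *marker = (unsigned long) pos >> 1;   /* pos now holds the marker */
    return sum;
}
```

After the loop, pos is the marker itself, so the caller can compare its value with the one stored when the chain's head was initialized.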

Author

zhangskd @ csdn blog
