Linux per-CPU實現分析

阿新 • • 發佈：2019-02-08

217 static DEFINE_PER_CPU(struct runqueue, runqueues);
11 #define DEFINE_PER_CPU(type, name)
12 __attribute__((__section__(".data.percpu"))) __typeof__(type) per_cpu__##name
13

首先，在arch/i386/kernel/vmlinux.lds中有

  /* will be freed after init */

  . = ALIGN(4096);		/* Init code and data */

  __init_begin = .;



  /* 此處省略若干行:) */

  

  . = ALIGN(32);

  __per_cpu_start = .;

  .data.percpu  : { *(.data.percpu) }

  __per_cpu_end = .;

 
  . = ALIGN(4096);

  __init_end = .;

  /* freed after init ends here */

這說明__per_cpu_start和__per_cpu_end標識.data.percpu這個section的開頭和結尾
並且，整個.data.percpu這個section都在__init_begin和__init_end之間，
也就是說，該section所佔記憶體會在系統啟動後釋放(free)掉

因為有
#define DEFINE_PER_CPU(type, name)
__attribute__((__section__(".data.percpu"))) __typeof__(type) per_cpu__##name

所以
static DEFINE_PER_CPU(struct runqueue, runqueues);
會擴充套件成
__attribute__((__section__(".data.percpu"))) __typeof__(struct runqueue)
per_cpu__runqueues;
也就是在.data.percpu這個section中定義了一個變數per_cpu__runqueues，
其型別是struct runqueue。事實上，這裡所謂的變數per_cpu__runqueues，
其實就是一個偏移量，標識該變數的地址。

--------------------
其次，系統啟動後，在start_kernel()中會呼叫如下函式

unsigned long __per_cpu_offset[NR_CPUS];



static void __init setup_per_cpu_areas(void)

{

	unsigned long size, i;

	char *ptr;

	/* Created by linker magic */

	extern char __per_cpu_start[], __per_cpu_end[];



	/* Copy section for each CPU (we discard the original) */

	size = ALIGN(__per_cpu_end - __per_cpu_start, SMP_CACHE_BYTES);

#ifdef CONFIG_MODULES

	if (size < PERCPU_ENOUGH_ROOM)

		size = PERCPU_ENOUGH_ROOM;

#endif



	ptr = alloc_bootmem(size * NR_CPUS);



	for (i = 0; i < NR_CPUS; i++, ptr += size) {

		__per_cpu_offset[i] = ptr - __per_cpu_start;

		memcpy(ptr, __per_cpu_start, __per_cpu_end - __per_cpu_start);

	}

}

在該函式中，為每個CPU分配一段專有資料區，並將.data.percpu中的資料拷貝到其中，
每個CPU各有一份。由於資料從__per_cpu_start處轉移到各CPU自己的專有資料區中了，
因此存取其中的變數就不能再用原先的值了，比如存取per_cpu__runqueues
就不能再用per_cpu__runqueues了，需要做一個偏移量的調整，
即需要加上各CPU自己的專有資料區首地址相對於__per_cpu_start的偏移量。
在這裡也就是__per_cpu_offset[i]，其中CPU i的專有資料區相對於
__per_cpu_start的偏移量為__per_cpu_offset[i]。
這樣，就可以方便地計算專有資料區中各變數的新地址，比如對於per_cpu_runqueues，
其新地址即變成per_cpu_runqueues+__per_cpu_offset[i]。

經過這樣的處理，.data.percpu這個section在系統初始化後就可以釋放了。

--------------------
再看如何存取per cpu的變數

/* This macro obfuscates arithmetic on a variable address so that gcc

   shouldn't recognize the original var, and make assumptions about it */

#define RELOC_HIDE(ptr, off)					

  ({ unsigned long __ptr;					

    __asm__ ("" : "=g"(__ptr) : "0"(ptr));		

    (typeof(ptr)) (__ptr + (off)); })



/* var is in discarded region: offset to particular copy we want */

#define per_cpu(var, cpu) (*RELOC_HIDE(&per_cpu__##var, __per_cpu_offset[cpu]))

#define __get_cpu_var(var) per_cpu(var, smp_processor_id())



#define get_cpu_var(var) (*({ preempt_disable(); &__get_cpu_var(var); }))

對於__get_cpu_var(runqueues)，將等效地擴充套件為
__per_cpu_offset[smp_processor_id()] + per_cpu__runqueues
並且是一個lvalue，也就是說可以進行賦值操作。
這正好是上述per_cpu__runqueues變數在對應CPU的專有資料區中的新地址。

由於不同的per cpu變數有不同的偏移量，並且不同的CPU其專有資料區首地址不同，
因此，通過__get_cpu_var()便訪問到了不同的變數。

--END

轉至:http://www.unixresources.net/linux/clf/linuxK/archive/00/00/47/91/479165.html

Linux per-CPU實現分析

Linux per-CPU實現分析

Linux 管道的實現分析

linux路由核心實現分析(一)----鄰居子節點

Linux fsync和fdatasync系統呼叫實現分析（Ext4檔案系統）

Linux核心中的per-cpu變數

ioctl配置IP地址 Linux核心實現分析

Linux 執行緒實現機制分析(轉載)

多程序與多執行緒（八）--Linux 執行緒實現機制分析 (轉)

linux下vlan的實現分析(上)

Linux NO_HZ_FULL NO_HZ 框架實現分析

《Linux核心設計與實現》與《Linux核心原始碼情景分析》讀書筆記

Linux程序排程CFS演算法實現分析

java中遠端監控Linux主機CPU及記憶體程式碼實現

Linux核心--網路棧實現分析（二）--資料包的傳遞過程（上）

linux主機cpu 佔用過高分析

Linux核心--網路棧實現分析（十）--網路層之IP協議（下）

作業系統實戰之CPU的模式切換與Linux上的實現

linux sysctl 引數實現暨 ip_forward引數對Linux核心轉發影響分析

Linux原始套接字實現分析---轉

[Linux]用Clang實現程式碼靜態分析

Linux per-CPU實現分析

相關推薦