This is Chuanqi's Blog

1. Fermi architecture


[Figure: Fermi architecture diagram]

SM

  • SM: a streaming multiprocessor with multiple processing cores
    • Each SM contains 32 processing cores
    • Executes in a Single Instruction Multiple Thread (SIMT) fashion
    • Up to 16 SMs on a card, for a maximum of 512 compute cores
    • Instruction cache (?K): caches instructions
    • Warp scheduler: schedules warps for execution
    • Dispatch unit: sends instructions to the warp that is about to execute
    • Register file
    • Core: also called a streaming processor (SP), roughly equivalent to a CPU's ALU
    • LD/ST: load and store units, responsible for memory access
    • SFU: special function unit, for functions such as cos and sin
    • L1 cache / shared memory: 64 KB, configurable
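
The figures in the list above can be compared against whatever GPU is at hand by querying the CUDA runtime. The following is a minimal sketch, assuming a CUDA toolkit and at least one CUDA-capable device. The runtime does not report cores per SM, so the value 32 is hard-coded from the list above and only applied to compute capability 2.0; `kFermiCoresPerSM` is a name made up for this example.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Assumption taken from the list above: 32 cores per SM on Fermi (compute capability 2.0).
// The runtime does not expose this number, so it is hard-coded here.
constexpr int kFermiCoresPerSM = 32;

int main() {
    int device_count = 0;
    cudaError_t err = cudaGetDeviceCount(&device_count);
    if (err != cudaSuccess) {
        std::fprintf(stderr, "cudaGetDeviceCount failed: %s\n", cudaGetErrorString(err));
        return 1;
    }
    if (device_count == 0) {
        std::fprintf(stderr, "No CUDA-capable device found\n");
        return 1;
    }

    cudaDeviceProp prop;
    err = cudaGetDeviceProperties(&prop, 0);  // query device 0
    if (err != cudaSuccess) {
        std::fprintf(stderr, "cudaGetDeviceProperties failed: %s\n", cudaGetErrorString(err));
        return 1;
    }

    std::printf("Device:                  %s\n", prop.name);
    std::printf("Compute capability:      %d.%d\n", prop.major, prop.minor);
    std::printf("SM count:                %d\n", prop.multiProcessorCount);
    std::printf("Warp size:               %d\n", prop.warpSize);
    std::printf("Registers per block:     %d\n", prop.regsPerBlock);
    std::printf("Shared memory per block: %zu bytes\n", prop.sharedMemPerBlock);
    std::printf("L2 cache size:           %d bytes\n", prop.l2CacheSize);

    // Only meaningful for compute capability 2.0; other architectures
    // have a different number of cores per SM.
    if (prop.major == 2 && prop.minor == 0) {
        std::printf("Total cores (SMs x %d):  %d\n", kFermiCoresPerSM,
                    prop.multiProcessorCount * kFermiCoresPerSM);
    }
    return 0;
}
```

On a full Fermi part with 16 SMs, the last line would correspond to the 16 × 32 = 512 cores mentioned above; on any other architecture the program simply skips that line.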

Cache description for compute capability 2.x (Fermi)

Constant cache

A multiprocessor also has a read-only constant cache that is shared by all functional units and speeds up reads 
from the constant memory space, which resides in device memory.

Data cache

There is an L1 cache for each multiprocessor and an L2 cache shared by all multiprocessors, 
both of which are used to cache accesses to local or global memory, including temporary register spills. 
The cache behavior (e.g., whether reads are cached in both L1 and L2 or in L2 only) can be partially configured on 
a per-access basis using modifiers to the load or store instruction.

The same on-chip memory is used for both L1 and shared memory: It can be configured as 48 KB of shared memory and 16 KB of L1 cache
or as 16 KB of shared memory and 48 KB of L1 cache, using cudaFuncSetCacheConfig()/cuFuncSetCacheConfig().
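
As an illustration of the runtime call named above, here is a minimal sketch that asks for the larger shared-memory split before launching a kernel. The kernel `scale_kernel`, the helper `check`, and the constant `kBlockSize` are hypothetical names made up for this example; only `cudaFuncSetCacheConfig()` and the `cudaFuncCachePreferShared` / `cudaFuncCachePreferL1` enumerators come from the CUDA runtime API. The setting is a preference, and devices where the split is not configurable may ignore it.

```cuda
#include <cstdio>
#include <vector>
#include <cuda_runtime.h>

constexpr int kBlockSize = 256;  // hypothetical block size for this example

// Hypothetical kernel: stages data in shared memory, then scales it.
__global__ void scale_kernel(float* data, int n, float factor) {
    __shared__ float tile[kBlockSize];
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) tile[threadIdx.x] = data[i];
    __syncthreads();
    if (i < n) data[i] = tile[threadIdx.x] * factor;
}

// Small helper that reports a failed CUDA call.
static bool check(cudaError_t err, const char* what) {
    if (err == cudaSuccess) return true;
    std::fprintf(stderr, "%s failed: %s\n", what, cudaGetErrorString(err));
    return false;
}

int main() {
    // Prefer 48 KB shared memory / 16 KB L1 for this kernel.
    // Use cudaFuncCachePreferL1 for the opposite split.
    if (!check(cudaFuncSetCacheConfig(scale_kernel, cudaFuncCachePreferShared),
               "cudaFuncSetCacheConfig")) return 1;

    const int n = 1024;
    std::vector<float> host(n, 1.0f);
    float* dev = nullptr;
    if (!check(cudaMalloc(&dev, n * sizeof(float)), "cudaMalloc")) return 1;
    if (!check(cudaMemcpy(dev, host.data(), n * sizeof(float),
                          cudaMemcpyHostToDevice), "cudaMemcpy H2D")) return 1;

    const int blocks = (n + kBlockSize - 1) / kBlockSize;
    scale_kernel<<<blocks, kBlockSize>>>(dev, n, 2.0f);
    if (!check(cudaGetLastError(), "kernel launch")) return 1;
    if (!check(cudaDeviceSynchronize(), "cudaDeviceSynchronize")) return 1;

    if (!check(cudaMemcpy(host.data(), dev, n * sizeof(float),
                          cudaMemcpyDeviceToHost), "cudaMemcpy D2H")) return 1;
    cudaFree(dev);

    std::printf("host[0] = %.1f\n", host[0]);
    return 0;
}
```

A kernel that relies heavily on shared-memory tiling would typically ask for `cudaFuncCachePreferShared`, while one dominated by register spills or irregular global loads may benefit more from `cudaFuncCachePreferL1`; which is better for a given kernel has to be measured.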

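b) Kepler architecture
c) Maxwell
d) The latest Pascal architecture
e) A few words on SP, SM, SFU, and LD/ST
f) Register file
g) Shared memory / L1 cache
h) L2 cache
2. GPU execution flow
a) Instruction fetch
b) Decode
c) Execute
d) Write-back
e) Characteristics of warp scheduling
f) Characteristics of memory request coalescing
g) Handling of warp divergence
3. The memory hierarchy: main characteristics of each level, and the problems found
a) On-chip storage
i. Register file
ii. Shared memory
iii. L1 data cache
iv. Bypass
b) Off-chip storage
i. L2 cache
ii. DRAM scheduling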
