CS:APP--Chapter03 : machine-level representation of program - part 1 basic(1)

阿新 • • 發佈：2022-12-08

CS:APP--Chapter03 : machine-level representation of program - part 1 basic(1)

標籤（空格分隔）： CS:APP

CS:APP--Chapter03 : machine-level representation of program - part 1 basic(1)

prologue

computer can only execute machine code,which is a seuqence of binary code encoding the low-level operations such as manipulating data,manage memory,read and write data on storage devices and communicating over the internet.

most machine codes are derived from the work of compiler,which can generate machine code under the constraint of programming language,the targeted machine and operation system.

Generally speaking,(pre-processor will expand the source code to include all files declared in the source file with the command #include as well as any macros specified with #define. )compiler first convert source code to assembly code followed by invoku=ing noth an assembler and a linker to generate the executable machine code from assembly code.

What the chapter 3 is gonna tell is one system created for studying and working backwards to take a closer look into compiler.

some terms:

terms	description
x86-64	one processor named as Intel64:the 64 bits extension to IA32

3.1 historical perspective

3.2 program encdoings

GNU provides many tools for us to compile and aseemble the whole code via various cpmmands:

3.2.1 some commands

1.gcc -Og -o exe_name source1_name.c source2_name.c

gcc compiles the source code then outputs the executable file.

options	description	similar options
-Og	the level of optimization	-O1,-O2

2.gcc -Og -S source_name.c

gcc run the compiler to generate an assembly file and go no further.

3.gcc -Og -c source_name

gcc runs the compiler to generate object file in the binary format with the extension of .o.

3.2.2 machine-level code

1.several different forms of abstraction

a) What computer acts and follows can be described at machine level winthin the ISA(Instructions set architecture).

b) each instructions defines :

1. the processor state
2. the format of the instructions
3. the effect after each instruction executed on the processor state

c) virtual address space

this abstraction treats the whole memory as a very large byte array.(actual implement will cover in the chapter9 but for better understanding virtual treatment is accpeted.)

2.ISA : assembly language

the whole brief process from source code to machine code also reveals the even close relation between assembly code and machine code.Their main distinction is assembly code is representated as texture in an easily decipherable way whereas machine code is representated in binary format.

3.machine code versus C language

There are four parts of the processor state visable in machine code &&assembly code but hidden from C language:

No.	name	description
1	PC(program counter)	where the PC points at is the next instruction will be executed
2	rgister file	16 named registers of the length 64 bits[x86-64]
3	condition code register(flag register)	record the program state
4	vector registers	????no idea!

Even though C provideds a tons of data types , for example ,interger,float,array,customized data types,machine language just treats them as a continous array of bytes in virtual address space.

4.run-time stack

It is run-time stack which is crucial to manage procudures and returns and parameters as so forth.

3.2.3 codes example

The code of this chapter03 is that the program executed by compter is simplely a sequence of binary code.Nothing more,nothing less.

the whole process of C compiler:

code.c -> code.s -> code.o ->code.exe

1. one tool : disassembler

disassembler is a tool provided by gdb where it can generate assembly code from executable file.

One important point:disassembler identifies the end of some segment extends with nop,which means no operation will happen here,the main purpose of it is make a better placement for the next segment in terms of managing system performance.

2. x86-64 instructions

1.Instructions ranges in length from 1 to 15 bytes.A commonly used instructions with fewer operands has a smaller number of bytes than a less commonly used ones or ones with more operands.

2.Every instruction has its unique decoding of bytes.

3.Disassembler only decode bytes in executable file without any access to assembly code generated by C compiler.

4.Different naming convention between disassembler and compiler.

3.2.4 assembly code

Not only some information we donot need to concern about but also no any readible texture can impende the understanding to it.So it's important to learn how to read these assembly code.

easy instructions like mov,add and so on
any line begins with "." are the directives to guide the linker and assembler.(CSAPP suggest that it's better to ignore these kinds of directives.)
a brilliant stylized version provided on page.212
how to incorporate assembly code into c code? =>combine assembly code of one complete function with the c code DURING the linking stage.

3.3 Data formats

Consider this issues with the knowledge of assembly language we learnt,it's similar to the topic of how to move immediate number into the main memory?

One question arise:
If 1 of int is ready to put into main memory,how to specify the length of 4 bytes?

solution provided by asm:(assume number 1 of 4 bytes)

;solution
mov dword ptr ds:[ea] , 1

=>the size of operand must be specified .

3.3.1 several types of data formats(x86-64)

bit:the smallest unit of describing computer storage device.

data size	description
byte	8 bits
word	16 bits - 2 bytes
dword(double words)	2 words-4 bytes
qword(quad words)	4 words-8 bytes

3.3.2 the suffixes to the instructions in assembly code

movw ds:[ea],1
;equivalent to 
mov word ptr ds:[ea],1

3.3.3 assembly instruction versus C language

c data type	data size	intel asm suffix	bytes
char	byte	b	1
short	word	w	2
int	dword	l^[1]	4
long	quad word	q	8
char*	quad word	q	8
float	dword	l	4
double	qword	q	8

Question :
integer and point number appears to be dufferent in terms of instructions....no detail so far!

l stands for "long words" ↩︎

CS:APP--Chapter03 : machine-level representation of program - part 1 basic(1)

CS:APP--Chapter03: machine-level representation of program - part 1 basic(1) 標籤（空格分隔）： CS:APP

Study Notes of CS:APP (Till Book 3.8 & Lecture 8.1, Regularly Updated)

Computer Systems: A Programmer\'s Perspective, Third Edition, Pearson, 2016 15-213/18-213: Introduction to Computer Systems (ICS)

Study Notes of CS:APP (Till Book 3 & Lecture 9, Regularly Updated)

Computer Systems: A Programmer\'s Perspective, Third Edition, Pearson, 2016 15-213/18-213: Introduction to Computer Systems (ICS)

【Azure 應用服務】App Service .NET Core專案在Program.cs中自定義新增的logger.LogInformation,部署到App Service上後日志不顯示Log Stream中的問題

問題描述在.Net Core 5.0 專案中，新增Microsoft.Extensions.Logging.AzureAppServices 和Microsoft.Extensions.Logging.Abstractions外掛，並且在專案中新增logging.AddAzureWebAppDiagnostics()

[LeetCode] 1161. Maximum Level Sum of a Binary Tree 最大層內元素和

Given therootof a binary tree, the level of its root is1, the level of its children is2, and so on. Return thesmallestlevelxsuch that the sum of all the values of nodes at levelxismaximal.

TS2Vec: Towards Universal Representation of Time Series 論文翻譯

重點： ①有的模型只能進行instance級別的representation，本文是任意層級 ②選取positive pair 的原則是：模型根據不同上下文對於同一個時間戳的representation應當一致。

Structured data representation of python

Structured data https://databricks.com/blog/2017/02/23/working-complex-data-formats-structured-streaming-apache-spark-2-1.html

論文《Instance-level Human Parsing via Part Grouping Network》復現

目錄論文《Instance-level Human Parsing via Part Grouping Network》復現資料準備環境配置執行前準備test_pgn.py執行test_pgn.py執行結果未完待續...參考文獻

Select Screen 0 with xrandr Ask QuestionScreen 0" here describes your whole virtual display made of these two outputs: eDP-1-

Screen0\" here describes your whole virtual display made of these two outputs: eDP-1-1: physicalscreenplugged to a display-port output

CS:APP--Chapter03 : machine-level representation of program - part 1 basic(1)

CS:APP--Chapter03 : machine-level representation of program - part 1 basic(1)

prologue

3.1 historical perspective

3.2 program encdoings

3.2.1 some commands

3.2.2 machine-level code

1.several different forms of abstraction

2.ISA : assembly language

3.machine code versus C language

4.run-time stack

3.2.3 codes example

1. one tool : disassembler

2. x86-64 instructions

3.2.4 assembly code

3.3 Data formats

3.3.1 several types of data formats(x86-64)

3.3.2 the suffixes to the instructions in assembly code

3.3.3 assembly instruction versus C language

CS:APP--Chapter03 : machine-level representation of program - part 1 basic(1)

Study Notes of CS:APP (Till Book 3.8 & Lecture 8.1, Regularly Updated)

Study Notes of CS:APP (Till Book 3 & Lecture 9, Regularly Updated)

【Azure 應用服務】App Service .NET Core專案在Program.cs中自定義新增的logger.LogInformation,部署到App Service上後日志不顯示Log Stream中的問題

[LeetCode] 1161. Maximum Level Sum of a Binary Tree 最大層內元素和

TS2Vec: Towards Universal Representation of Time Series 論文翻譯

Structured data representation of python

論文《Instance-level Human Parsing via Part Grouping Network》復現

Select Screen 0 with xrandr Ask QuestionScreen 0" here describes your whole virtual display made of these two outputs: eDP-1-

uniapp使用外掛小程式正常 app報錯cid unmatched at view.umd.min.js:1

An analysis of a simple Java basic interview question: short s1=1; s1 = s1 +1 will report an error?

linux 下 docker 版的 sqlserver 執行報錯：This program requires a machine with at least 2000 megabytes of memory.

C# read and compute the code lines number of cs files based on given directory

UAC即Windows 使用者帳戶控制級別以及app.manifest清單選項requestedExecutionLevel level="requireAdministrator" uiAccess="true"說明

netcore---Program.cs配置相關資訊，及讀取配置資訊

PAT(Advanced Level)A1053. Path of Equal Weight

解決Java compiler level does not match the version of the installed Java project facet.問題

HypoML: Visual Analysis for Hypothesis-based Evaluation of Machine Learning Models

解決java compiler level does not match the version of the installed java project facet

uni-app打包報錯Caused by: com.android.tools.r8.errors.CompilationError: Program type already present:

CS:APP--Chapter03 : machine-level representation of program - part 1 basic(1)

CS:APP--Chapter03 : machine-level representation of program - part 1 basic(1)

prologue

3.1 historical perspective

3.2 program encdoings

3.2.1 some commands

3.2.2 machine-level code

1.several different forms of abstraction

2.ISA : assembly language

3.machine code versus C language

4.run-time stack

3.2.3 codes example

1. one tool : disassembler

2. x86-64 instructions

3.2.4 assembly code

3.3 Data formats

3.3.1 several types of data formats(x86-64)

3.3.2 the suffixes to the instructions in assembly code

3.3.3 assembly instruction versus C language

相關推薦