mirror of https://github.com/pkivolowitz/asm_book.git synced 2026-06-21 03:36:49 +08:00

History

Perry Kivolowitz a7e89718b1 corrected layout of floats for AARCH64		2024-02-15 13:14:59 -06:00
..
.vscode	added expansion of ldr =	2022-08-25 10:23:54 -05:00
apple-linux-convergence.S	added MOD to apple-linux	2023-03-31 09:42:08 -05:00
asm_rounding.S	links	2023-04-06 18:53:40 -05:00
away.s	moved float into its own section	2022-07-27 09:23:55 -05:00
float_dump.cpp	big refactoring and sprucing up of programs and text	2023-01-20 17:59:43 -06:00
floatster.cpp	moved float into its own section	2022-07-27 09:23:55 -05:00
fmov.md	more about fmov but still mystified about mov immediate	2023-04-05 11:17:24 -05:00
fmov.pdf	freshened pdf	2023-04-05 11:18:23 -05:00
frintp.s	moved float into its own section	2022-07-27 09:23:55 -05:00
gdb01.png	added expansion of ldr =	2022-08-25 10:23:54 -05:00
half.md	various improvements to the floating point chapters	2022-12-26 13:18:54 -06:00
half.pdf	updated PDFs	2023-01-18 08:26:45 -06:00
literals.md	freshened literals.md	2023-04-05 11:28:05 -05:00
literals.pdf	freshened literals.md	2023-04-05 11:28:05 -05:00
literals.S	links	2023-04-06 18:53:40 -05:00
notes.md	found a nice source to cite someday	2023-01-18 17:42:03 -06:00
notes.pdf	last time a full update of pdf will be done.	2023-01-19 09:40:45 -06:00
README.md	will NOT be covering SVE	2023-02-04 17:36:55 -06:00
README.pdf	will NOT be covering SVE	2023-02-04 17:36:55 -06:00
regs.png	moved float into its own section	2022-07-27 09:23:55 -05:00
rounding.cpp	moved float into its own section	2022-07-27 09:23:55 -05:00
rounding.md	big refactoring and sprucing up of programs and text	2023-01-20 17:59:43 -06:00
rounding.pdf	freshened the pdfs	2023-01-20 18:00:16 -06:00
simdlanes.jpg	corrected layout of floats for AARCH64	2024-02-15 13:14:59 -06:00
t.s	exploring mov	2023-04-05 11:17:47 -05:00
test.cpp	big refactoring and sprucing up of programs and text	2023-01-20 17:59:43 -06:00
test.s	added fmov chapter	2023-01-23 17:58:56 -06:00
what.md	big refactoring and sprucing up of programs and text	2023-01-20 17:59:43 -06:00
what.pdf	freshened the pdfs	2023-01-20 18:00:16 -06:00
working.md	corrected layout of floats for AARCH64	2024-02-15 13:14:59 -06:00
working.pdf	corrected layout of floats for AARCH64	2024-02-15 13:14:59 -06:00

README.md

Section 2 / Floating Point

This chapter deals exclusively with the handling of floating point operations on the AARCH64 platform.

What are Floating Point Numbers?

Let's first begin with an understanding of what floating point numbers are. That can be found here.

The TL;DR is that floating point numbers are approximations with double precision being better approximations that single precision.

Floating point numbers have a sign bit (for signed floating points), an exponent which controls the range of representable numbers, and a mantissa that controls precision.

Floating Point Registers

There are 31 general registers (the X and W registers). Similarly there are 31 floating point registers which are reused for single, double and vector (SIMD - Single Instruction Multiple Data) instructions.

A bit more detail is provided here.

Rounding and Truncation

Truncation is part of casting float and double to int and long.

Rounding is important too.

Coverage on rounding and truncation is found here.

Loading Floating Point Numbers into Registers

This is a little confusing because some values can be loaded from arguments in the fmov instruction. For example, 1.0 can be fmoved. Trying to do the same for 1.1 will fail. Remember that AARCH64 instructions are always 32 bits wide and that floating point numbers are at least that size.

This chapter covers the loading of floating numbers into registers. A sample program is linked below.

Nuances of `fmov`

As indicated above, you can fmov a floating point literal into a register. Except when you can't. Well, mostly you can't.

Additionally, there are some rules about using fmov between registers.

This chapter covers the nuances of using fmov.

Half Precision Floating Point Numbers

Often used in Computer Graphics, half precision floats fit within 16 bits, the size of a short.

The TL;DR here is: Avoid them.

This chapter explains why.

SIMD

There are two types of SIMD instruction sets available in the AARCH64 ISA but the makers of processors are not obligated to implement them on any particular chip.

The first kind is NEON. This is described here.

The second kind of Scalable Vector Extension (SVE) for which we do not have near-term plans to cover. SVE is not implemented on any generally available ARM processor including Apple Silicon.

Demo Programs in this Chapter

In case you want to get right to the code, here are the demos presented in this chapter.

If you receive the assembly language files with a lower case extension, you will need to make the .s extension into .S.

Link	Contents	Converged
Link	Deconstructs floating point values	NA
Link	Demonstrates some rounding in asm	Yes
Link	Demonstrates some rounding in C++	NA
Link	Demonstrates dealing with floating point literals	Yes