assembly - 如何在 x86 汇编中划分浮点数？

Question

当我尝试编写 Heron 算法从 ECX 寄存器中计算 sqrt 时，它不起作用。看起来问题是除法浮点数，因为结果是整数。

我的算法：

 sqrtecx:

MOV EDX, 10 ; loop count
MOV EAX, 5 ; x_0 in heron algorythm
MOV DWORD[EBP-100], ECX  ; save INPUT (ecx is input)    
MOV DWORD[EBP-104], EDX  ; save loop count
jmp     loop
MOV     ECX, EAX ; move  OUTPUT to ECX

loop:

MOV DWORD[EBP-104], EDX ; save loop count
xor edx, edx

MOV ECX, EAX
MOV     EAX, DWORD[EBP-100]
DIV ECX
ADD EAX, ECX
XOR EDX, EDX
mov ecx, 2
DIV ecx

MOV EDX, DWORD[EBP-104] ; load loop count
DEC EDX
JNZ loop

score 8 · Accepted Answer

您需要使用浮点指令集来实现您的目标。您可能会发现一些有用的说明是：

fild <int>  - loads and integer into st0 (not an immediate)
faddp       - adds st0 to st1, and pop from reg stack (i.e. result in st0)
fdivp       - divides st1 by st0, then pop from reg stack (again, push the result in st0)

这是一个简短的示例片段（VS2010 内联汇编）：

int main(void)
{
    float res;

    __asm {
        push    dword ptr 5;     // fild needs a memory location, the trick is
        fild    [esp];           // to use the stack as a temp. storage
        fild    [esp];           // now st0 and st1 both contain (float) 5
        add     esp, 4;          // better not screw up the stack
        fadd    st(0), st(0);    // st0 = st0 + st0 = 10
        fdivp   st(1), st(0);    // st0 = st1 / st0 = 5 / 10 = 0.5
        sub     esp, 4;          // again, let's make some room on the stack
        fstp    [esp];           // store the content of st0 into [esp]
        pop     eax;             // get 0.5 off the stack
        mov     res, eax;        // move it into res (main's local var)
        add     esp, 4;          // preserve the stack
    }

    printf("res is %f", res);    // write the result (0.5)
}

编辑：
正如哈罗德指出的那样，还有一条指令可以直接计算平方根，它是fsqrt. 操作数和结果都是st0.

编辑#2：
我不确定你是否真的可以加载到st0一个立即值，因为我的参考没有明确说明。因此我做了一个小片段来检查，结果是：

    float res = 5.0 * 3 - 1;
000313BE D9 05 A8 57 03 00    fld         dword ptr [__real@41600000 (357A8h)]  
000313C4 D9 5D F8             fstp        dword ptr [res]

这些是位于的字节357A8h：

__real@41600000:
000357A8 00 00                add         byte ptr [eax],al  
000357AA 60                   pushad  
000357AB 41                   inc         ecx

所以我不得不得出结论，不幸的是，在加载和存储数字时，您必须将数字存储在主内存中的某个位置。当然，按照我上面的建议使用堆栈并不是强制性的，实际上您也可以在数据段或其他地方定义一些变量。

编辑#3：
别担心，汇编是一个强大的野兽；）关于你的代码：

mov     ecx, 169    ; the number with i wanna to root
sub     esp, 100    ; i move esp for free space
push    ecx         ; i save value of ecx
add     esp,4       ; push was move my ebp,then i must come back 
fld                 ; i load from esp, then i should load ecx 
fsqrt               ; i sqrt it
fst                 ; i save it on ebp+100 
add     esp,100     ; back esp to ebp

您缺少fldand的操作数fst。看着你的评论，我想你想要fld [esp]，fst [esp]但我不明白你为什么在谈论ebp。ebp应该保存堆栈帧的开头（其中有很多我们不应该搞砸的东西），而esp保存它的结尾。我们基本上想在堆栈帧的末尾进行操作，因为在它之后就是没有人关心的垃圾。
最后，您还应该add esp, 4在计算并保存平方根之后。这是因为在后台push ecx也会sub esp, 4为您推动的价值腾出空间，并且在保存价值时仍然需要一些空间。正因为如此，你也可以避免sub esp, 100和add esp, 100，因为房间已经为你准备好了push。
最后一个“警告”：整数和浮点值以非常不同的方式表示，因此当您知道必须使用这两种类型时，请注意您选择的指令。您建议的代码使用fldand fst，它们都对浮点值进行操作，因此您得到的结果不会是您期望的结果。一个例子？00 00 00 A9 是 169 上的字节表示，但它表示浮点数 +2.3681944047089408e-0043（对于那些挑剔的人来说，它实际上是一个长双精度数）。
所以，最终的代码是：

mov     ecx, 169;   // the number which we wanna root
push    ecx;        // save it on the stack
fild    [esp];      // load into st0 
fsqrt;              // find the square root
fistp   [esp];      // save it back on stack (as an integer)
// or fst [esp] for saving it as a float
pop ecx;            // get it back in ecx

score 5 · Accepted Answer

DIV用于整数除法-您需要FDIV浮点数（或者FIDIV在这种特殊情况下更有可能，因为看起来您从整数值开始）。

score 4 · Accepted Answer

我不完全确定你真正想要做什么，所以现在我假设你想要取整数的浮点平方根。

mov dword ptr[esp],ecx   ; can't load a GRP onto the FPU stack, so go through mem
fild dword ptr[esp]      ; read it back (as integer, converted to float)
fsqrt                    ; take the square root

第一个dword ptr可能是可选的，具体取决于您的汇编程序。

在这段代码之后，结果位于 FPU 堆栈的顶部，ST(0)。我不知道你以后想用它做什么..如果你想把它四舍五入到一个 int 并把它放回 ecx，我建议这样做：

fistp dword ptr[esp]     ; again it can't go directly, it has to go through mem
mov ecx,dword ptr[esp]

我将采用 SSE2 的方式来衡量：

cvtsi2sd xmm0,ecx  ; convert int to double
sqrtsd xmm0,xmm0   ; take the square root
cvtsd2si ecx,xmm0  ; round back to int (cvttsd2si for truncate instead of round)

这样会容易一些。

assembly - 如何在 x86 汇编中划分浮点数？

3 回答 3

Related

Reference