x86 - x86 示例代码。“从头开始编程”

Question

我正在尝试了解有关汇编编程的更多信息。我正在关注“从头开始编程”一书。我在使用第二个示例程序时遇到了问题。我一直在对其进行故障排除，并且还在线检查了任何已发布的勘误表。

该程序旨在输出所列数字中的最高值。此列表在 data_items 下标记。在终端中运行后，程序应该什么都不做。但是，在您输入 echo $? 后，它应该返回 222（列表中的最高数字）。目前它返回 3 或第一个数字。

我认为以下不是问题

- inc 语句必须工作，否则它将陷入无限循环，因为它永远不会到达 0 以退出循环

- 我认为这可能是 64 位环境中 mov 语句中的 .long 和 4 的问题，但是我确实尝试用 8 代替 4 并且没有运气。

- 除了上述声明之外，我认为这不是一般的 64 位与 32 位问题，因为我在这里查看过，并且共识似乎是 32 位程序在 64 位上运行良好，我已经尝试过我认为可能出现的具体问题。

我认为问题可能出在 ebx 和 eax 寄存器之间的第二个 cmp 语句上。

提前谢谢你，如果我上面的任何或所有假设是错误的，我很抱歉我只是想让你们知道我已经尝试过，研究过什么，以及我的思考过程在哪里。

代码如下。

#PURPOSE:       This program finds the maximum number of a
#           set of data items.
#

#VARIABLES:     The registers have the following uses:
#
# %edi - Holds the index of the data item being examined
# %ebx - Largest data item found
# %eax - Current data item
#
# The following memory locations are used:
#
# data_items - contains the item data. A 0 is used 
#          to terminate the data
#

.section .data

data_items:     #These are the data items
 .long 3,67,34,222,45,75,54,34,44,33,22,11,66,0

 .section .text

 .globl _start
_start:
 movl $0, %edi              # move 0 into the index register
 movl data_items(,%edi,4), %eax     # load the first byte of data
 movl %eax, %ebx                # since this is the first item, 
                    # %eax is the biggest


start_loop:                 # start loop
 cmpl $0, %eax                  # check to see if we've hit theend
 je loop_exit               # if so uncondition jmp to exit
 incl %edi                  # load next value
 movl data_items(,%edi,4), %eax
 cmpl %ebx, %eax                    # compare values
 jle start_loop             # jump to loop beginning

 loop_exit:
  # %ebx is the return value, and it has the highest number availible
    movl $1, %eax           #1 is the exit() syscall
    int $0x80

score 3 · Accepted Answer

这是整个程序中唯一写入的语句%ebx：

movl %eax, %ebx        # since this is the first item, 
                       # %eax is the biggest

循环运行直到%ebx小于%eax，然后立即退出，%ebx等于数组中的第一个值，即 3。

如果我在之后立即添加两行jle start_loop，即

movl %eax, %ebx  # copy new largest value to %ebx
jmp start_loop   # and continue

然后程序按预期执行（使用编译时gcc -nostdlib -nostartfiles -m32 test.s）。

最好像这样编写这个程序：

    .section .rodata
data_items:
    .long 3,67,34,222,45,75,54,34,44,33,22,11,66,0
    .section .text
    .globl main
main:
    xorl  %ecx, %ecx
    movl  data_items(,%ecx,4), %edx
    movl  %edx, %eax
loop:
    # data_items assumed to have at least two elements
    cmpl  %eax, %edx
    cmovg %edx, %eax
    incl  %ecx
    movl  data_items(,%ecx,4), %edx
    testl %edx, %edx
    jne loop
    # return the largest value, which is in %eax
    ret

除了一些小的微优化之外，我还使程序符合 ABI，将其入口点从更改为_start，main以便 C 库有机会初始化，并且我从main（导致调用exit）返回而不是开放编码系统_exit调用。

在这种情况下，ABI 合规性是使用正确寄存器的问题。我从返回main，所以我必须满足调用 main的代码的期望，特别是当main返回时，%ebx、%esi、%edi和%ebp仍然具有与函数调用之前相同的值。如果您使用这些寄存器中的任何一个，您有责任在进入时将它们的旧值推入堆栈，并在退出时再次弹出它们。由于此代码只需要三个寄存器并且本身不进行任何函数调用，因此最好只使用%eax、%ecx和%edx，它们不需要在函数调用后保留其旧值。你的书可能在某个时候解释了这一点。

（64位ABI的寄存器使用规则比较复杂，因为它在寄存器中传递参数，我没有记住。）

尽管有许多指南和教程告诉您相反的情况，但您永远不应该打开代码系统调用。调用 C 库中的包装器，或者在本例中，从main. C 库可以将您与您不想处理的低级 ABI glorp 隔离开来——例如，它总是使用最有效的陷阱序列，它知道正确的系统调用号（我回想几年前的一个问题，其答案归结为“_exit不是 64 位 ABI 中的系统调用 #1”），它知道如何设置errno.

x86 - x86 示例代码。“从头开始编程”

1 回答 1

Related

Reference