On Fri, 9 Feb 2001, David Blythe wrote:
I did a sample-by-sample comparison against the Intel implementation and (oops) found a bug that I had introduced. The add-with-carry sequence in the PPC MLA code was being scheduled too aggressively, and the carry bit was being lost. The attached patch fixes it.
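To illustrate why a lost carry corrupts the result, here is a minimal portable C sketch of what an addc/adde pair computes on a 64-bit value held as two 32-bit halves. The function name add64_via_halves and its parameters are hypothetical and do not come from the patch; this only models the arithmetic, not the actual MLA macro.

```c
#include <stdint.h>

/* Hypothetical helper (not from the patch): models the PowerPC
   addc/adde pair on a 64-bit value split into 32-bit halves. */
static void add64_via_halves(uint32_t a_hi, uint32_t a_lo,
                             uint32_t b_hi, uint32_t b_lo,
                             uint32_t *r_hi, uint32_t *r_lo)
{
    uint32_t lo = a_lo + b_lo;      /* addc: low-word sum, wraps mod 2^32 */
    uint32_t carry = (lo < a_lo);   /* carry out of the low-word add */

    *r_lo = lo;
    *r_hi = a_hi + b_hi + carry;    /* adde: high words plus the carry */
}
```

Adding 0x00000001:0xFFFFFFFF and 0x00000000:0x00000001 should give 0x00000002:0x00000000. If anything clobbers the carry between the two adds, the high word comes out as 0x00000001 instead, which is presumably the kind of error the sample-by-sample comparison exposed.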
[...]
asm ("addc %0, %2, %3\t\n" \
"adde %1, %4, %5" \
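For prettier assembly output, you might consider '\n\t' instead of '\t\n' at the end of the first line of the asm above. That way the second instruction comes out indented like the first in the generated assembly.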
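A small sketch of the difference, assuming (as GCC normally does) that the compiler writes the template text into its assembly output after a leading tab. The two string constants reuse the instructions quoted above; the names tmpl_tab_newline, tmpl_newline_tab and second_line_indented are made up for illustration.

```c
#include <string.h>

/* Template as quoted: tab, then newline. The first instruction gets a
   trailing tab and the second one starts in column 0. */
static const char tmpl_tab_newline[] =
    "addc %0, %2, %3\t\n"
    "adde %1, %4, %5";

/* Suggested form: newline, then tab. The second instruction is
   indented by the tab that follows the newline. */
static const char tmpl_newline_tab[] =
    "addc %0, %2, %3\n\t"
    "adde %1, %4, %5";

/* Hypothetical check: returns 1 if the character right after the
   first newline in the template is a tab, 0 otherwise. */
static int second_line_indented(const char *tmpl)
{
    const char *nl = strchr(tmpl, '\n');
    return nl != NULL && nl[1] == '\t';
}
```

Both forms assemble to the same code; the change only affects how the output of -S reads.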
Nicolas