-
Notifications
You must be signed in to change notification settings - Fork 133
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Improve payne hanek #210
Improve payne hanek #210
Conversation
src/libm-tester/tester2simddp.c
Outdated
#define CONFIG 1 | ||
#include "helperpurec.h" | ||
#include "norename.h" | ||
#ifdef ENABLE_AVX512FNOFMA |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The changes in this file seem to be unnecessary. If I understand correctly, these are the "nofma" and "pure_c*_scalar" target that are needed for the "determinism" patch, and are unrelated to the improvements of the payne hanek algorithm.
In fact, "rename*nofma.h" is not included in this patch.
May I ask you to revert the changes in this file?
src/libm-tester/tester2simddp.c
Outdated
@@ -126,6 +112,18 @@ typedef Sleef_vector_double_2 vdouble2; | |||
typedef Sleef_vector_float_2 vfloat2; | |||
#endif | |||
|
|||
#ifdef ENABLE_VECEXT |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please revert the change in line 115-126.
@@ -111,11 +99,9 @@ typedef Sleef_float32x4_t_2 vfloat2; | |||
#ifdef ENABLE_SVE | |||
#define CONFIG 1 | |||
#include "helpersve.h" | |||
#ifdef DORENAME |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Why remove this?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Because DORENAME is always defined.
src/libm-tester/tester2simdsp.c
Outdated
@@ -127,6 +121,12 @@ typedef Sleef_vector_double_2 vdouble2; | |||
typedef Sleef_vector_float_2 vfloat2; | |||
#endif | |||
|
|||
#ifdef ENABLE_PUREC |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please restore line 124-129 in the original position.
src/libm/sleefsimdsp.c
Outdated
@@ -2473,6 +2463,7 @@ EXPORT CONST vfloat __tgammaf_u1_finite(vfloat) __attribute__((weak, ali | |||
// gcc -DENABLE_MAIN -Wno-attributes -I../common -I../arch -DENABLE_AVX2 -mavx2 -mfma sleefsimdsp.c ../common/common.c -lm | |||
#include <stdio.h> | |||
#include <stdlib.h> | |||
#include <math.h> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Spurious include?
This patch improves the Payne-Hanek reduction algorithm. The restriction on the input domain of trig functions is all removed.
This patch improves the Payne-Hanek reduction algorithm.
The restriction on the input domain of trig functions is all removed.