Hotfixes 3.2.2 #1200
Merged
- Remove searching for MKL
- Fix bug when looking up symbols
- Now check for libblas on Ubuntu-based systems
* AF_REVISION was in version.h which is included in defines.h
* Which meant with each new commit, the entire source would have to be rebuilt
* AF_REVISION is only required by src/backend/*/platform.cpp
* So AF_REVISION has been moved into backend/version.hpp and is compiled into the library
* Renamed AF_CMPLR_STR to AF_COMPILER_STR
* Moved AF_COMPILER_STR into version.hpp as it is also only required by backend/platform.cpp
* src/backend/version.hpp is generated by CMake using Version.cmake and version.hpp.in
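The rebuild problem described in this commit can be illustrated with a minimal single-file sketch. It is not the actual ArrayFire layout: `SKETCH_REVISION`, `sketch_revision()` and `sketch_info_string()` are hypothetical names, and the macro stands in for whatever the generated `version.hpp` would define. The point is only that the public side sees a declaration, while the changing revision string lives in one translation unit.

```cpp
#include <cassert>
#include <string>

// What a widely included header would keep: a declaration only.
// Nothing here changes from commit to commit, so includers are not rebuilt.
const char* sketch_revision();

// What a single backend source file would contain.
// SKETCH_REVISION is a hypothetical stand-in for a value that a
// generated header (or a compiler define) would provide.
#ifndef SKETCH_REVISION
#define SKETCH_REVISION "unknown"
#endif

const char* sketch_revision() { return SKETCH_REVISION; }

// Example consumer: builds an info string from the compiled-in revision.
std::string sketch_info_string() {
    const char* rev = sketch_revision();
    assert(rev != nullptr);
    return std::string("revision: ") + rev;
}
```

With this split, a new commit only changes the file that defines `sketch_revision()`, so only that file needs recompiling before relinking.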
Fix leak in CUDA Random
Minor cleanup to FindCBLAS.cmake
* This is similar to the bug in triangle fixed in 144a2db
* /MP adds multiprocess compilation
* /Gm- disables minimal rebuild (this option was being used by default before)
Replaced cudaMemcpy with async version calls in homography
Fix bug in identity cuda plaguing compute 5.2
Visual Studio Multiprocess Compilation options
- Replaced getchar within ifdefs with a new macro
- Cleaned up unneeded "console" options from a few examples
- Removed getchar option for Windows.
- This was already available via cmake
Examples clean up and test fixes
* Errors now generated as function_sign()(file:line): Message
* TODO Add function name to af::exception class in devel branch for v3.3.0
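As a rough illustration of the message layout named in this commit, the sketch below builds a string of the form `function()(file:line): Message`. It is not the ArrayFire implementation: `short_file_name`, `format_error` and `SKETCH_ERROR_MSG` are hypothetical names, and `__func__` (the bare function name) is used here in place of a full function signature.

```cpp
#include <cassert>
#include <cstring>
#include <string>

// Hypothetical helper: drop leading directories so only the file name remains.
// Handles both '/' and '\\' separators.
inline const char* short_file_name(const char* path) {
    assert(path != nullptr);
    const char* slash     = std::strrchr(path, '/');
    const char* backslash = std::strrchr(path, '\\');
    const char* last      = slash;
    if (backslash && (!last || backslash > last)) last = backslash;
    return last ? last + 1 : path;
}

// Hypothetical formatter producing "function()(file:line): message".
inline std::string format_error(const char* func, const char* file, int line,
                                const std::string& msg) {
    return std::string(func) + "()(" + short_file_name(file) + ":" +
           std::to_string(line) + "): " + msg;
}

// Hypothetical convenience macro that captures the call site.
#define SKETCH_ERROR_MSG(msg) format_error(__func__, __FILE__, __LINE__, (msg))
```

Calling `format_error("foo", "/a/b/c.cpp", 12, "bad")` yields `foo()(c.cpp:12): bad`.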
* Also removed the THROW macro
- Needed when liblapack or liblapacke are not symlinked to libopenblas
* Mainly affects the order in which tests are executed
* Cleaner, more organized
* Tests are
    * 1 - N -> CPU
    * N+1 - 2N -> CUDA
    * 2N+1 - 3N -> OpenCL
    * 3N+1 - E -> Unified
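The numbering scheme in this commit can be read as a simple index-to-backend mapping. The sketch below assumes `n` is the number of tests per backend and `e` is the index of the last test; `backend_for_test` is a hypothetical helper for illustration, not something in the test suite.

```cpp
#include <cassert>
#include <string>

// Map a 1-based test index to the backend it runs on, following the
// ranges listed in the commit message:
//   1..N -> CPU, N+1..2N -> CUDA, 2N+1..3N -> OpenCL, 3N+1..E -> Unified
inline std::string backend_for_test(int index, int n, int e) {
    assert(n > 0 && index >= 1 && index <= e);
    if (index <= n)     return "CPU";
    if (index <= 2 * n) return "CUDA";
    if (index <= 3 * n) return "OpenCL";
    return "Unified";
}
```

For example, with `n = 10` and `e = 45`, test 11 maps to CUDA and test 31 maps to Unified.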
Fixes when building with OpenBLAS
This reverts commit 091cdf9.
… compile
* Od on MSVC
* Default is OFF. Flags are set when toggled to ON.
* Resets the flags to default release when toggled back to OFF.
MIN_BUILD_TIME Option for release build with O0 compilation for speed up
[skip ci]
Release Notes {#releasenotes}
v3.2.2
Bug Fixes
CUDA Random number generators
af::select() and af::replace() tests
thrown when printing empty arrays with af::print()
random number generation. Changed the generator to mt19937
now show function, short file name and line number
macro to handle returning errors.
in \ref af::identity() that may have affected CUDA Compute 5.2 cards
Build
option to build with minimum optimization compiler flags resulting in faster compile times
detection by CMake
FreeImage and LAPACK
for unified backend
are now built for unified backend
execution alphabetically
into src/backend. Because the revision changes with every commit, the old layout forced a rebuild of all of ArrayFire.
external project suffix from external -> ext
lapack/lapacke alternative
Improvements
manager (cross issue)
for searching for libaf* for Unified backend on unix-style OS.
in LD_LIBRARY_PATH as described in [Unified Backend](\ref unifiedbackend)
for devices only when required in CUDA Backend
Documentation
appearing for pre and code styles
for converting the output of af::getAvailableBackends() into bools
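A minimal sketch of the conversion mentioned in the last item, under the assumption that af::getAvailableBackends() returns an integer bitmask with one bit per backend. The bit values below (CPU = 1, CUDA = 2, OpenCL = 4) are assumed to mirror the `af_backend` enum and should be checked against `af/defines.h`. The `Sketch*` names and `decode_backends` are hypothetical, and the sketch avoids including ArrayFire so that it stays self-contained.

```cpp
#include <cassert>

// Assumed bit values, mirroring ArrayFire's af_backend enum.
enum SketchBackend {
    SKETCH_BACKEND_CPU    = 1,
    SKETCH_BACKEND_CUDA   = 2,
    SKETCH_BACKEND_OPENCL = 4
};

// One bool per backend, decoded from the bitmask.
struct SketchAvailableBackends {
    bool cpu;
    bool cuda;
    bool opencl;
};

// Decode a backend bitmask (as af::getAvailableBackends() is assumed
// to return) into individual flags.
inline SketchAvailableBackends decode_backends(int mask) {
    assert(mask >= 0);
    SketchAvailableBackends out;
    out.cpu    = (mask & SKETCH_BACKEND_CPU) != 0;
    out.cuda   = (mask & SKETCH_BACKEND_CUDA) != 0;
    out.opencl = (mask & SKETCH_BACKEND_OPENCL) != 0;
    return out;
}
```

In real code the mask would come from `af::getAvailableBackends()`; a value of 5, for instance, would decode to CPU and OpenCL available and CUDA unavailable.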