ENH: add support for BLIS to numpy.distutils #7294
Conversation
Needs a release note.
@rgommers Could you add the release note?
@charris done. Please don't merge yet though - @matthew-brett wanted to test this.
# [blis]
# libraries = blis
# library_dirs = /home/username/blis/lib
# include_dirs = /home/username/blis/include/blis
This is unusual: normally the include directory is just "include/", and subfolders are referenced via the #include line.
Hm, but that seems to be where BLIS puts its stuff. There should probably be a comment that this needs to be the folder containing cblas.h, which is prefix/blis by default.
Sure, done. Also added a few more notes on compiling BLIS itself.
Seems to work, but the import ends with:

You actually have to put
I also get undefined symbols starting with: I don't get these errors if I specify BLIS in the

Using the
Yeah, BLIS seems to export a Fortran-compatible BLAS interface by default, and only exports the CBLAS interface if specifically configured to do so.
@matthew-brett With your configuration in [blas], numpy will not use it; we need to have
It's of course a bit odd that for

On the other hand, for SciPy it makes little sense. The Fortran interface would be better.
We will need both, and all suitable BLAS libraries provide both. It's just not the default in BLIS, which can and probably should be changed. Maybe one could think about adding two BLAS sections to the config, one for Fortran BLAS and one for CBLAS, but the need hasn't really come up yet.
It's not hard to enable CBLAS in BLIS if that's what we need... @fgvanzee: any reason in particular why BLIS's CBLAS layer is disabled by default?
For reference, here are the BLIS config changes I had to make to get this working on 32-bit:
It works, with
but I'm of course not sure if those are all the changes needed (numpy doesn't exercise that much of BLAS). It's a bit fiddly...
As BLIS defaults to no CBLAS, maybe it is a good idea to add an explicit CBLAS test, like the blas_info class has.
But to encourage testing with BLIS, I'm also fine with merging now; we can sort out the details later.
Here I think you just want a bare
I think this is wrong -- you're saying "make a binary that uses SSE3 instructions, and also all the instruction sets available on the machine where I'm compiling this". For our purposes I think we want something more like
You're not using the sandybridge config, so I think you can discard these changes :-)
These should probably be set in

Also, the default internal integer size should probably be
In a way that can be selected at build time without patching headers, ideally.
@njsmith, CBLAS is disabled in most BLIS configurations by default because very few people I interact with need/use it. :)
@fgvanzee - would it be easy to enable CBLAS with a flag?

Where is the best place to ask about the default SSE2 and SSE3 templates? I think we're really close to being able to use BLIS by default, but at the moment the reference implementation is very often selected by the template-selection algorithm, and that is too slow compared to the alternatives. Is there any help we can offer in exchange?
@matthew-brett: from looking at the autodetection code, I think it's just poorly written. Two obvious problems jump out at me: (a) if the CPU supports AVX but the OS does not, then it falls back on reference instead of falling back on SSE3; (b) since it has a hard-coded table of every CPU model, it needs to be updated every time a new CPU type is released. So e.g. it doesn't know anything about the latest Skylake processors, and therefore they get the reference kernel instead of AVX2 like they should. Instead it should be checking directly for which instruction sets are supported using feature flags ("dunnington" = SSE3, "sandybridge" = AVX, "haswell" = AVX2). Also note that there are no SSE2-specific kernels, but the reference configuration can/should be built with

(@fgvanzee: for context, note that @matthew-brett has been experimenting with building multiple configurations of BLIS and then using CPU-detection code to decide which version to load at runtime.)
@njsmith: The auto-detection code may very well be poorly written, but I did not write it--Xianyi did, during his visit. Now, we were very grateful that he put at least something in place, as prior to his stint here at UT there was zero auto-detection. As for SSE2/3, you are correct: we only have SSE3 kernels, and they are incomplete (real domain only, I think). But I don't know to what extent those systems are properly detected, and based on your comments it sounds like there are gaps. Given that we no longer have the hardware those were developed on, it will be up to others to test that functionality and propose patches.

@matthew-brett: I am rearchitecting BLIS to facilitate various features which are not practical (feasible) presently, including runtime auto-detection of kernels/blocksizes. That will probably result in a redesign of the configure-time build system as well. So, these issues are on our radar, but they take time. But the goal is not to "fill the tree" of support for all hardware. We will need contributions from the community to fill the gaps once the new software architecture is in place.

Again, not sure what you mean by "templates". Maybe you mean configurations? Or kernels?
Sorry - by templates I mean 'configurations', as in the sub-directories in your config directory.

Do you think it is possible, with something like the current state of BLIS, to make a collection of built configurations from which we could select at run-time, to give us reasonably good performance on average across CPUs? If that was possible, then I think you would find a lot of developer interest coming your way from us, from Julia, R and Octave, by which I mean issues and patches.
@matthew-brett: The current BLIS software architecture does not allow one to change configuration information (kernels, blocksizes, etc.) at runtime. As I alluded to, I am working to facilitate this, but it requires a lot of groundwork. (Auto-detection is just one "application" of the general feature. Experts--those who know what they are doing--will be able to switch between custom kernels on-demand, if they wish. I tend to think of all of this under the umbrella of "runtime management" of what we currently think of as "the configuration." The CPUID/hardware-detection side of this is actually the least interesting part, to me, even if it is the most useful to end-users.)
@fgvanzee - sorry - I am not being clear. What we are experimenting with is building all the currently defined x86 configurations into separate libraries, so we have a directory structure something like this:

Then, at runtime, we check the CPU and load the matching library.
Sincere apologies if that came across as criticism -- I'm aware that you didn't write it, and even if you had I wouldn't have considered it a judgement on you. All my code can certainly be improved further too :-)
@fgvanzee - following up here. Let's say we have built each defined x86 configuration as a separate library, as I described above, and we have found some optimal way of choosing between these libraries using the CPU identification. Do you think we would be able to get reasonable performance across a range of CPUs? If not - what would it take for that to be possible?
@njsmith: No offense taken. I just wanted to make clear that it was one of the few parts of BLIS that I did not author, and that I have not even really looked at myself. I will probably try to clean it up eventually, but I don't know exactly when that will be. Furthermore, it will depend on how successfully I can learn the absurd intricacies of CPUID, and I will tell you right now, I may get to a point where it makes me want to jump off of a building. If you or anyone in your circle has expertise in that, and can describe how the CPUID register values should be detected for the modern family of Intel and AMD systems, that would be welcome.

@matthew-brett: Yes, that clarifies things a bit. First, I hope we can agree that your extra-BLIS solution will become unnecessary in the short- to medium-term, since that functionality is planned for implementation within BLIS. Now, to your question: I feel like I'm missing something. If your multi-shared-object approach uses reference and haswell, you will only get good performance on haswell/broadwell systems. If it uses reference, sandybridge, and haswell, you will only get good performance on sandy/ivybridge and haswell/broadwell, and so forth. (The reference configuration will always be slow, but it will always work.) I don't know what your definition of "reasonable range" of CPUs is.
@matthew-brett @tkelman @juliantaylor @fgvanzee: You know, it occurs to me that it would be very straightforward to create our own runtime-autoconfiguring build of BLIS right now, as a temporary measure while Field is rearchitecting the core to support more powerful forms of runtime configuration. This could be done without any hackery to BLIS itself. Just create a new BLIS configuration containing, at minimum:

bli_kernel.h

kernels/runtime-selected.c

The only thing we'd need to change about the core of BLIS is that we'd probably want to modify the existing x86-64 kernels to have more unique names (e.g. renaming the current kernels).

For bonus points:
Of course this would be slightly klugey, but rather minimally: it'd be simple, and would take advantage of the existing BLIS API configuration abstractions (and in particular, since all changes would be restricted to a new directory in
Anything that can be implemented around BLIS in C rather than Python would be much easier for Julia (and R, Octave, SciLua, anyone else) to test and use, so I like the sound of that.
For CPUID - the heavy lifting for https://github.com/matthew-brett/x86cpu is in the C code at https://github.com/matthew-brett/x86cpu/tree/master/src so would be easy enough to recycle into kernel selection. |
@matthew-brett: also, a lot of the complexity there goes away if we don't care about supporting compilers without inline asm (and BLIS very much requires inline asm).
Anywhere you care about MSVC runtime compatibility but don't need Fortran, you should probably already be using clang.
@njsmith: your interim solution sounds fine. Just be advised that you will probably need to rework your solution at least slightly (for example,

EDIT: you've also hit on yet another item on my to-do list, which is to clean up the kernels directory. For example, the
@fgvanzee: is your refactoring-in-progress available anywhere to peek at (e.g. on a personal branch)? And are there any kernels that you know that we should specifically steer clear of until you have time to clean things up more? (Or is there some way that someone external could help with the clean up?) |
@rgommers: sorry this turned into a general chat channel about numpy+BLIS interaction -- I kinda lost track of what's going on with the underlying PR. Should we just merge it?
Can be merged as is, or I can add the
@rgommers: I don't think the CBLAS check is too urgent, just because whether we have it or not, numpy+BLIS is going to remain an experts-only endeavour in several ways for the next while :-). It might well be a good idea to fix it up later, but we can do that as a separate PR?
sure |
Then let's do this |
ENH: add support for BLIS to numpy.distutils
I refactored the CPUID etc. code into:
@njsmith: Sorry for the delay. I became overwhelmed with communication (and still am), and so I just kind of ignored most everything on GitHub. Sometimes I feel like I'm autistic and I do strange things to cope.

Unfortunately, I don't typically use branches for interim work. There certainly hasn't been any reason to so far; I recently did a

@matthew-brett: Thanks for your efforts vis-a-vis the CPUID. That should be quite helpful. We may need to talk about licensing, though. There was a time when it was very important to our project that all of our code be "owned" by people employed by the university, and thus owned by UT. (I know, very much not in the spirit of OSS, but I don't call the shots.) Worst case, we would need to discuss what degree of rewrite I would need to perform to your code for it to qualify as "original" for the purposes of authorship (and thus having the new code qualify as being UT-authored).
Field - is there any license I can put on the code that would allow you to use the code (and still allow me to distribute and modify)? |
@matthew-brett: The answer is probably something very close to whatever is the least restrictive "license" possible. Public domain? Something narrower would probably work, something that granted copyright to UT. I'm not an IP expert, so I don't know exactly how this works. But we still have time to figure it out. I'm not quite ready to pivot to the CPU autodetection stuff yet. But we can definitely revisit this later. That will give me time to huddle with Robert and for us to determine whether I'm being overly conservative, and if not, determine who to contact within UT for further clarification. |
I'm happy to make it public domain if that would help - just let me know. |
@matthew-brett: Many thanks for your flexibility and willingness to contribute. I'll get back to you. |
@fgvanzee: No worries! I think we all understand very well about being overwhelmed by communication :-) I'll look forward to seeing what you've got when it's ready... |
Note: the status of BLIS and some experiments/benchmarks in using it are discussed in gh-5479.
Besides adding a blis_info class plus the corresponding changes to site.cfg.example, a few things that generate spurious logging (checks for empty paths) are cleaned up.

Edit: mention that #5479 is now closed and #7372 has a more extensive discussion of BLIS.