
First draft of Spectrum1D class #1

Closed
wants to merge 2 commits into from

Conversation

wkerzendorf
Member

Looking forward to your comments.

@eteq
Member

eteq commented Feb 15, 2012

This is going to be quite ugly-formatted, but I'm going to copy over the comments that went through via e-mail previous to this pull request for posterity:

@wkerzendorf
I have done the first implementation of the Spectrum1D class according to what we chatted about. I have added some things here and there to try out.
https://github.com/wkerzendorf/specutils/blob/spectrum1d/specutils/spectrum1d.py
Here's an example:

from numpy import linspace, ones

test = Spectrum1D.from_dispflux(linspace(4000, 5000, 1000), ones(1000))
# slicing and interpolation work with
test.slice(4200, 4500)
test.interpolate(linspace(4000, 5000, 200))
# arithmetic
test + 5
test + test
5 + test


There are still quite a few simple bugs in there, and it is not completely fleshed out. What are your thoughts on the general structure?

@astrofrog
It's great that the code is starting to take shape! A couple of comments:

  • np.interp is a factor of 10 faster than using scipy's interp1d, and
    removes the scipy dependency.

  • it would be nice to be able to instantiate a spectrum from a table
    object, though I'm not sure whether Spectrum1D should try and guess
    the columns to read, or if they should be specified. Actually, the
    same will apply if you want to read directly from files - it might be
    nice to have some column name guessing, and an option to force it to
    use specific columns?

  • I would personally use 'end' instead of 'stop'

  • The import statements at the top are in the wrong order. From PEP8:

    Imports should be grouped in the following order:

    1. standard library imports

    2. related third party imports

    3. local application/library specific imports

      You should put a blank line between each group of imports.

  • All astropy imports should be relative

  • When adding two spectra, I wonder whether there should be a way to
    specify whether to return a new spectrum which covers the union or
    intersection of the original dispersions?

Oops, ignore my comment about relative imports, since this is an
affiliated package :-)

@crawfordsm
Yes definitely an excellent start!

Just as there is a shortcut for dispersion and flux, should there be a shortcut for disp_units and flux_units?

I was thinking that the docstring for the class should be expanded to include information about what it inherits from NDData (particularly units and meta, since these are available but not immediately obvious unless you are familiar with NDData). I'm not sure if this is unnecessary duplication (or if there is a better way to include it), but should the following be included (mostly copied directly from NDData)?

   Parameters
   -----------
   disp : undefined, optional
       WCS-object containing the world coordinate system for the data or an array containing the dispersion solution to the data (e.g. wavelength array)

   flux : `~numpy.ndarray`
       The actual flux data contained in this `NDData` object.  This should be a 1D array.

   error : `~numpy.ndarray`, optional
       Error of the data. This should be interpreted as a 1-sigma error (e.g.,
       square root of the variance), under the assumption of Gaussian errors.
       Must be a shape that can be broadcast onto `data`.

        .. warning::
            The physical interpretation of the `error` array may change in the
            future, as it has not been intensively discussed. For now assume
            the above description holds, using an `error` property if
            necessary, but feel free to use the most convenient internal
            representation in subclasses

   mask : `~numpy.ndarray`, optional
       Masking of the data; Should be False/0 (or the empty string) where the
       data is *valid*.  All other values indicate that the value should be
       masked. Must be a shape that can be broadcast onto `data`.

   meta : `dict`-like object, optional
       Metadata for this object.  "Metadata" here means all information that
       is included with this object but not part of any other attribute
       of this particular object.  e.g., creation date, unique identifier,
       simulation parameters, exposure time, telescope name, etc.

   units : undefined, optional
       The units of the data.

       .. warning::
           The units scheme is under development. For now, just supply a
           string when relevant - the units system will likely be compatible
           with providing strings to initialize itself.

   copy : bool, optional
       If True, the array will be *copied* from the provided `data`, otherwise
       it will be referenced if possible (see `numpy.array` :attr:`copy`
       argument for details).

   validate : bool, optional
       If False, no type or shape-checking or array conversion will occur.
       Note that if `validate` is False, :attr:`copy` will be ignored.

@wkerzendorf

It's great that the code is starting to take shape! A couple of comments:

  • np.interp is a factor of 10 faster than using scipy's interp1d, and
    removes the scipy dependency.

The nice thing about scipy's interp1d is that it can do much more than just linear. What do others think?

  • it would be nice to be able to instantiate a spectrum from a table
    object, though I'm not sure whether Spectrum1D should try and guess
    the columns to read, or if they should be specified. Actually, the
    same will apply if you want to read directly from files - it might be
    nice to have some column name guessing, and an option to force it to
    use specific columns?

Done! The newest version of the code implements this, very crudely.

So there's now: Spectrum1D.from_array(disp, flux), the same as before just with different names,

Spectrum1D.from_table(table_name, disp_col='disp', flux_col='flux'),
and Spectrum1D.from_ascii(filename, usecols, etc.....) using loadtxt in the background.

  • I would personally use 'end' instead of 'stop'
    I think we should use stop as it is consistent with the nomenclature of a python slice.
  • The import statements at the top are in the wrong order. From PEP8:

Imports should be grouped in the following order:

 1. standard library imports
 2. related third party imports
 3. local application/library specific imports

 You should put a blank line between each group of imports.

I agree, for now this is a rough draft and will be fixed before it's sent out for evaluation.

  • All astropy imports should be relative
    not astropy - as noted in your second email
  • When adding two spectra, I wonder whether there should be a way to
    specify whether to return a new spectrum which covers the union or
    intersection of the original dispersions?

That's right, there needs to be more thought on this. It was just meant as a proof of concept.

crawfordsm: I agree that the docstrings are not very informative yet. But the plan is definitely to get them into a better state.

@keflavich

It's great that the code is starting to take shape! A couple of comments:

  • np.interp is a factor of 10 faster than using scipy's interp1d, and
    removes the scipy dependency.

The nice thing about scipy's interp1d is that it can do much more than just linear. What do others think?

My 2 cents: I don't like using anything with a scipy dependency. If
anything, make the interpolation np.interp by default, and allow
options to change it if scipy can be imported.

@astrofrog

The nice thing about scipy's interp1d is that it can do much more than just linear. What do others think?
My 2 cents: I don't like using anything with a scipy dependency. If
anything, make the interpolation np.interp by default, and allow
options to change it if scipy can be imported.

For example, if linear interpolation is requested (default), you can use numpy, and otherwise you could import scipy on the fly. So to the user, the switch is transparent, but if they don't have scipy installed they get an Exception.
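That transparent switch could be sketched like this (the function and keyword names are illustrative, not from the PR):

```python
import numpy as np

def interpolate_flux(disp, flux, new_disp, kind='linear'):
    """Resample flux onto new_disp. Linear interpolation needs only
    numpy; any other kind tries to import scipy on the fly."""
    if kind == 'linear':
        return np.interp(new_disp, disp, flux)
    try:
        from scipy.interpolate import interp1d
    except ImportError:
        # the user only pays the scipy cost if they ask for more than linear
        raise ImportError("scipy is required for kind=%r interpolation" % kind)
    return interp1d(disp, flux, kind=kind)(new_disp)
```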

@adrn
Member

adrn commented Feb 15, 2012

Adrian and I met this afternoon and discussed the code. Below are the thoughts we had. Before going into them, I'm going to reiterate that I'm coming from a place where I feel we should write to your average (i.e. mostly programming-novice) physicist - not to programming experts. They outnumber us, but we want them to feel as comfortable programming as possible. Also, please take all of this in the spirit of constructive criticism. :)

  1. Instantiator

    There should only be one way to create a Spectrum1D object. We propose this:

    s = Spectrum1D(flux=<flux array>, dispersion=<dispersion array>)
    

    At its most basic, a spectrum is these two arrays. It's up to the user to get those values of course. Generalizing the creation of a spectrum is not possible from a table, ASCII file, or FITS file. For a table you have to specify the column names, so why not this:

    s = Spectrum1D(flux=mytable["flux"], dispersion=mytable["wavelength"])
    

    Text files can be anything, and by passing all of the possible parameters to Spectrum1D, you are merely reimplementing (or wrapping) what already exists. This violates the idea of encapsulation anyway - a "spectrum" should have no knowledge of files. As for FITS, which HDU is requested? Which column? It would be much better to provide examples as snippets to do all of these things, but it's not adding anything to thinly wrap functionality that exists elsewhere.

  2. Be verbose in naming

    Every time I see "disp" I read "display". I always favor more descriptive names. It's self documenting, and (more importantly) someone new to the code will more easily understand the intent.

    We suggest spelling out "dispersion".

  3. SciPy dependency

    There was a little discussion about this. While I strive to keep dependencies to an absolute minimum, it would be a shame not to use something in SciPy if it's available. Given our highly constrained resources in developing this code, having SciPy as a dependency is a worthy tradeoff to implementing code that exists there. Besides, the Enthought distributions are free(ish) and easy to install. If we had full-time developers on this stuff I'd probably have a different opinion, but we don't have that luxury.

  4. Spectrum1D is not an array

    It's important to note that Spectrum1D is not an array; it is an object. What we are proposing is that it's not even a thin wrapper around an array (or pair of arrays), but actually knows what it is and can Do Things. This means that for a given Spectrum1D object s this:

    s[1000:2000]
    s.slice([1000,2000])
    

    is wholly ambiguous. Are we slicing index values? Angstroms? Something else? If we're adding "5" units to the flux, doesn't this impact the error array? If slicing generates a new object, how do we construct it - what about the other properties of the spectrum (e.g. mask)? What if someone subclasses the spectrum and adds properties (they will)? They will be lost. If someone wants to create a new spectrum, this is easy and obvious:

    s2 = Spectrum1D(s.flux[0:1000], s.dispersion[0:1000])
    

    It should then be clear to the user that the other properties are not carried over in this case.

  5. Operations

    By extension of above,

    s1 + s2
    

    is also too ambiguous.

    • What happens when the two spectra are different lengths?
    • What if they have different wavelength arrays?
    • Different units?
    • How are the error arrays/masks/metadata handled?

    While this would be convenient for the most simple cases, it again treats the objects as (homogeneous) arrays.

    We propose there be no operation overloading.

  6. Passing in options as text

    I'm opposed to passing in parameters to objects as free text whenever it can be helped. It leads to errors, it forces the user to go to the documentation, and most importantly, we're much better than ROOT. (If you don't know ROOT, be grateful.) If there are two options, then I would prefer to implement this:

    s.slice(start=0, end=1000, units='disp')
    s.slice(start=0, end=1000, units='pixel')
    

    as

    s.slice_dispersion(start=0, end=1000)
    s.slice_pixel(start=0, end=1000)
    

    It's not like the user can specify something not implemented anyway.

  7. Interpolation

    There are of course many types of interpolation. Since these are complex objects, any interpolation must be aware of (and appropriately handle) error arrays and masks. Again, I would not generate a new object from an interpolation method (for the same reasons as above). If the user needs to save the original, then in the examples we show:

    import copy
    s_orig = copy.deepcopy(s)
    s.interpolate()
    

    Interpolate will modify the internal data. I've tried to implement "reverting" in prior code that I've written, and it's rather difficult to get right. I'd say just make a copy. Besides, the user probably just recently created the Spectrum1D objects anyway, so they'll have the data handy.

    Interpolate is a little tricky since it's awkward to take an existing interpolate API and just reproduce it in our local method. We propose a few interpolations methods that rewrite the internal flux, dispersion, mask, and error arrays as straightforward cases, and leave anything more complicated for the user to do themselves, e.g.:

    s.interpolate_linear(new_dispersion=xxxx)
    s.interpolate_bspline(new_dispersion=xxxx)
    

    Nothing is returned (though an exception could be thrown).

  8. Error arrays

    To properly handle the error array in the interpolation (and for other purposes), the kind of error must be known. Is it an inverse variance? Variance? Something else? We propose a few classes like this:

    class SpectrumError(object):
        self.values # array
    
        @abstractmethod
        def interpolate(self, xxx):
            pass
    
    class SpectrumInverseVarianceError(SpectrumError):
        def interpolate(self, ...):
            # do work!
    
    class SpectrumVarianceError(SpectrumError):
        ...
    class Spectrum...Error(SpectrumError):
        ...
    

    The error object will not only contain the error array, but any metadata that is required to interpret it. We'll develop an API such that if someone wants to write their own error object and handle things their own way (i.e. per discipline), then this will be easy to do, be self-contained, and simply plug into our structure. Thus, s.error would only accept objects subclassed from "SpectrumError".

  9. Plotting

    As I mentioned on the telecon, I believe that any NDData object should be plotted with a minimum of effort. To that end, a "smart" plot command would plot based on the parameters given:

    s.plot() # display a plot on screen
    s.plot(filename="spectrum_plot.pdf") # writes a pdf of the plot to the given path/filename
    s.plot(filename="spectrum_plot.png") # writes a png of the plot to the given path/filename
    

    To work with matplotlib, we propose to also be able to plot to a pre-created Axes() object. Something along the lines of the following, but we haven't fully thought through this implementation yet:

    spec_axes = s.axes() # return a matplotlib.Axes() object of the spectrum
    spec_axes.set_title("Spectrum Plot")
    spec_axes.set_xlabel("wavelength")
    
  10. Smoothing

    As with interpolation, we would modify the internal structures. We propose:

    s.smooth_boxcar(width=10)
    s.smooth_gaussian(mean=m, sigma=s, height=h)
    

    We can demonstrate custom smoothing functions as a snippet like this:

    def my_smoothing_function(self):
        """ template code here with explanation, e.g.
            don't forget to modify the error object, etc.
            appropriately
        """
        # implement my smoothing
        # modify error object
        # do the right thing
    
    s.my_smooth = my_smoothing_function
    s.my_smooth(...)
    

    This obviates the need for a subclass (and is a nice feature of Python).

  11. SpectrumCollection()

    I raised the possibility and we briefly discussed having a spectrum collection object. The intent would be that a large number of spectra would share a single dispersion array, but that these details would be hidden in the implementation of the collection object. That means I'd like to have a class that contains (or at least supports) having an empty dispersion array. We propose a superclass, e.g.

    class Spectrum1DBase(NDData):
        ...
    
    class Spectrum1D(Spectrum1DBase):
        ...
    

    I'd like to require for Spectrum1D that "flux" and "dispersion" be defined, and this will help

Comments and feedback certainly welcome. :)

Cheers,
Demitri

@keflavich
Contributor

Re: smoothing

The proposed approach is:

s.my_smooth = my_smoothing_function
s.my_smooth(...)

I believe to add a function as a class method, you need to do something ugly like:

import types
s.my_smooth = types.MethodType(my_smoothing_function, s)  # binds the function to the instance s

Is there any nicer way to implement this? If not, I'd argue against that approach; something more like atpy's reader registration feature would be preferable.

@demitri

demitri commented Feb 15, 2012

@keflavich Re: adding a function to an existing class, here's a toy example:

class A(object):
    def __init__(self):
        self.data = list(range(10))

def addFiveFunc(self):
    self.data = [x + 5 for x in self.data]

# attach the function to the class; it becomes a bound method on instances
A.addFive = addFiveFunc

a = A()
print(a.data)
a.addFive()
print(a.data)

@astrofrog
Member

1 - I agree with this, it does indeed not make sense to initialize from anything other than two 1-D sequences (numpy array, list, tuple), because it is easy enough to specify two columns from a table, from a file, etc. and Spectrum1D should not really be implementing I/O. I think this does not mean that in future, one could not imagine having a 'read' method for well defined 1-D spectrum specific formats that don't have any ambiguities though.

2 - Also agree, I keep stumbling every time I see 'disp' too!

3 - Astropy has the requirement that if scipy is not available, it should not break the imports, so if scipy is not available, the interpolate method would have to be disabled. Therefore, I am suggesting that for linear interpolation, np.interp should be used so that at least basic interpolation is available if scipy is not installed. But I'm all for allowing more complex interpolation if scipy is installed.

5 - I agree that operations are too ambiguous. I think there should be functions such as:

    add_spectra(s1, s2, range='inner')

which allows the final range to be specified for example.
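A rough sketch of what such a function could do for the 'inner' (intersection) case, assuming bare numpy arrays and linear resampling onto the first spectrum's grid (all of this is illustrative, not an agreed design):

```python
import numpy as np

def add_spectra(disp1, flux1, disp2, flux2, range='inner'):
    """Add two spectra over the intersection of their dispersion ranges,
    resampling the second onto the first spectrum's grid."""
    if range != 'inner':
        raise NotImplementedError("only range='inner' is sketched here")
    disp1, disp2 = np.asarray(disp1), np.asarray(disp2)
    lo, hi = max(disp1[0], disp2[0]), min(disp1[-1], disp2[-1])
    keep = (disp1 >= lo) & (disp1 <= hi)   # points inside the overlap
    new_disp = disp1[keep]
    new_flux = np.asarray(flux1)[keep] + np.interp(new_disp, disp2, flux2)
    return new_disp, new_flux
```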

6 - I agree with having two methods instead of one with a text option, but I'm opposed to referring to the indices as 'pixels'. I would use slice_index. Not all spectra will be defined on pixels (such as model spectra).

7 - This is basically down to the debate between in-place modifications and returning the modified value. I can see arguments both ways...

9 - +1 for the plot and axes methods.

10 - I wonder whether for smoothing we couldn't have:

    s.smooth(type='boxcar', width=10)
    s.smooth(my_custom_function)

etc which would be easier than having people assign their own function to the class.

@astrofrog
Member

Regarding copying - if we did go with methods that modify the spectrum in place, we could define a copy() method that would remove the need for the user to do:

import copy
s2 = copy.deepcopy(s1)

Instead, they could just do:

s2 = s1.copy()

@demitri

demitri commented Feb 16, 2012

@astrofrog

#3. OK, I did not know that. I think that's the right approach. I'd suggest throwing a custom SciPyDependency exception that contains more detail/explanation for the user as appropriate.

#6. I like index better as well.

#7. I considered my typical case: I have a spectrum, I change the redshift, I interpolate, etc. If I'm given a new spectrum at each step, that's a lot of object creation/memory usage/property copying. I think returning a new array as might be typical in IDL is the right choice, but these are objects and I'd want to minimize object creation, particularly if I'm going to analyze thousands of spectra (I am).

#10. As with point 6 above, I prefer not passing in a free text value as a parameter. Nothing is being saved in the implementation: it will be one long if/elif/elif statement. The user can only pass in values that we've implemented anyway, so to me

s.smooth_boxcar(width=10)

is cleaner than:

s.smooth(type='boxcar', width=10)

The custom option could be

s.smooth_custom(**args)

Or something.

But I'm open to suggestion. Providing a custom... oh, that's the answer. There's a SmoothSpectrum object, we define an API, and they implement it. They pass the object to the Spectrum1D class and it uses it to do the work. OO, baby!

Re copying: The reason I suggested the deepcopy method is that it's standard Python, and I think it's always best to use what's out of the box. This way people will pick up a tool that they can apply everywhere.

@eteq
Member

eteq commented Feb 16, 2012

@demitri - github's markup edited your 3. and so on to be a neat ordered list from 1-4... but of course that's horribly confusing because it's supposed to be 3,6,7,10... So I edited your comment to be #3. and so on instead of 3.

EDIT: github auto-marks things like #1 to link to the pull request... so I think @astrofrog's 1 - is the only safe way to escape the markup!

@eteq
Member

eteq commented Feb 16, 2012

Now my own list (and thanks @adrn for the well-organized discussion points!):

1 - I mostly agree with @astrofrog (and @adrn and @demitri) - we don't want any of the io stuff actually implemented here... The from_table is a bit less clear to me - it seems needlessly verbose to type Spectrum1D(table['wavelength'], table['flux']) when I could type Spectrum1D.from_table(table). But then again, perhaps that should also be considered "io", and an io.from_table function that we make later could accept either the filename of a table OR a Table object.

Put another way, all the class functions @wkerzendorf proposes could be moved to specutils.io.from_* and I think then everyone's happy?

2 - Agreed, except that "dispersion" is a bit long to type for something we may be using a lot (in an interactive/ipython setting). What about having dispersion be the actual name (which we will use in all examples and documentation), and add in a very-short alias like x to make interactive work quicker/easier (with a clear warning somewhere that it should not be used in a code/library setting)? Or is that too much rope to hang oneself with?

3 - +1 for @astrofrog's suggestion

4 - I'm not sure what @adrn is saying here - is this a general comment or something about this particular implementation? Are you saying you think we should not allow s[0:1000] or similar? If so, I agree - I was persuaded in the conference call that both that and s(0,1000) are a bad idea, and s.slice(...) is better.

5 - I'm not as sure about this one - I think arithmetic operations with scalars should always be allowed to alter the flux - that is, newspec = spec * 5 should be allowed, and it just gives you a copy of spec whose flux has been multiplied by 5 (with errors updated assuming that 5 is 5 +/- 0).

Also, I think it should be safe to do newspec = s1 + s2 as long as the dispersion and units match. If they don't, an exception gets raised. There are a variety of use cases where I want to subtract off matched spectra that have been shifted to the appropriate axis or the like. It also allows the usage newspec = s1 - s2.interpolate(s1.dispersion) which gets completely around the need for an add_spectrum function.
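The matching-grids rule could be enforced in __add__; a minimal sketch (the class shape is illustrative, and error propagation is left out):

```python
import numpy as np

class Spectrum1D:
    """Toy sketch: addition is allowed for scalars, or for spectra
    whose dispersion grids match exactly; anything else raises."""
    def __init__(self, dispersion, flux):
        self.dispersion = np.asarray(dispersion)
        self.flux = np.asarray(flux)

    def __add__(self, other):
        if isinstance(other, Spectrum1D):
            if not np.array_equal(self.dispersion, other.dispersion):
                raise ValueError("dispersion grids do not match")
            return Spectrum1D(self.dispersion, self.flux + other.flux)
        # a scalar just shifts the flux of a copy
        return Spectrum1D(self.dispersion, self.flux + other)

    __radd__ = __add__  # makes 5 + spec work too
```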

6 - I agree with using the method name instead of a parameter setting, and +1 to @astrofrog's suggestion

7 - I like the idea of an interpolate_* class of functions and the general approach @adrn suggests. It might also be worthwhile to include an interpolate function with a mode or similar argument, so that a call like interpolate(..., mode='foo') internally finds the interpolate_foo function... but I could go either way on that.

However, I'm not sold at all on modifying internal state. Why not instead have those functions make a copy themselves, and return the copy? Then you needn't ever worry about accidentally overwriting your raw data... I guess in this case (unlike @astrofrog), I don't see the virtue of in-place modification (although I could be sold on it). If there's a lot of disagreement here, the best thing is probably to have an inplace parameter that, if False, returns a new copy, and if True, modifies the spectrum in-place.
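The inplace-flag compromise might look like this sketch (all names are illustrative; only flux and dispersion are handled):

```python
import numpy as np

class Spectrum1D:
    """Toy sketch of an interpolation method with an inplace switch."""
    def __init__(self, dispersion, flux):
        self.dispersion = np.asarray(dispersion, dtype=float)
        self.flux = np.asarray(flux, dtype=float)

    def interpolate_linear(self, new_dispersion, inplace=False):
        new_flux = np.interp(new_dispersion, self.dispersion, self.flux)
        if inplace:
            # overwrite this spectrum's arrays
            self.dispersion = np.asarray(new_dispersion, dtype=float)
            self.flux = new_flux
            return self
        # default: leave self untouched and hand back a new object
        return Spectrum1D(new_dispersion, new_flux)
```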

8 - I definitely like what @adrn proposes here. Unfortunately, it steps on the work that needs to be done in the nddata package of the Astropy repository. Spectrum1D needs to inherit from nddata.Nddata, and therefore must follow those conventions of what the error attribute means. The good news is that no one else is likely to take up this challenge for nddata until we do, so we can be the driving force behind what happens there as long as we come up with a scheme compatible with errors for other (non-spectral) types of Nddata.

9 - +1 here - plot is a must.

10 - Whatever we do here should absolutely be consistent with what is done for interpolate_*, so this should probably be just one discussion. See my next comment about my concerns with using class attribute setting, though.

11 - I have no strong opinions here - those that do should set the rules :)

Whew!

@eteq
Member

eteq commented Feb 16, 2012

@astrofrog @adrn re: copying:
+1 to @astrofrog's suggestion of copy method - it makes it all a lot easier to deal with interactively.

@demitri @keflavich @adrn re:smoothing and adding methods:

I think it's dangerous to encourage the A.addFive = addFiveFunc form. It does work, but modifying an already-implemented class is asking for trouble - what if I have two spectrum objects, and I want to do different smoothing methods for each? If I'm paying attention I could do

Spectrum1D.my_smooth_1 = smooth_func_1
Spectrum1D.my_smooth_2 = smooth_func_2
s1.my_smooth_1()
s2.my_smooth_2()

But novice users following documentation are more likely to do

Spectrum1D.my_smooth = smooth_func_1
s1.my_smooth()
Spectrum1D.my_smooth = smooth_func_2
s2.my_smooth()
... other stuff ...
s3.my_smooth()

And then get confused which one they're using. One solution, as @keflavich suggested, is to attach smoothing methods to instances, but I agree that's very clunky.

So I like @demitri's final suggestion of having a SpectrumSmoother (better name than SmoothSpectrum because it's clear that's an action rather than just a smooth spectrum) API, with the addition that I would suggest including a registry - then you can either do:

class MySmootherClass(SpectrumSmoother):
    ...

s1.smooth_custom(MySmootherClass)

or

class MySmootherClass(SpectrumSmoother):
    name = 'cool smoother'
    ...
Spectrum1D.register_smoother(MySmootherClass)

s1.smooth_custom('cool smoother')
s2.smooth_custom('cool smoother')
...
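A bare-bones version of that registry pattern, with every name here assumed rather than agreed:

```python
class SpectrumSmoother:
    """Base API: subclasses implement smooth(flux) -> new flux list."""
    name = None

    def smooth(self, flux):
        raise NotImplementedError

class Spectrum1D:
    _smoothers = {}  # registry shared by the class

    def __init__(self, flux):
        self.flux = list(flux)

    @classmethod
    def register_smoother(cls, smoother_cls):
        cls._smoothers[smoother_cls.name] = smoother_cls

    def smooth_custom(self, smoother):
        # accept either a registered name or a SpectrumSmoother subclass
        if isinstance(smoother, str):
            smoother = self._smoothers[smoother]
        self.flux = smoother().smooth(self.flux)
```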

@eteq
Member

eteq commented Feb 16, 2012

One final comment based on @wkerzendorf's PR here: we definitely should not use the wcs attribute to store any information right now - that attribute will be used in the future of nddata to mean something specific. Once it's fleshed-out, we can derive dispersion from it, but for now dispersion should be independent.

@eteq
Member

eteq commented Feb 16, 2012

@adrn I know we've kind of got a free-form discussion going, but maybe you should consider creating a second pull request based on adrn/specutils@bc8dbd8 ? Although maybe we want to avoid splitting the discussion at this point - your call.

@astrofrog
Member

@demitri - regarding the copy method - as you said, we are catering for astronomers, not programmers, so I personally feel that:

import copy
s2 = copy.deepcopy(s1)

is more clunky than

s2 = s1.copy()

In fact, we can push for most Astropy objects to have this method for consistency.

I think at this stage, the fundamental debate is whether Spectrum objects should be modified in place, or copied and returned. Python lists follow the former:

l = [1,2,3,4,5]
l.sort()

does not return a copy. To get a sorted copy, the user has to call the built-in:

sorted(l)

The value of in-place modification is that you save memory, and if you are doing e.g.

s.plot()

after each operation, you would be creating a new plot rather than updating the existing one. If we did have in-place modifications, we should provide all methods as functions which do return copies, as for sort above. Numpy arrays of course behave differently, and often return a copy. I'm sure there are arguments both ways, and it seems to me that is one of the most important points to discuss right now.

At this stage, I would be against opening another pull request, as it will scatter the discussion, but if there is a fundamental divide in the group regarding in-place modifications and returning copies, then ultimately I say we implement both and submit it to the list to see what people think (but I think it's much better to have a tangible example that people can try out, than just having a theoretical discussion).

@adrn
Member

adrn commented Feb 16, 2012

Regarding a new pull request, I also think we should avoid splitting discussion at least for now. When we come to some more solid conclusions about the suggestions, I'll create an updated list and that can be appended to a new pull request.

Here are some thoughts from me:

1 - @eteq regarding your point about from_table() -- I think the idea here is that Spectrum1D(table['wavelength'],table['flux']) vs. Spectrum1D.from_table(table, columns=[0,1]) (or similar) doesn't save you anything, because if you have non-standard column names you still end up specifying the columns. But regardless, I agree that it probably belongs somewhere in io.
2 - @eteq I agree that it is long, but I don't know how I feel about having both dispersion and x. What do others think?
3 - @astrofrog +1 to your suggestion
4 - @eteq I think the point is just what you say: we shouldn't allow spectrum[0:100] because (as a general comment) a Spectrum is not an array!
5 - @eteq I don't have strong opinions here, I think allowing it to work under some tight constraints seems valid to me.
6 - @astrofrog I also like index instead of pixel.
7/10 - @eteq Hm, I think we should think these implementations through a bit more carefully
8 - @eteq Ok cool, so I think the way I'd like to proceed is to implement the SpectrumError class, and then see what things can be generalized / moved up to a superclass. Does that sound reasonable?
9 - Agree with everyone's comments.
11 - I know @demitri has some ideas here, and I can certainly contribute -- we both come from the SDSS camp where it's sometimes convenient to deal with a full 'plate' of spectra, which all share the same dispersion axis.

Great comments everyone -- loving the discussion!

@astrofrog
Member

Regarding convolution, I don't understand the need for a SmoothSpectrum class. Why not just allow the user to specify a smoothing function? (would be accessible to more people that way)

@keflavich
Contributor

Topic: inplace vs. copy - I'd like to see some methods that do both explicitly. For example, I'd like to see X and Y unit conversion like:

spec.dispersion.convert_to_unit('microns') # inplace
spec.dispersion.view_as_unit('microns')     # copy
spec.flux.convert_to_unit
spec.flux.view_as_unit
spec.view_with_units(dispersion='microns', flux='janskys') # copy of whole spectrum
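As a sketch of how the copy-vs-inplace pair above might share one implementation (all names here are hypothetical, and the conversion table is a stand-in for a real units framework):

```python
import numpy as np

# Hypothetical sketch of the inplace-vs-copy unit API proposed above.
# The conversion table below is a stand-in for a real units framework.
_LENGTH_IN_ANGSTROM = {'angstrom': 1.0, 'nm': 10.0, 'micron': 1.0e4}

class Dispersion:
    def __init__(self, values, unit='angstrom'):
        self.values = np.asarray(values, dtype=float)
        self.unit = unit

    def convert_to_unit(self, unit):
        """Rescale the stored values in place."""
        self.values *= _LENGTH_IN_ANGSTROM[self.unit] / _LENGTH_IN_ANGSTROM[unit]
        self.unit = unit

    def view_as_unit(self, unit):
        """Return a rescaled copy, leaving this object untouched."""
        new = Dispersion(self.values.copy(), self.unit)
        new.convert_to_unit(unit)
        return new

disp = Dispersion([4000.0, 5000.0], 'angstrom')
nm_view = disp.view_as_unit('nm')   # copy: values become [400., 500.]
disp.convert_to_unit('micron')      # in place: values become [0.4, 0.5]
```

The in-place method carries all the logic; the copying method is a two-line wrapper, so the two never drift apart.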

On other things, like interpolation and smoothing, I'm ambivalent and therefore would like to see both options implemented e.g. with an "inplace" flag.

I'm entirely in favor of the .copy() method

Topic: smoothing methods - I like the idea of registering smoothing functions, but agree with Tom that there's no need for a class (just a clear specification for what the function accepts and returns).
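A registry along those lines could be as simple as a dict mapping names to callables — a sketch only; the registry and function names are made up for illustration:

```python
import numpy as np

# Sketch of registering plain smoothing functions instead of a class
# hierarchy; SMOOTHERS, register_smoother and smooth are hypothetical names.
SMOOTHERS = {}

def register_smoother(name, func):
    """func takes (flux, **kwargs) and returns a new flux array."""
    SMOOTHERS[name] = func

def boxcar(flux, width=3):
    kernel = np.ones(width) / width
    return np.convolve(flux, kernel, mode='same')

register_smoother('boxcar', boxcar)

def smooth(flux, method='boxcar', **kwargs):
    return SMOOTHERS[method](flux, **kwargs)

smoothed = smooth(np.array([0., 0., 3., 0., 0.]), 'boxcar', width=3)
# each point is replaced by the mean of its 3-point neighbourhood
```

Any function matching the specification can be dropped into the registry, so no SmoothSpectrum class is needed.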

@wkerzendorf
Member Author

First of all, thanks for providing a working example, that really helps when trying out different implementations. Sorry for the long delay in answering - I was travelling.

I hope everything that goes on here on the list is taken in the spirit of constructive criticism ;-).

Well, here are my suggestions on the different points.

Instantiator 1

I agree 100% there should only be one way to instantiate the Spectrum1D object. In my opinion this is __init__(self, data, error=None, mask=None, wcs=None, meta=None, units=None, copy=True, validate=True).

What I merely did was include the read-in methods that you guys mentioned within the Spectrum1D object. Obviously we could put them somewhere in specutils.io, but why? IMHO, they do belong to Spectrum1D as they are only useful with a Spectrum1D object. It would obviously be reflected in the docstring of Spectrum1D how to use these read methods.

A spectrum dispersion solution is in the core a world coordinate system. If you reduce spectra with IRAF or other software for that matter, one often fits some polynomial to get a mapping between pixel and dispersion space. I think it's not a good idea to just force every user to map out his WCS into an array, as there's definitely a loss of information that is important in some cases.
In addition, the current implementation (just using flux and dispersion arrays) offers no way of specifying what units the dispersion axis is in. For example, I would like to multiply my spectrum by a filter curve and subsequently integrate it to get a flux (I think Adam suggested this in one of the emails as well). I believe all the attributes in NDData are important for Spectrum1D and should therefore be used in Spectrum1D.
Of course we could include the units in such an implementation, but as the wcs will have that anyway I don't see a reason to duplicate the effort.

As for my implementation, it just sets the WCS equal to the lookup table (dispersion array) for now. But this is only a temporary measure until there's a proper WCS in place that deals with lookup tables. With a proper WCS class, when .disp is requested it would return a lookup table generated from the true WCS.

#verbosity 2

I agree in most cases to be verbose. I think, however, that disp is fine for now. We spoke at great length about what we wanted to call that axis (and after 'wave' was shot down, and rightly so ;-) ), we decided between dispersion and disp. disp won and I still think that's a good name. It would be at a prominent place in the Spectrum1D docstring. We could have both, I guess.

#Scipy 3

I share your opinion on that one, but it seems most of the other developers don't (as there seems to be some problem installing it on older systems). It was decided to try to use only numpy wherever possible. And I think we can get by with that most of the time.

#Slicing 4
I believe that was taken a little bit out of context. My slice operations had a switch for what units to use when slicing. The essence of the Spectrum1D object is the flux array. I believe it is pretty clear what is meant by slicing (in the sense that we cut out a bit of the array, not in the sense of what values we are slicing on). I also believe that slicing should always return a new Spectrum1D object. Many other things in python work that way (slicing ndarrays, slicing lists, ...). I have had great success with overloading the __getitem__ operator and using this as slicing in dispersion units. It's a simple and obvious convention that many people in my old department liked. I propose having the overload in addition to slice.
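For concreteness, a minimal toy sketch of that __getitem__ overload (assuming a monotonically increasing dispersion array; the class shape here is illustrative, not the real implementation):

```python
import numpy as np

# Toy sketch of the proposed __getitem__ overload: square brackets slice
# in dispersion units (assumes a monotonically increasing dispersion).
class Spectrum1D:
    def __init__(self, disp, flux):
        self.disp = np.asarray(disp)
        self.flux = np.asarray(flux)

    def __getitem__(self, key):
        # key.start / key.stop are dispersion values, not array indices
        lo = np.searchsorted(self.disp, key.start)
        hi = np.searchsorted(self.disp, key.stop)
        return Spectrum1D(self.disp[lo:hi], self.flux[lo:hi])

spec = Spectrum1D(np.linspace(4000, 5000, 1001), np.ones(1001))
sub = spec[4200:4500]   # new object covering 4200-4500 Angstrom
```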

#Operations 5

@eteq has made a good point that at least adding constants and adding spectra on the same wavelength grid should be allowed. I have also had a lot of success with interpolating spectra onto a common wavelength grid. pysynphot (I think) also does this, and many people like that. The meta dictionary could store the settings for how the arithmetic behaves (e.g., arithm_mode = 'inner range' or equivalent). If the current implementation of addition does not suit a user, they can always make other arithmetic functions or subclass it. However, I still believe that an "inner range" implementation will suit most people for simple arithmetic.
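A hedged sketch of what such an "inner range" addition might look like — the function name and the choice of regridding are mine, not a settled design:

```python
import numpy as np

# Hypothetical "inner range" addition: interpolate both spectra onto the
# overlapping part of their dispersion grids, then add the fluxes.
def add_inner_range(disp_a, flux_a, disp_b, flux_b, n=None):
    lo = max(disp_a[0], disp_b[0])    # start of the overlap
    hi = min(disp_a[-1], disp_b[-1])  # end of the overlap
    if n is None:
        n = len(disp_a)
    common = np.linspace(lo, hi, n)
    total = np.interp(common, disp_a, flux_a) + np.interp(common, disp_b, flux_b)
    return common, total

d1 = np.linspace(4000, 5000, 101)
d2 = np.linspace(4500, 5500, 101)
common, total = add_inner_range(d1, np.ones(101), d2, 2 * np.ones(101))
# common runs over 4500..5000, the overlap of the two grids
```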

Options as text 6

I don't quite get what the problem with text options is. Your suggestion is to have the text stored in the function name rather than as a passed-in option to a standard function. For many things this will duplicate code (e.g. smooth_boxcar and smooth_gaussian will probably both rely on the same convolution library, just passing different parameters to it). Secondly, if there are different text options, users can look them up easily in the docstring of the common function (e.g. spectrum1d.convolve). If there are different function names then the user will have to go to the documentation to figure out what is available. If one is worried about too many if statements, we can just use a dictionary. I might misunderstand, but if we can't use text options we have to use integer IDs (e.g. for convolution boundary treatment: 1=wrap, 2=mirror, 3=const, etc.). This would go against item 2 in your list asking for verbosity. Finally, many scientific python libraries like numpy, scipy and matplotlib make extensive use of text options.

interpolation 7

I believe that most operations on the spectrum should return a new Spectrum1D object, as does the interpolation function in both numpy and scipy. I think most reduction and analysis software will create a new object on an operation like interpolation or convolution for that matter. As for memory: if the old spectrum is not needed any more, one can just overwrite it (e.g. spectrum = spectrum.interpolate(arange(6000, 7000))). Internally a new array is created anyway if we use numpy or scipy functions (which then overwrites the old one).
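In sketch form (a toy function, not the real API): np.interp allocates the new flux array either way, so returning a new object costs nothing extra, and rebinding the name discards the old spectrum.

```python
import numpy as np

# Toy sketch: interpolation returns fresh arrays (np.interp allocates a
# new array anyway); rebinding the names discards the old spectrum.
def interpolate(disp, flux, new_disp):
    """Return (new_disp, new_flux) without modifying the inputs."""
    new_disp = np.asarray(new_disp, dtype=float)
    return new_disp, np.interp(new_disp, disp, flux)

disp, flux = np.linspace(4000, 8000, 4001), np.ones(4001)
disp, flux = interpolate(disp, flux, np.arange(6000, 7000))  # overwrite
```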

Error arrays 8

I do very much agree that this is a very important point, but believe that this is more of an NDData issue than Spectrum1D. I believe we should save this issue for later and now discuss the other issues at hand.

Plotting 9

In item 1 there was talk that we should not include io in the spectrum object as it is not essential to the spectra (I wonder if essential is the right word). I believe that plotting faces exactly the same arguments. As was discussed in the Google+ session, everyone wants to plot a spectrum, but everyone also wants to load a spectrum from a fits file, text file, etc. We should think about where to draw the line on this. Maybe we should have a specutils.plot module which has lots of options (like line id etc.).

Smoothing 10

Similar to interpolation, I believe it should return a copy. I think this is an issue (however important) we should address after figuring out initialization routines and so on.

SpectrumCollection 11

Good point! There I again believe that the WCS (and using the NDData init function) will help us a lot as it can store only one dispersion solution and make it available for all spectra in the collection.

The other main point you bring up here is: Spectrum1DBase. That might be a good idea to have a very simple base class from which we inherit other Spectrum classes (maybe OpticalSpectrum, Xray spectrum). It's definitely worth thinking about it.


So to proceed: I think one of the main problems is how to instantiate the class (including IO). I created a discussion thread here () to separate this issue.

Thanks again for the well structured list and bringing up the suggestions!

@wkerzendorf
Member Author

Sorry forgot the link to the other issue: #2

@astrofrog
Member

It sounds to me like now that we've all had a chance to ponder these issues, a telecon would be a more efficient way to go through each point and make a decision in each case?

Regarding I/O, this is not really important because it would only involve adding some methods, so the decision regarding that can wait until later anyway. More important is the general behavior of the class (in place mods vs returning copies - and I don't believe we should have a flag to choose that, we should just pick one and go with it).

@demitri

demitri commented Feb 19, 2012

I agree a telecon would be more productive. Before we have that though I'd like to propose we start making a list of features we'd like Spectrum1D to implement, so we can discuss that at the same time. I think the progression should be "desired functionality" -> "API" -> "implementation", and we seem to be working the other way around (although lots of great discussion came from it!). I'll start a new pull request for that.

Argh. Not a pull request. I just want to create a new thread. I don't get this GitHub thing.

@astrofrog
Member

@demitri - so you mean basically we first should decide what we want a typical user script to look like, and then worry about making it work? We could even do it the fun way and do test-driven development (where we write the tests using what the code should look like, then implement it to make all the tests pass!).

@astrofrog
Member

Actually, I'm skipping a step again, I agree we should worry about functionality before even API...

@demitri

demitri commented Feb 19, 2012

@astrofrog It sounds reasonable, yeah. :) There are several reasons for this approach. At the very least, when someone has time to write code there will be clear pieces to implement, essentially a to-do list. As you can guess, I value a clean API very highly. If it's not clear or easy to use, it doesn't matter how good the code underneath is. People (justifiably) won't use it. Implementation can always be changed underneath.

@wkerzendorf
Member Author

I guess everyone here is interested in a clean API. I also agree that it should be as easy as possible to use, but this is supposed to be a class that caters to most astronomy fields and as such I worry about oversimplification. I don't believe that with most things in AstroPy one will just be able to pick it up without having glanced at the documentation once. That is definitely not the case for all other reduction and data analysis tools (be it IDL, matlab, mathematica, numpy, scipy, ...).

Anyways +1 on a telecon (maybe Thursday or Friday?).

@astrofrog
Member

@wkerzendorf - remember that Spectrum1D doesn't have to do everything for all wavelengths, we can always sub-class Spectrum1D for more specific applications (OpticalSpectrum1D, XRaySpectrum1D, etc.). So in that sense, Spectrum1D should be fairly simple.

@demitri

demitri commented Feb 19, 2012

@wkerzendorf Let me try to explain my position.

I strongly believe that "power" is not at odds with "clean and easy to use", but to get both requires a great deal of thought and consideration (which I am happy to do). It means keeping things as consistent and self-evident as possible.

Let me give an example. There is a scientific code written in C++ called ROOT that is to the particle physics community what IDL is to astronomy. Everyone uses it; new students are immediately started on it. It is one of the most poorly designed frameworks I've ever seen (and this is saying a lot). Consequently, people who use it are always cursing it, wasting incredible amounts of time to do straightforward things, and take a long time to "get up the learning curve". As an example, there is a Histogram object that you create from an array. (As you know, particle physicists can't get up in the morning without making a histogram.) The simplest thing you would do is create the histogram object, populate it with the array, and then plot it. If you did that, it's possible that your plot wouldn't make sense, because the first bin of the histogram contains underflow values, and the last bin overflow values. This is not expected behavior. The histogram is an object - the underflow/overflow should be two properties. Virtually every ROOT user has run up against this, and in my (last) department alone I've seen hours wasted on figuring this out. In my mind I multiplied that by the hundreds (or more) of other ROOT users, and that's a ridiculous amount of physicist time wasted by a poor design decision. There are many, many more examples like that.

Yes, people will have to go to the documentation. That is unavoidable. But the code should be as readable as possible, such that if you are simply reading someone's code - and you are not intimately familiar with AstroPy or Spectrum1D - that the intent is clear. If you are writing it, sure, you'll have to look at a tutorial or the docs to get the details right.

I recently converted someone from IDL to Python, and one thing he was so excited about is that there was something he wanted to try but didn't know if it would work. He said he wrote some lines that, based on his experience with Python, he thought should work, and was thrilled that they did. This is a huge argument for consistency with the greater Python ecosystem in general.

Another reason why I think the code should be easy to use is pragmatic: if it's not miles better in ease of use, then people who are entrenched in IDL will simply stay there. Simple as that. We don't have a hope to compete with the existing IDL libraries in the short term, so ease of use becomes even more important.

Finally, one more lesson I learned from ROOT. I remember looking at the documentation, which was written by the developers (this was years ago). I quickly discovered the docs were very sparse and had holes in many, many places (and this in the reference describing the API!). The main docs started off with a statement that said (I'm paraphrasing), "The docs are written by the developers. It's not as complete as it can be, but we feel our time is better spent in developing the code further than writing documentation." What they were saying is that they felt their time outweighed the countless hours that thousands of physicists have wasted trying to figure out how to get a job done, and I thought that was unbelievably arrogant. (I am not leveling that accusation at anyone here!!) But it's a lesson that's stuck with me; if I need to spend 5 hours to get a piece of code right that will save someone five minutes I will happily do it, because that five minutes has a huge multiplier next to it. I'm willing to make someone type the extra six characters in "dispersion" over "disp" for this reason. (I'm also willing to introduce people to editors that feature code-completion. :)

Anyway, I'll get off my soapbox. Simple + powerful = win.

@demitri

demitri commented Feb 19, 2012

I was thinking about it, and I'm going to +1 @astrofrog's mention of Python's sort functionality. We can implement something like this:

spec = Spectrum1D(...)
spec.interpolate() # in place

interpolate(spec, ...) # returns new object

Someone who knows Python will see the convention, and someone who doesn't will learn when they see it applied elsewhere. The details to handle that (and use the same code) are easily, um, handleable.
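One way to get both behaviours from a single implementation, following the list.sort()/sorted() convention: the method mutates in place, the module-level function returns a new object. The names and class shape here are illustrative only.

```python
import numpy as np

# Illustrative sketch of the sort()/sorted() convention: the method works
# in place, the module-level function returns a new object, and both
# share one implementation.
class Spectrum1D:
    def __init__(self, disp, flux):
        self.disp = np.asarray(disp, dtype=float)
        self.flux = np.asarray(flux, dtype=float)

    def interpolate(self, new_disp):
        """In place, like list.sort()."""
        self.flux = np.interp(new_disp, self.disp, self.flux)
        self.disp = np.asarray(new_disp, dtype=float)

def interpolate(spec, new_disp):
    """Returns a new object, like sorted(); the original is untouched."""
    new = Spectrum1D(spec.disp, spec.flux)
    new.interpolate(new_disp)
    return new
```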

The main reason I'm opposed to returning copies is that if I am working with hundreds or thousands of spectra (SDSS recently crossed the 1 million spectrum milestone, so I will), generating new copies at each step (regridding, reshifting, etc.) will generate a huge number of (OO) objects as everything internal to the class is copied. This is a high drag on any OO code - you want to avoid creating large numbers of objects. Further, my spectra are going to come from a database, so I'd like to attach the returned database object (which contains a huge volume of metadata and a pointer to a live connection to the database) to the spectrum. Copying that into a new object (or any other user metadata that might come along for the ride), again, could be a big drag on the code at best or break things at worst.

@andycasey
Member

Hi everyone,

I've been interested in astropy (and more specifically in Spectrum1D since its formal conception). For those of us who did not attend the meeting, I'm glad that the discussion has been brought more "online" so that others can participate; it seemed there was a lot discussed at the meeting which was not immediately reflected in the wiki.

For background: I currently use a custom class (like what Spectrum1D will eventually be) for my data analysis with tens of thousands of spectra, where in the final stages of analysis they are all on a common dispersion.

1 - I think there are actually three issues present here: (i) whether to have the from_ascii/from_fits methods at all, (ii) whether they should be in the Spectrum1D class or in specutils.io and (iii) how to handle instantiating a Spectrum1D object if you want to allow for (a) arrays of dispersion and flux and (b) flux and a WCS dispersion mapping.

i - The from_ascii/from_fits (and to_ascii/to_fits) methods are common enough that they should be provided, and they should live in specutils.io.

ii - The general way to instantiate a Spectrum1D object should be with a flux and dispersion array (with the units specified by disp_units and flux_units). I understand that if you instantiate a spectrum from a FITS image which contains some polynomial WCS map for the dispersion, then information is lost if you simply feed in all the dispersion points as an array into Spectrum1D. Ideally you might want to have a WCS mapping for every object so that maximum information is retained, but the general case will be when people instantiate the class with two arrays. Moreover, if you want a WCS mapping for every object, then when the dispersion is linear you are only adding a layer of unnecessary complexity and overhead.

If an object is instantiated from a FITS file or from an ASCII file then it would be prudent to keep the WCS dispersion. Thus, the from_ascii/from_fits functions are no longer simply "wrappers" for np.loadtxt, they actually provide additional functionality because your resultant Spectrum1D will have a better described dispersion - without the need to specify the dispersion units either.

I do not believe this is sufficiently worthy to have two types of Spectrum1D classes, or two different instantiating methods. I'm open for discussion on the implementation of this, but allowing an astropy.wcs class (or whatever the appropriate class is) to be passed through as the dispersion is probably the best way to go. The dispersion can then be specified as either an array or an astropy.wcs class, and the only overhead in the instantiator for most cases will be

if isinstance(dispersion, astropy.wcs.WCS):
    # a WCS mapping has been provided for the dispersion
    ...
else:
    # flux and dispersion array-types have been provided
    ...

or similar. Some may believe that this could give reason for multiple different use-cases requiring different if-statements in the instantiator, causing unnecessary complication, but this is a logical fallacy. The general case is for when a dispersion and flux array will be passed in, and the only other way you would want to instantiate it would be if you wanted to provide additional dispersion information through the use of a WCS mapping.

2 - Verbosity is great, but it should not be forced for those who know what they're doing. Have the documentation all state the argument as dispersion, but just as matplotlib allows me to specify "edgecolor" or "ec", let me specify disp instead of dispersion.

FYI I have never read "disp" as display in these discussions. Just like you read RCE as rice, I read it as race. Let the user who knows what they're doing use a shortcut - just don't advertise the shortcut.

3 - +1 for @astrofrog

4 - Spectrum1D is a custom class, and its usage should be designed for the most general of cases. If I'm wanting to take certain parts of a spectrum, I would rarely be splicing it based on flux levels. In general I would want to cut out certain portions of the spectrum. Using,

spliced = spectrum[450:580]

is natural and clean. Am I referencing the indices or the dispersion? The idea of having this class and having it interact with other spectra and scalars is that I shouldn't have to worry about the indices in a general sense. I work with the dispersion and that's the level I want to interact on. What about in the scenario above? What does 450 actually represent physically? It ought to slice on whatever the natural units of the dispersion are. If the dispersion units are nm, then I should get a resultant spectrum from 450 nm to 580 nm (or a shorter region if bounded by the length of the spectrum).

I know that some people feel uncomfortable with this because it's typically how they handle lists or arrays. But the difference here is that the natural way to associate with lists and arrays is to splice by their indices because you have no other mapping available. Here I don't want to have to work in the index space; I always work in the dispersion space, which is familiar to me. Having a splice function could also work, but then will you leave __getitem__ to reference the spectrum data? What use would it have beyond what you could access with spectrum.disp / spectrum.flux?

Write it for the way people use their existing spectra; we access arrays by indices, we work with spectra on dispersion. If you've got a dispersion layer there you ought to encourage the use of it!

5 - Here's how I think it should go:

Scalar operations are a no-brainer, they should operate on the flux.

If two spectra are on the same dispersion mapping then it's also quite simple; the fluxes should be operated on simply.

The metadata should be merged in the resulting spectrum. If there is missing information (e.g. RA header in one object but not in the other) then the resulting spectrum should keep as much metadata as available. If there are conflicting headers then the first operand's metadata should be used but a warning should be raised.

Similarly, if two spectra are not on the same dispersion mapping then it should use the first operand's dispersion mapping and throw a warning. In my code I would never write code like this (for the same reasons I would not use a known deprecated function in code), because it would spit out unnecessary warnings. But for those of us just wanting to try things out in the terminal and "get things done", it should be just as easy to do so - but I should be warned of any potential complications.

Addition of the errors is a separate issue - if the Spectrum1D is associated with gaussian errors, and another spectrum is associated with similar errors, then the two errors should know how they should operate when added together. It should not be Spectrum1D's job to assume how the error classes should interact with one another.
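As a sketch of that delegation, assuming independent 1-sigma gaussian errors (the class name is made up for illustration):

```python
import numpy as np

# Hypothetical error class that knows how to combine itself under
# addition, so the spectrum class can simply delegate to it.
class GaussianError:
    def __init__(self, sigma):
        self.sigma = np.asarray(sigma, dtype=float)

    def __add__(self, other):
        # independent gaussian errors combine in quadrature
        return GaussianError(np.sqrt(self.sigma ** 2 + other.sigma ** 2))

combined = GaussianError([3.0]) + GaussianError([4.0])   # sigma -> [5.0]
```

Spectrum1D's __add__ would then just compute self.error + other.error and let the error class decide what that means.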

6 - Don't clutter up the namespace, especially when smooth_x and smooth_y may access the same underlying functions with different arguments. Scipy allows text arguments, numpy does, matplotlib does; let's not clutter the namespace.

7 - I'm a firm believer that Spectrum1D (and nddata objects) should return new class objects. If I'm manipulating my spectrum (be it subtracting sky, interpolating, or whatever the operation is), I want to keep data concurrency. I want to know how the spectrum was affected so that I can analyse the steps later on. At the stages where I don't need to keep the original data before the operation, I would simply re-write over my object:

spectrum = spectrum.interpolate(s1)

But since in general it's good to have data concurrency and know what happened to the spectra at each step, I don't want to be forcing a deepcopy every time I want to do an operation. Return new objects. The ones that aren't needed would be overwritten by the user anyways.

8 - Not a relevant issue yet.

9 - "Smart" plot commands are not necessary! Why bring in matplotlib overhead when it's not required? I understand that in other libraries people have spent way too long trying to plot something. The documentation for astropy will not be lacking, and there should be examples on how to plot heaps of different data types. Nobody is ever going to use spectrum.plot() as the final plot they use in a journal paper because there is heaps of detailed customisation (axes ticks, titles, labels, colours) that would need to be adjusted before publishing it.

In my opinion there should be easily accessible examples on the web of how to easily plot different types of data, but we should not be wrapping matplotlib! Not to save the 4 lines it takes to do:

import matplotlib.pyplot as plt
fig = plt.figure()
ax = fig.add_subplot(111)
ax.plot(spectrum.disp, spectrum.flux, 'k')

The data sets for Spectrum1D are simply not sufficiently complex to warrant a "smart" plotting function which could easily be shown as a documented example code.

10 - Once again, return a copy. If a copy is not required the user would have just overwritten their existing spectrum anyways. Python lists sort in place, but numpy allows both sorting in place and returning a new object.

11 - An interesting problem and I can greatly see the need for it. I think this issue should be split to a separate discussion.

@adrn
Member

adrn commented Feb 20, 2012

By our telecon on Friday, I will compile a list of discussion points for each of Demitri and my ideas -- I would have done it sooner but have been swamped with work lately!

@wkerzendorf
Member Author

So @adrn does this mean you prefer Friday noon (is that true for Demitri as well)? I think for now we should stick to the two main issues of instantiator and copy vs inplace for this telecon. This will probably take a long time and has implications for many of the other points.

@adrn
Member

adrn commented Feb 20, 2012

@wkerzendorf Friday noon works for me, not sure about @demitri -- agreed about the latter point, but I think it'll be good to collect our input before the discussion.

Thanks!

@eteq
Member

eteq commented Feb 21, 2012

@wkerzendorf - I can do Thursday or Friday noon (assuming you mean EST).

I do not think we should limit the discussion to just copy vs inplace and instantiator, however... Looking over this list, I think less than half of the 11 originally mentioned are now in contention (or, alternatively, can be naturally merged). So @wkerzendorf, can you post here a short list of the items that we have not yet achieved consensus regarding?

@eteq
Member

eteq commented Feb 21, 2012

And I would also suggest that, following the telecon, we close this pull request after someone posts a summary of what decisions were made, and we can use new issues for anything that's still not resolved (or new issues we want to poll online).

@eteq
Member

eteq commented Feb 21, 2012

A few comments/summaries I hope might clear a few items up (I'm intentionally leaving out quite a bit that is probably better resolved on the telecon):

2 - @andycasey So if I'm reading you correctly, having dispersion and x as the same thing (but using only dispersion in the docs and the like) is reasonable to you, then? I think in this case disp is bad because we have an existence proof already that it led to a fair amount of confusion among others even here...

3 - @wkerzendorf Actually, the standard we've come to in the Astropy core is that we can't require scipy at import - but having functionality that requires it is just fine. It may mean some functionality is lost if you don't have scipy installed, but in this case it should be pretty easy to default to the scipy version, and fall back on the numpy one if an ImportError is raised. In general, though, it's ok to use scipy if there's no easy alternative, as long as it's only imported in the function/method.
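The import policy sketched above might look like this (the function name is illustrative):

```python
import numpy as np

# Sketch of the policy above: try scipy first, fall back on numpy, and
# import scipy only inside the function so module import never needs it.
def interpolate_flux(new_disp, disp, flux):
    try:
        from scipy.interpolate import interp1d
        return interp1d(disp, flux)(new_disp)
    except ImportError:
        return np.interp(new_disp, disp, flux)
```

Either branch returns the same result for in-range points, so users without scipy lose nothing here.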

4 - @andycasey, I think I'm more and more in line with @demitri and @adrn on this one: In this class we should not assume much about what the user wants... and allowing slice notation will always require assumptions that in many cases will be violated. The original intent (at least as I understand it) of Spectrum1D is to provide a (still useful) base upon which subclasses can be built as needed. So while I understand and sympathize with your wish to provide this option, I think it's better implemented as a subclass - that is, Spectrum1D should not have a __getitem__, but it would make sense to add an e.g. WavelengthSpectrum1D that assumes defaults including using wavelength slicing notation. Then people are free to use whatever interface they like, but can stick with the simplest one (Spectrum1D) if they're confused.

5 - I think there's a general consensus that scalar operations and matched dispersion operations are ok, but non-matching dispersion is not as definite. (My feeling is the same as 4 - don't assume anything in Spectrum1D, but we should include/support "smarter" subclasses that do this in some reasonable way)

9 - @andycasey and @wkerzendorf I think this would in some cases be used for publication-quality plots... As long as there's some flexibility in the provided options, I can see using this for "real" plots. I've certainly done this with astropysics.spec (although, admittedly, I did customize that to my taste). Either way, though, as @keflavich says, it's important to have a "quick-look" method even if it isn't for publication-quality (although I suppose this could be in a higher level of "smartness"). Perhaps this particular discussion is best to decide in the telecon, though.

I should emphasize also that this would not mean adding matplotlib as a requirement. As I described above in 3, the intent in astropy is to encourage use of external libraries when needed as long as it doesn't crash at import. So a user without matplotlib can use Spectrum1D as normal, until they hit plot, and only then will they see an ImportError.
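A toy sketch of that deferred import (the class shape and axis labels are arbitrary):

```python
# Sketch of deferring the matplotlib import to the plot() call itself, so
# Spectrum1D stays importable and usable without matplotlib installed.
class Spectrum1D:
    def __init__(self, disp, flux):
        self.disp, self.flux = disp, flux

    def plot(self, **kwargs):
        import matplotlib.pyplot as plt  # only needed here, not at module import
        fig, ax = plt.subplots()
        ax.plot(self.disp, self.flux, **kwargs)
        ax.set_xlabel('Dispersion')
        ax.set_ylabel('Flux')
        return ax
```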

11 - Also sounds like there's consensus that this should at least be supported... I think it fits well with the suggestion I have above of layers of "smartness" in a class hierarchy.

@astrofrog
Member

I didn't comment on slicing previously because I had to think about it - my opinion is that square-bracket slicing should not be implemented because it would not be intuitive whether this would be index/wavelength/frequency based. Furthermore, if we do have slicing, it should be through slice or slice_index/slice_wavelength/slice_frequency methods which can have options and docstrings that can clarify exactly how the slicing works.

@astrofrog
Member

Thursday or Friday should be fine for me.

@demitri

demitri commented Feb 21, 2012

Both Thursday or Friday are ok for me.

@wkerzendorf
Member Author

Sorry, my collaborators just suggested putting an HST proposal in, so I'm out for Thursday or Friday. Early next week? (I can do weekend as well if people are really keen).

@eteq
Member

eteq commented Feb 21, 2012

Monday or Tuesday (Feb 27/28) @ noon EST work for me. I could also do Sunday if others want to (although I would prefer weekday). Or perhaps a doodle is called for here (ideally with the "could make it work but prefer not" option)?

Also, @wkerzendorf, do you still intend to post a list of what has been decided and what still needs to be? I think we will be more productive if we have all the items on the table and agree we need to make a decision on all of them in the telecon (or at least agree to post the telecon consensus and give a bit of time to make sure the rest of the community doesn't strongly disagree).

@wkerzendorf
Member Author

@eteq sounds great. I do intend to post a list, but haven't put it online yet. My idea, for the next meeting is to restrict it to the two main issues (instantiator; inplace/copy) so it doesn't get too long (and schedule subsequent issues for later meetings). I will however have a list with all current issues.

Will do a doodle as well.

@eteq
Member

eteq commented Feb 22, 2012

@wkerzendorf - My vote would be to try to get through most of the issues in one meeting if possible - that's why I was suggesting we at least have all the topics. I think if we have a clearly-defined agenda we can get through the items in a relatively short amount of time. But we can decide that as a group once the call starts.

Good luck with the Hubble proposal!

@eteq
Member

eteq commented Feb 26, 2012

@wkerzendorf - Are we not going to do this next week after all? A doodle should go up at least a few days in advance...

@wkerzendorf
Member Author

We are. I'll put the doodle up in an hour.

@wkerzendorf
Member Author

redacted and replaced with new draft

@wkerzendorf wkerzendorf closed this Mar 5, 2012
wkerzendorf added a commit that referenced this pull request Mar 29, 2013
Improved documentation and some consistency improvements
keflavich added a commit to keflavich/specutils that referenced this pull request Apr 21, 2017
keflavich pushed a commit to keflavich/specutils that referenced this pull request Apr 21, 2017
change 'generic' to 'tabular' (there is no "generic" FITS spectrum)
keflavich pushed a commit to keflavich/specutils that referenced this pull request Sep 21, 2017
keflavich pushed a commit to keflavich/specutils that referenced this pull request Sep 21, 2017
SaOgaz pushed a commit that referenced this pull request Mar 25, 2019
Getting most up-to-date version of cubeviz
eteq pushed a commit that referenced this pull request Nov 26, 2019
nmearl pushed a commit that referenced this pull request Aug 28, 2020
rosteen pushed a commit that referenced this pull request Feb 17, 2022
Fix and unify internal links.
rosteen pushed a commit that referenced this pull request Mar 27, 2023
* WIP: Compat with asdf v3

* Fix schema references

and move ASDF schemas a few level up.

Co-authored-by: William Jamieson <wjamieson@stsci.edu>

* TST: Stop using roundtrip_object

* Fixes and cleanups (#1)

* Fix the schema

* Fix tags to be the ones listed in the manifest

* Remove unnecessary tree packing/unpacking.
asdf converters no longer need to return a completely valid ASDF sub-tree.
ASDF will recursively parse the tree until everything is converted automatically.

* Add `spectral_axis-1.0.0` tag for `SpectralAxis` objects

* Add `SpectralAxisConverter`

* Fix broken tests

* Rename converters to reflect new ASDF language

* Better organize asdf extension

* Rename `spectra` module to `converters` to better describe its contents

* tmp fix jsonschema bug

* Move jsonschema pin to test section
because the pin will happen upstream anyway
and should not be in our install pins.

Other nitpicks.

* Simplify create_spectrum1d in testing

* Undo jsonschema pinning, update asdf pin

Co-authored-by: William Jamieson <wjamieson@stsci.edu>

---------

Co-authored-by: William Jamieson <wjamieson@stsci.edu>
rosteen pushed a commit to rosteen/specutils that referenced this pull request Aug 15, 2023
* WIP: Compat with asdf v3

* Fix schema references

and move ASDF schemas a few level up.

Co-authored-by: William Jamieson <wjamieson@stsci.edu>

* TST: Stop using roundtrip_object

* Fixes and cleanups (astropy#1)

* Fix the schema

* Fix tags to be the ones listed in the manifest

* Remove unnecessary tree packing/unpacking.
asdf converters no longer need to return a completely valid ASDF sub-tree.
ASDF will recursively parse the tree until everything is converted automatically.

* Add `spectral_axis-1.0.0` tag for `SpectralAxis` objects

* Add `SpectralAxisConverter`

* Fix broken tests

* Rename converters to reflect new ASDF language

* Better organize asdf extension

* Rename `spectra` module to `converters` to better describe its contents

* tmp fix jsonschema bug

* Move jsonschema pin to test section
because the pin will happen upstream anyway
and should not be in our install pins.

Other nitpicks.

* Simplify create_spectrum1d in testing

* Undo jsonschema pinning, update asdf pin

Co-authored-by: William Jamieson <wjamieson@stsci.edu>

---------

Co-authored-by: William Jamieson <wjamieson@stsci.edu>
7 participants