# HyperSpy Tutorial

## EELS analysis of perovskite oxides

This tutorial demonstrates the HyperSpy functionality used to analyse Electron Energy Loss Spectroscopy (EELS) data, using EELS datasets from a perovskite oxide heterostructure.

It assumes some knowledge of how to use HyperSpy, such as loading datasets and how the basic signals work.

This notebook requires **HyperSpy 1.4.2** or later.

## Author

7/6/2016 Magnus Nord - Developed for HyperSpy workshop at Scandem conference 2016

## Changes

* 3/8/2016 Updated for HyperSpy 1.1. Added note about Gatan Digital Micrograph GOS.
* 20/08/2019 Katherine MacArthur - Checked for HyperSpy 1.5.1 and commented out sections requiring Gatan GOS files.

## Table of contents

1. <a href='#spec_and_data'> Specimen & Data</a>
2. <a href='#simple_quant'> Simple quantification</a>
3. <a href='#curve_fitting_quant'> Curve fitting quantification</a>
4. <a href='#fine_structure_analysis'> Fine structure analysis</a>
5. <a href='#fine_structure_ok'> Fine structure oxygen-K edge</a>

# <a id='spec_and_data'></a>1. Specimen & Data

This notebook was used for the HyperSpy workshop at the Norwegian University of Science and Technology for the Scandem 2016 conference, 7 June 2016.

The data was acquired on a Jeol ARM200cF using a Gatan Quantum ER with DualEELS capabilities.

The data itself is from La$_{0.7}$Sr$_{0.3}$MnO$_3$ thin films deposited on SrTiO$_3$. In the fine structure example, parts of the film have been exposed to the electron beam for a very long time, inducing oxygen vacancies.

The datasets have been binned to reduce the file size and processing time.

# <a id='simple_quant'></a> 2. Simple quantification


Firstly, we use an IPython magic command to set up the plotting backend:

In [1]:
%matplotlib notebook

Then import HyperSpy. If the `traitsui` GUI is installed and enabled, you will get two *WARNING:hyperspy_gui_traitsui* messages about the incompatibility of the `notebook` matplotlib backend with the `traitsui` GUI. These can safely be ignored if you don't use any GUIs or if you are using the `ipywidgets` GUI. Alternatively, you can use the `qt` matplotlib backend, which is compatible with the `traitsui` GUI.

In [2]:
import hyperspy.api as hs



First we take a look at an EELS line scan across an La$_{0.7}$Sr$_{0.3}$MnO$_3$/SrTiO$_3$ thin film. The core loss data has several peaks: Ti-L$_{2,3}$, O-K, Mn-L$_{2,3}$ and La-M$_{4,5}$. We can navigate the line scan using the navigation window, and by moving the red line.

In [3]:
s = hs.load("datasets/LSMO_STO_linescan.hdf5")

In [4]:
s.plot()


Now we can quantify the first edge (Ti-L$_{2,3}$, 460 eV, 0-10 nm), firstly by removing the background, then by integrating the Ti-L$_{2,3}$ edge. Move the red line in the navigation figure towards the top part (0-10 nm, x axis). Then drag a span from about 400 to 440 eV in Figure 2.

Next, untick the "Fast" button in the dialog box under Figure 2, and press Apply.

Note: sometimes the background removal doesn't work properly. If this happens, reload the data using the command above (`s = hs.load("datasets/LSMO_STO_linescan.hdf5")`), then rerun the `s.remove_background()` command.

In [None]:
s.remove_background()
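
To make the background removal step less of a black box, here is a minimal sketch of the underlying idea: fit a power law $A E^{-r}$ in a pre-edge window and subtract its extrapolation from the whole spectrum. It uses plain Python on synthetic data, and the helper `fit_power_law` is a hypothetical name for illustration; it is not how HyperSpy implements `remove_background`.

```python
import math

def fit_power_law(energies, counts):
    """Least-squares fit of counts = A * E**(-r), done as a line in log-log space."""
    xs = [math.log(e) for e in energies]
    ys = [math.log(c) for c in counts]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return math.exp(mean_y - slope * mean_x), -slope  # (A, r)

# Synthetic spectrum (assumed values): power-law background plus a step "edge" at 460 eV.
energy = [400.0 + i for i in range(101)]  # 400 .. 500 eV in 1 eV channels
spectrum = [1.0e11 * e ** -3.0 + (50.0 if e >= 460.0 else 0.0) for e in energy]

# Fit only inside the pre-edge window, then subtract the extrapolated background everywhere.
window = [(e, c) for e, c in zip(energy, spectrum) if 405.0 <= e <= 448.0]
A, r = fit_power_law([e for e, _ in window], [c for _, c in window])
background_removed = [c - A * e ** -r for e, c in zip(energy, spectrum)]
```

In this noise-free toy case the pre-edge region goes to zero and only the step remains; with real data the choice of fitting window matters a lot, which is why the tutorial picks it by hand.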

To integrate the Ti-L$_{2,3}$ edge interactively we can use a region of interest:

In [6]:
roi = hs.roi.SpanROI(left=450, right=600)
s.plot()
roi.add_widget(s, axes=["Energy loss"])


Finally, to integrate the signal in the selected ROI:

In [7]:
s_ti = s.isig[roi].integrate1D(axis="Energy loss")

In [8]:
s_ti.plot()


Notice that we can also perform the same operations in one single line if interactivity is not required:

In [9]:
s = hs.load("datasets/LSMO_STO_linescan.hdf5")

In [10]:
s_ti = s.remove_background(signal_range=(405.,448.)).isig[448.:480.].integrate1D(axis="Energy loss")

In [11]:
s_ti.plot()
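
The float slicing in `isig[448.:480.]` selects channels by energy value rather than by channel index, and the integration of a binned signal then amounts to summing those channels. The sketch below illustrates that mapping on a uniform axis with plain Python lists. The helper names and the half-open window convention are assumptions for illustration, not HyperSpy's API.

```python
def value_to_index(value, offset, scale):
    """Nearest channel index for an axis value on a uniform axis."""
    return int(round((value - offset) / scale))

def integrate_window(counts, offset, scale, left, right):
    """Sum the channels from `left` up to (not including) `right`, both in axis units."""
    i0 = value_to_index(left, offset, scale)
    i1 = value_to_index(right, offset, scale)
    return sum(counts[i0:i1])

# Toy spectrum (assumed values): axis starts at 400 eV with 0.5 eV per channel.
offset, scale = 400.0, 0.5
counts = [0.0] * 200
for i in range(96, 160):  # channels covering 448.0 .. 479.5 eV
    counts[i] = 2.0

ti_signal = integrate_window(counts, offset, scale, 448.0, 480.0)
pre_edge = integrate_window(counts, offset, scale, 400.0, 448.0)
```

Applying this to every probe position gives one number per position, which is the kind of line profile plotted by `s_ti.plot()`.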


# <a id='curve_fitting_quant'></a> 3. Curve fitting quantification

Now, let's do some more advanced quantification using HyperSpy's extensive modelling framework. Firstly we load the low loss and core loss spectra.

In [12]:
s_ll = hs.load("datasets/LSMO_STO_linescan_low_loss.hdf5")

In [13]:
s = hs.load("datasets/LSMO_STO_linescan.hdf5")

HyperSpy includes Ray Egerton's hydrogenic cross sections, which only cover the K and L edges. Because this dataset also contains the La-M edge, we would ideally like to fit that too. To do so we can use the Digital Micrograph Hartree-Slater cross section files, if you have access to them. Firstly we have to tell HyperSpy where to find these files, since they are not included in HyperSpy: in the preferences dialog opened by `hs.preferences.gui()`, go to the "EELS" tab, then set "GOS directory" to the "H-S GOS Tables" folder. Note that unfortunately this requires a license of Gatan Digital Micrograph.

In the following section, any command that requires the Hartree-Slater cross sections is commented out, so the uncommented lines can be run using only the built-in cross sections. Where a cell contains two similar commands, only one should be run, depending on which cross section files you have access to.

*Remember*: The `#` simply converts a line of code to a comment line. Any code line with `#` in front of it will therefore be treated as a comment rather than code, and will be ignored. Use this to control what gets run in this section.

In [14]:
hs.preferences.gui()


Here, the metadata has been populated with some of the experimental parameters:

In [15]:
s.metadata

├── Acquisition_instrument
│   └── TEM
│       ├── Detector
│       │   └── EELS
│       │       └── collection_angle = 33.1
│       ├── beam_energy = 200.0
│       ├── convergence_angle = 27.1
│       └── dwell_time = 0.4999055733891917
├── General
│   ├── original_filename = LSMO_STO_linescan.dm3
│   └── title = EELS Spectrum Image (high-loss)
└── Signal
    ├── binned = True
    ├── signal_origin = 
    └── signal_type = EELS

Firstly we want to fix the zero point for the energy axis using the zero loss peak.

Plot it, and use the zoom functionality (the box button) in the Signal plot to zoom in on the zero loss peak.

It is offset by approximately 0.6 eV.

In [16]:
s_ll.plot()


To fix this we use `align_zero_loss_peak`. The `subpixel` argument interpolates the data, so we get sub-pixel alignment. Using the `also_align` argument, we can apply the same alignment to other signals, for example when using DualEELS, where the low loss and core loss spectra are acquired quasi-simultaneously. Note that the other signals must have the same navigation shape as the low loss signal.

In [17]:
s_ll.align_zero_loss_peak(subpixel=True, also_align=[s])


Initial ZLP position statistics
-------------------------------
Summary statistics
------------------
mean:	-1
std:	0

min:	-1
Q1:	-1
median:	-1
Q3:	-1
max:	-1



By doing this, we aligned both our low loss and core loss spectra.
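
As a rough illustration of what sub-pixel alignment involves, the sketch below locates the zero loss peak in each spectrum by fitting a parabola through the maximum channel and its two neighbours, and converts the result into an energy shift per probe position. It is a simplified stand-in written in plain Python with made-up data. HyperSpy's `align_zero_loss_peak` uses its own estimation and interpolation, so treat the function names here as hypothetical.

```python
def subpixel_peak(counts):
    """Peak position in (fractional) channels from a parabola through the maximum."""
    i = max(range(len(counts)), key=counts.__getitem__)
    if i == 0 or i == len(counts) - 1:
        return float(i)
    left, mid, right = counts[i - 1], counts[i], counts[i + 1]
    denom = left - 2.0 * mid + right
    if denom == 0.0:
        return float(i)
    return i + 0.5 * (left - right) / denom

def zlp_energies(low_loss_spectra, offset, scale):
    """Energy (eV) of the zero loss peak in each spectrum of a line scan."""
    return [offset + scale * subpixel_peak(s) for s in low_loss_spectra]

# Two toy low loss spectra (assumed values) with peaks at channels 10.3 and 12.0.
offset, scale = -5.0, 0.5
low_loss = [[max(0.0, 100.0 - (k - c) ** 2) for k in range(21)] for c in (10.3, 12.0)]

zlp_energy = zlp_energies(low_loss, offset, scale)
# The shift that moves each zero loss peak to 0 eV; the same shift would be
# applied to the core loss spectrum acquired at that probe position.
shifts = [-e for e in zlp_energy]
```

Because the shift is estimated per probe position, the core loss signal can only reuse it if it has the same navigation shape, which is the requirement mentioned above.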

In [18]:
s_ll.plot()


We have to add the elements which are present in the sample to `s`:

In [19]:
#For Hydrogenic cross sections run this line.
s.add_elements(('Mn','O','Ti'))
#For Hartree-Slater cross sections run this line.
#s.add_elements(('Mn', 'O', 'Ti', 'La'))

Then we make a model out of the core loss spectrum. The low loss spectrum is convolved with the model, which means plural scattering is automatically taken into account. In addition this leads to better fits.

**NOTE:** modelling the La-M edge requires the GOS files from Gatan Digital Micrograph. If you don't have these files, only K and L edges can be created, using the hydrogenic cross sections. If you do have them but HyperSpy can't find them in the default location, you can specify the location using `hs.preferences.gui()`.

In [20]:
s.metadata

├── Acquisition_instrument
│   └── TEM
│       ├── Detector
│       │   └── EELS
│       │       └── collection_angle = 33.1
│       ├── beam_energy = 200.0
│       ├── convergence_angle = 27.1
│       └── dwell_time = 0.4999055733891917
├── General
│   ├── original_filename = LSMO_STO_linescan.dm3
│   └── title = EELS Spectrum Image (high-loss)
├── Sample
│   └── elements = ['Mn', 'Ti', 'O']
└── Signal
    ├── binned = True
    ├── signal_origin = 
    └── signal_type = EELS

In [21]:
#For Hydrogenic cross sections run this line, in order to crop out the La edge.
m = s.isig[0.:825.].create_model(ll=s_ll)
#For Hartree-Slater cross sections run this line.
#m = s.create_model(ll=s_ll)
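
To see why passing the low loss spectrum accounts for plural scattering, it can help to look at the forward operation on its own: an ideal, single-scattering edge is convolved with the normalised low loss spectrum, so that every plasmon in the low loss adds a delayed copy of the edge. The following is a toy sketch in plain Python, assuming the zero loss peak sits in the first channel of the low loss spectrum. It is not the code HyperSpy runs internally.

```python
def convolve_with_low_loss(ideal, low_loss):
    """Full discrete convolution of an ideal spectrum with a unit-area low loss kernel."""
    total = sum(low_loss)
    kernel = [v / total for v in low_loss]
    out = [0.0] * (len(ideal) + len(kernel) - 1)
    for i, a in enumerate(ideal):
        for j, k in enumerate(kernel):
            out[i + j] += a * k
    return out

# Toy data (assumed values): a sharp step edge, and a low loss spectrum with
# 80 % of the counts in the zero loss peak and 20 % in a plasmon 4 channels up.
ideal_edge = [0.0] * 5 + [10.0] * 15
low_loss = [80.0, 0.0, 0.0, 0.0, 20.0]

model = convolve_with_low_loss(ideal_edge, low_loss)
```

The step is reduced to 8 counts at the onset and only reaches 10 counts four channels later, which is the kind of extra intensity above the edge that plural scattering produces in thicker regions.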

The model now consists of several components: one `EELSCLEdge` component per edge, plus a `PowerLaw` component for the background.

In [22]:
m.components

   # |      Attribute Name |      Component Name |      Component Type
---- | ------------------- | ------------------- | -------------------
   0 |            PowerLaw |            PowerLaw |            PowerLaw
   1 |               Ti_L3 |               Ti_L3 |          EELSCLEdge
   2 |                 O_K |                 O_K |          EELSCLEdge
   3 |               Mn_L3 |               Mn_L3 |          EELSCLEdge

We can fit the model to the experimental data using the `multifit` function with `kind='smart'`. This fits in a way optimized for EELS data, working from the lowest to the highest energy losses.

In [23]:
m.multifit(kind='smart')


In [24]:
m.plot()


We can check the error of the fitting

In [25]:
#For Hydrogenic cross sections run this line.
edges = ("Ti_L3", "Mn_L3","O_K")
#For Hartree-Slater cross sections run this line.
#edges = ("Ti_L3", "La_M5", "Mn_L3","O_K")

In [26]:
hs.plot.plot_spectra([m[edge].intensity.as_signal("std") for edge in edges], legend=edges)


The fit is mostly OK, but still not very good. Firstly, we can move the edge onsets interactively:

In [27]:
m.plot()
m.enable_adjust_position()


Or manually, by directly changing the parameters of the edge components. The parameter is called `onset_energy`:

In [28]:
m.components.O_K.onset_energy.value = 528

However, to change it for all the probe positions we have to use assign_current_value_to_all()

In [29]:
m.components.O_K.onset_energy.assign_current_value_to_all()

We repeat this for the manganese edges. Since this is an L-edge, there are three different ones. However, we only have to set the Mn-L3: the L2 and L1 onsets are set to an energy relative to the L3.

In [30]:
m.components.Mn_L3.onset_energy.value

640.0

In [None]:
# Only with Hartree-Slater cross-sections
#m.components.Mn_L2.onset_energy.value

In [32]:
m.components.Mn_L3.onset_energy.value = 638.5
m.components.Mn_L3.onset_energy.assign_current_value_to_all()

The poor fit is also due to the fine structure, which the model does not currently take into account. To get a good fit, we can either exclude the fine structure regions from the fit or model them somehow.
The easiest way is to define certain regions as fine structure:

In [33]:
m.enable_fine_structure()

This will produce a much better fit, but will be much slower (~2 minutes).

In [34]:
m.multifit(kind='smart')


Now the fit is much better, due to the model taking into account the fine structure.

In [35]:
m.plot()


Now we can have a look at the relative intensity of the individual EELS edges using `plot_spectra`:

In [36]:
#For Hydrogenic cross sections run this line.
edges = ("Ti_L3", "Mn_L3", "O_K")
#For Hartree-Slater cross sections run this line.
#edges = ("Ti_L3", "La_M5", "Mn_L3","O_K")

In [37]:
hs.plot.plot_spectra([m[edge].intensity.as_signal() for edge in edges], legend=edges)


While the fit looks nicer, we can clearly improve it. Firstly, the intensities are negative where they should be zero. Secondly, the fine structure regions can be fine-tuned; in particular, the Mn-L1 fine structure window can be reduced:

In [None]:
# Only with Hartree-Slater cross-sections
#m.components.Mn_L1.fine_structure_width = 15

To avoid the negative values we use bounded fitting, where we constrain parameter values to lie within given bounds. The `bmin` and `bmax` attributes of the parameters are used for this.

In [38]:
m.components.Mn_L3.intensity.bmin = 0.0

In [None]:
# Only with Hartree-Slater cross-sections
#m.components.La_M5.intensity.bmin = 0.0

In [39]:
m.components.Ti_L3.intensity.bmin = 0.0

In [40]:
m.components.O_K.intensity.bmin = 0.0
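
The effect of a lower bound is easiest to see with a single free parameter. In the sketch below, an edge "template" is scaled to match data that contains only noise: the unbounded least-squares intensity comes out negative, while the bounded one is held at `bmin`. This is a deliberately simplified, one-parameter illustration in plain Python. A real bounded fit with several coupled parameters is handled by the optimizer, not by clamping each value separately, and `fit_intensity` is a hypothetical helper.

```python
def fit_intensity(data, template, bmin=None, bmax=None):
    """Least-squares scale factor for `template`, optionally restricted to [bmin, bmax].

    For a single parameter the squared error is a parabola in the scale factor,
    so the bounded optimum is the unbounded one moved to the nearest bound.
    """
    best = sum(d * t for d, t in zip(data, template)) / sum(t * t for t in template)
    if bmin is not None:
        best = max(best, bmin)
    if bmax is not None:
        best = min(best, bmax)
    return best

# Toy data (assumed values): a region of the scan where the element is absent,
# so the spectrum above the onset is just noise around zero.
template = [0.0, 0.0, 1.0, 1.0, 1.0]
noise_only = [0.3, -0.2, -0.4, 0.1, -0.3]

unbounded = fit_intensity(noise_only, template)
bounded = fit_intensity(noise_only, template, bmin=0.0)
```

Here `unbounded` is slightly negative and `bounded` is exactly zero, which mirrors what setting `intensity.bmin = 0.0` is meant to achieve for the edge intensities.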

In [41]:
m.multifit(fitter="leastsq", kind='smart', bounded=True)


In [42]:
m.plot()


In [43]:
hs.plot.plot_spectra([m[edge].intensity.as_signal() for edge in edges], legend=edges)


# <a id='fine_structure_analysis'></a> 4. Fine structure analysis

Here we take a look at a line scan from a La$_{0.7}$Sr$_{0.3}$MnO$_3$ thin film, where parts of the film have been bombarded with the electron beam for an extended time.

In [44]:
s = hs.load("datasets/LSMO_linescan.hdf5")

By moving the red line in the navigation window, we can see that there is clearly something going on in the middle of the line scan on both the oxygen and the manganese edges. In addition, there are some thickness changes during the line scan.

In [45]:
s.plot()


Using the low loss signal, we make sure the energy scale is properly calibrated

In [46]:
s_ll = hs.load("datasets/LSMO_linescan_low_loss.hdf5")

In [47]:
s_ll.plot()


The zero loss peak is not well aligned at 0 eV energy loss, so we should align it and the core loss

In [48]:
s_ll.align_zero_loss_peak(also_align=[s])


Initial ZLP position statistics
-------------------------------
Summary statistics
------------------
mean:	3.23
std:	0.249

min:	3
Q1:	3
median:	3
Q3:	3.5
max:	3.5



Now the zero loss peak has been shifted to 0 energy loss, and likewise the core loss spectrum `s` has also been aligned

In [49]:
s_ll.plot()


We can also calculate the relative thickness using the low loss spectrum. We have to specify the end of the zero loss peak; for cold field emission guns, 3.0 eV seems to work well.

In [50]:
s_thickness = s_ll.estimate_thickness(threshold=3.0)

It would also be possible to use HyperSpy to determine the threshold itself using:
    
    th = s_ll.estimate_elastic_scattering_threshold()
    s_ll.estimate_thickness(threshold=th)

This gives the relative thickness and, as expected, there is an increase towards the end of the line scan

In [51]:
s_thickness.plot()
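
The relative thickness plotted here is based on the log-ratio relation $t/\lambda = \ln(I_t / I_0)$, where $I_0$ is the integrated zero loss peak (everything below the threshold) and $I_t$ is the integral of the whole low loss spectrum. A minimal sketch of that calculation for a single spectrum, using plain Python and made-up counts, could look as follows; `relative_thickness` is a hypothetical helper, not a HyperSpy function.

```python
import math

def relative_thickness(counts, offset, scale, threshold):
    """Log-ratio estimate t/lambda = ln(I_total / I_zero_loss) for one spectrum."""
    i_thr = int(round((threshold - offset) / scale))  # first channel above the zero loss peak
    zero_loss = sum(counts[:i_thr])
    total = sum(counts)
    return math.log(total / zero_loss)

# Toy low loss spectrum (assumed values): axis starts at -2 eV with 1 eV channels.
# 100 counts in the zero loss peak, 50 counts of inelastic scattering further up.
offset, scale = -2.0, 1.0
counts = [5.0, 20.0, 50.0, 20.0, 5.0] + [0.0] * 10 + [10.0, 30.0, 10.0]

t_over_lambda = relative_thickness(counts, offset, scale, threshold=3.0)
```

With these numbers the result is $\ln(1.5) \approx 0.41$ in units of the inelastic mean free path. Choosing the threshold too low or too high moves counts between $I_0$ and the inelastic part, which is why the 3.0 eV value above has to be chosen with some care.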


# <a id='fine_structure_ok'></a> 5. Fine structure: oxygen K-edge
Let's take a closer look at the oxygen-K edge, firstly by removing the plasmon background, then cropping the spectrum to only include the oxygen-K edge. Note: this will overwrite the `s` spectrum with the cropped one.

In [None]:
s.remove_background()

This makes it much easier to compare the different positions. Pressing 'e' with the spectrum window highlighted gives a second spectrum picker, which can be moved independently of the first one

We can then do Fourier ratio deconvolution to remove the effects of plural scattering

In [53]:
s_deconvolved = s.fourier_ratio_deconvolution(s_ll)


In [54]:
s_deconvolved.plot()
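
The idea behind Fourier ratio deconvolution is that plural scattering acts as a convolution of the single-scattering core loss with the low loss spectrum, so dividing the two in Fourier space undoes it; multiplying by the transform of the zero loss peak (the reconvolution function) keeps the result well behaved. The sketch below demonstrates this on an idealised, noise-free toy case with a naive DFT in plain Python. It assumes circular convolution and a delta-like zero loss peak in channel 0, and it is not HyperSpy's implementation.

```python
import cmath

def dft(x, inverse=False):
    """Naive discrete Fourier transform (fine for a handful of channels)."""
    n = len(x)
    sign = 1.0 if inverse else -1.0
    out = []
    for k in range(n):
        acc = 0j
        for m, v in enumerate(x):
            acc += v * cmath.exp(sign * 2j * cmath.pi * k * m / n)
        out.append(acc / n if inverse else acc)
    return out

def circular_convolve(a, b):
    n = len(a)
    return [sum(a[m] * b[(k - m) % n] for m in range(n)) for k in range(n)]

def fourier_ratio(core_loss, low_loss, zero_loss):
    """Single-scattering estimate: IDFT( DFT(zlp) * DFT(core) / DFT(low loss) )."""
    num = [z * c for z, c in zip(dft(zero_loss), dft(core_loss))]
    ratio = [v / den for v, den in zip(num, dft(low_loss))]
    return [v.real for v in dft(ratio, inverse=True)]

# Toy data (assumed values).
n = 16
single = [1.0 if 4 <= k < 10 else 0.0 for k in range(n)]  # single-scattering edge
zero_loss = [100.0] + [0.0] * (n - 1)                     # ideal zero loss peak
low_loss = list(zero_loss)
low_loss[3] = 30.0                                        # add a plasmon peak
measured = circular_convolve(single, low_loss)            # edge with plural scattering

recovered = fourier_ratio(measured, low_loss, zero_loss)
```

In this toy case `recovered` equals the single-scattering edge scaled by the zero loss intensity. Real spectra contain noise, which the division amplifies, so a broadened reconvolution function is normally needed in practice.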


### Fine structure modelling

Having had a qualitative look at the data, we can try to quantify some of these changes. We do this by making a model of the oxygen-K edge signal. Firstly we crop the signal, leaving only the oxygen-K edge (490 to 590 eV).

In [55]:
s.crop_signal1D()


As we've already removed the background, we set `auto_background=False`. In addition, since we haven't added any elements to the signal, the model contains no ionization edges.

In [56]:
m = s.create_model(ll=s_ll, auto_background=False)

So currently, the model does not contain any components

In [57]:
m.components

   # |      Attribute Name |      Component Name |      Component Type
---- | ------------------- | ------------------- | -------------------

We can try to model some of the fine structure with Gaussians

In [58]:
g1 = hs.model.components1D.Gaussian()

In [59]:
m.append(g1)

This added the Gaussian component to the model:

In [60]:
m.components

   # |      Attribute Name |      Component Name |      Component Type
---- | ------------------- | ------------------- | -------------------
   0 |            Gaussian |            Gaussian |            Gaussian

Then we can fit this Gaussian to the largest of the O-K peaks by dragging a span over the peak between 528 and 533 eV. Run it first with the "Only Current" option ticked, then run it without to fit the whole dataset

In [62]:
m.fit_component(g1)


Having fitted the Gaussian to the experimental data, we can plot how the three Gaussian parameters change over the line scan: A, sigma and centre. A changes quite a bit, which is probably related (among other things) to thickness changes. However, there are clear changes in the sigma parameter in the region with the electron beam damage.

In [None]:
g1.plot()
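
For reference, the three parameters enter the Gaussian as $f(x) = \frac{A}{\sigma\sqrt{2\pi}} \exp\left(-\frac{(x - \mathrm{centre})^2}{2\sigma^2}\right)$, so `A` is the area under the peak rather than its height. The sketch below evaluates this function and recovers the parameters from an isolated peak using simple moments (area, centroid and standard deviation). This moment-based estimate is only an illustration in plain Python; it is not the least-squares fit performed by `fit_component`, and the numbers are made up.

```python
import math

def gaussian(x, A, sigma, centre):
    """Area-normalised Gaussian: A is the integrated area, not the peak height."""
    return A / (sigma * math.sqrt(2.0 * math.pi)) * math.exp(-((x - centre) ** 2) / (2.0 * sigma ** 2))

def estimate_gaussian(xs, ys):
    """Moment-based estimate of (A, sigma, centre) for an isolated peak on a uniform axis."""
    step = xs[1] - xs[0]
    total = sum(ys)
    centre = sum(x * y for x, y in zip(xs, ys)) / total
    variance = sum((x - centre) ** 2 * y for x, y in zip(xs, ys)) / total
    return total * step, math.sqrt(variance), centre

# Synthetic peak (assumed values) on a 520 .. 540 eV axis with 0.1 eV channels.
xs = [520.0 + 0.1 * i for i in range(201)]
ys = [gaussian(x, 50.0, 1.2, 530.5) for x in xs]

A, sigma, centre = estimate_gaussian(xs, ys)
```

Since `A` is an area, a thicker region that scatters more electrons into the edge raises `A` without necessarily changing `sigma` or `centre`, which is why the width and position are the more telling parameters for the beam-damaged region.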

Using the same method we can also fit the second largest peak between 535 and 541 eV. Using the signal_range argument we don't have to select the region using the GUI.

In [64]:
g2 = hs.model.components1D.Gaussian()

In [65]:
m.append(g2)

In [66]:
m.fit_component(g2, signal_range=(535.,541.), only_current=False)


In [67]:
m.plot()


However, this time the final fit does not look very good. This is due to the two components being fitted independently of each other. We should fit both of them at the same time. Firstly, we have to set the `signal_range` which is where the model will fit to the experimental data. Here we select the region spanning the two major peaks (528-541 eV)

In [71]:
m.set_signal_range()


In [73]:
m.multifit()


After fitting, we reset the signal range so we can see the full range of the signal

In [74]:
m.reset_signal_range()

In [75]:
m.plot()
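
The reason the independent fits overshoot is that each peak sits on the tail of its neighbour, and a component fitted on its own absorbs that tail. The toy example below shows this with two overlapping peaks of known position and width, where only the two amplitudes are fitted: once separately in each peak's own window, and once jointly by solving the 2x2 least-squares normal equations. Fixing the centres and widths is a simplification that keeps the problem linear, and all names and numbers here are assumptions for illustration.

```python
import math

def peak(x, centre, sigma):
    return math.exp(-((x - centre) ** 2) / (2.0 * sigma ** 2))

def in_window(xs, values, lo, hi):
    return [v for x, v in zip(xs, values) if lo <= x <= hi]

def fit_one(ys, basis):
    """Least-squares amplitude of a single peak, ignoring any other component."""
    return sum(y * b for y, b in zip(ys, basis)) / sum(b * b for b in basis)

def fit_two(ys, b1, b2):
    """Least-squares amplitudes of two peaks fitted at the same time."""
    s11 = sum(a * a for a in b1)
    s22 = sum(b * b for b in b2)
    s12 = sum(a * b for a, b in zip(b1, b2))
    r1 = sum(y * a for y, a in zip(ys, b1))
    r2 = sum(y * b for y, b in zip(ys, b2))
    det = s11 * s22 - s12 * s12
    return (r1 * s22 - r2 * s12) / det, (r2 * s11 - r1 * s12) / det

xs = [525.0 + 0.1 * i for i in range(201)]  # 525 .. 545 eV
b1 = [peak(x, 530.5, 2.0) for x in xs]
b2 = [peak(x, 537.0, 2.5) for x in xs]
data = [10.0 * p + 6.0 * q for p, q in zip(b1, b2)]  # true amplitudes: 10 and 6

# Each peak fitted alone, inside its own window.
alone_1 = fit_one(in_window(xs, data, 528.0, 533.0), in_window(xs, b1, 528.0, 533.0))
alone_2 = fit_one(in_window(xs, data, 535.0, 541.0), in_window(xs, b2, 535.0, 541.0))
# Both peaks fitted together over the full range.
joint_1, joint_2 = fit_two(data, b1, b2)
```

The separate fits both come out too large, so their sum overshoots the data, whereas the joint fit returns the true amplitudes. This is the same effect the simultaneous `multifit` over 528-541 eV corrects for.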


Lastly, we can fit the small "pre-peak" as well. First we "lock" the two Gaussians we have already fitted.

In [76]:
g1.set_parameters_not_free()

In [77]:
g2.set_parameters_not_free()

Then we add another Gaussian, and fit it using `fit_component` between 522 and 527 eV

In [78]:
g3 = hs.model.components1D.Gaussian()

In [79]:
m.append(g3)

In [80]:
m.fit_component(g3, signal_range=(522., 527.), only_current=False)


Then we set the signal range to cover all the three peaks, from 520 eV to 541 eV

In [81]:
m.set_signal_range(520.,541.)

And set the g1 and g2 components free

In [82]:
g1.set_parameters_free()

In [83]:
g2.set_parameters_free()

In [84]:
m.multifit()


This fits all the three components to the experimental data, which hopefully gives a good fit

In [85]:
m.reset_signal_range()

In [86]:
m.plot()


We can then compare the different parameters in the components

In [87]:
g1_g3_ratio = g1.A.as_signal()/g3.A.as_signal()

In [88]:
g1_g3_ratio.plot()


In [89]:
g1_g3_position = g1.centre.as_signal()-g3.centre.as_signal()

In [90]:
g1_g3_position.plot()


In [91]:
g1_g3_sigma = g1.sigma.as_signal()/g3.sigma.as_signal()

In [92]:
g1_g3_sigma.plot()


In all of the comparisons there are some large changes in the regions with beam damage. However, the values can vary a great deal. This is most likely because the pre-peak, which g3 is fitted to, is not so clearly defined in these regions, leading to potentially bad fits.