-
Notifications
You must be signed in to change notification settings - Fork 9
/
whittaker_smooth.py
110 lines (88 loc) · 3.59 KB
/
whittaker_smooth.py
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
#!/usr/bin/env python3
# -*- coding: utf-8 -*-
"""
WHITTAKER-EILERS SMOOTHER in Python 3 using numpy and scipy
based on the work by Eilers [1].
[1] P. H. C. Eilers, "A perfect smoother",
Anal. Chem. 2003, (75), 3631-3636
coded by M. H. V. Werts (CNRS, France)
tested on Anaconda 64-bit (Python 3.6.4, numpy 1.14.0, scipy 1.0.0)
Read the license text at the end of this file before using this software.
Warm thanks go to Simon Bordeyne who pioneered a first (non-sparse) version
of the smoother in Python.
"""
import numpy as np
import scipy.sparse as sparse
from scipy.sparse.linalg import splu
def speyediff(N, d, format='csc'):
"""
(utility function)
Construct a d-th order sparse difference matrix based on
an initial N x N identity matrix
Final matrix (N-d) x N
"""
assert not (d < 0), "d must be non negative"
shape = (N-d, N)
diagonals = np.zeros(2*d + 1)
diagonals[d] = 1.
for i in range(d):
diff = diagonals[:-1] - diagonals[1:]
diagonals = diff
offsets = np.arange(d+1)
spmat = sparse.diags(diagonals, offsets, shape, format=format)
return spmat
def whittaker_smooth(y, lmbd, d = 2):
"""
Implementation of the Whittaker smoothing algorithm,
based on the work by Eilers [1].
[1] P. H. C. Eilers, "A perfect smoother", Anal. Chem. 2003, (75), 3631-3636
The larger 'lmbd', the smoother the data.
For smoothing of a complete data series, sampled at equal intervals
This implementation uses sparse matrices enabling high-speed processing
of large input vectors
---------
Arguments :
y : vector containing raw data
lmbd : parameter for the smoothing algorithm (roughness penalty)
d : order of the smoothing
---------
Returns :
z : vector of the smoothed data.
"""
m = len(y)
E = sparse.eye(m, format='csc')
D = speyediff(m, d, format='csc')
coefmat = E + lmbd * D.conj().T.dot(D)
z = splu(coefmat).solve(y)
return z
#Copyright M. H. V. Werts, 2017
#
#martinus point werts à ens-rennes point fr
#
#This software is a computer program whose purpose is to smooth noisy data.
#
#This software is governed by the CeCILL-B license under French law and
#abiding by the rules of distribution of free software. You can use,
#modify and/ or redistribute the software under the terms of the CeCILL-B
#license as circulated by CEA, CNRS and INRIA at the following URL
#"http://www.cecill.info".
#
#As a counterpart to the access to the source code and rights to copy,
#modify and redistribute granted by the license, users are provided only
#with a limited warranty and the software's author, the holder of the
#economic rights, and the successive licensors have only limited
#liability.
#
#In this respect, the user's attention is drawn to the risks associated
#with loading, using, modifying and/or developing or reproducing the
#software by the user in light of its specific status of free software,
#that may mean that it is complicated to manipulate, and that also
#therefore means that it is reserved for developers and experienced
#professionals having in-depth computer knowledge. Users are therefore
#encouraged to load and test the software's suitability as regards their
#requirements in conditions enabling the security of their systems and/or
#data to be ensured and, more generally, to use and operate it in the
#same conditions as regards security.
#
#The fact that you are presently reading this means that you have had
#knowledge of the CeCILL-B license and that you accept its terms.