Time profiling pull request #146

aelanman · 2017-04-21T01:40:20Z

Improvements in speed for read/write operations.

bhazelton · 2017-04-21T20:12:20Z

pyuvdata/miriad.py

-        if run_check:
-            self.check(run_check_acceptability=run_check_acceptability)
+#        if run_check:
+#            self.check(run_check_acceptability=run_check_acceptability)


Are you sure you want the check lines to be commented out? That seems like a bad idea.

Whoops... I commented those out in my local copy because the check was running slowly and I didn't want to address it at the time. I'll put them back and run the profiler again.

I don't think it'll break, since write_miriad also runs the acceptability check.

bhazelton · 2017-04-21T20:16:37Z

pyuvdata/uvdata.py

        for ind, jd in enumerate(self.time_array):
            # calculate ra/dec of phase center in current epoch
            obs.date, obs.epoch = self.juldate2ephem(jd), self.juldate2ephem(jd)
            precess_pos.compute(obs)
            ra, dec = precess_pos.a_ra, precess_pos.a_dec

            # generate rotation matrices
-            m0 = a.coord.top2eq_m(self.lst_array[ind] - obs.sidereal_time(), latitude)
+            m0 = a.coord.top2eq_m(self.lst_array[ind] - obs.sidereal_time(), latitude)    ##TODO This is a bottleneck because top2eq_m involves a matrix inversion. 


Do you have a plan to address this? I think marking it as a bottleneck is usefully informative, but the TODO label gets grabbed by some editors and should only be used if there's a plan to fix it.

What if I just remove the TODO label? I haven't found a way to address it yet, but maybe someone can? I have another line marked as a bottleneck in read_miriad().

I think that's perfect.

I don't have time to do it now, but one way to at least reduce the bottleneck would be to calculate m0 only for the unique times, rather than for all blts.
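A minimal sketch of the unique-times idea, not the code in this PR. The helper name `per_time_matrices` and the `make_matrix` callable are hypothetical stand-ins for whatever builds m0 from a time and its LST; the sketch also assumes that all blts sharing a time share the same LST.

```python
import numpy as np


def per_time_matrices(time_array, lst_array, make_matrix):
    """Build one 3x3 matrix per unique time, then expand back to all blts.

    make_matrix is a hypothetical callable: (jd, lst) -> (3, 3) array.
    """
    time_array = np.asarray(time_array)
    lst_array = np.asarray(lst_array)
    # first_idx: index of the first blt with each unique time.
    # inverse: for every blt, the position of its time in unique_times.
    unique_times, first_idx, inverse = np.unique(
        time_array, return_index=True, return_inverse=True
    )
    # The expensive call runs once per unique time instead of once per blt.
    unique_mats = np.array(
        [make_matrix(jd, lst_array[i]) for jd, i in zip(unique_times, first_idx)]
    )
    # Fancy indexing broadcasts the per-time matrices back to shape (Nblts, 3, 3).
    return unique_mats[np.ravel(inverse)]
```

With Ntimes much smaller than Nblts, the number of matrix constructions drops by roughly the number of baselines per time, at the cost of one `np.unique` call.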

The other thought I have on this is that we should consider using astropy rather than aipy (which I think uses pyephem under the hood).

Aipy doesn't use pyephem for this particular function. In fact, it's a rather simple function that just inverts eq2top_m, which simply writes out the matrix using sin and cos. I'd be tempted to write a small util function in pyuvdata that writes down the inverse directly, which is equally easy.
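A sketch of what writing the inverse down directly could look like. It assumes eq2top_m has the rotation-matrix form shown below, which should be checked against the installed aipy; both functions here are illustrative and are not the pyuvdata implementation. Because the matrix is a pure rotation, its inverse is its transpose, so no numerical inversion is needed.

```python
import numpy as np


def eq2top_m(ha, dec):
    """Equatorial -> topocentric rotation for hour angle ha and declination dec (radians).

    Assumed form; verify against the aipy version in use.
    """
    sin_h, cos_h = np.sin(ha), np.cos(ha)
    sin_d, cos_d = np.sin(dec), np.cos(dec)
    return np.array([
        [sin_h, cos_h, 0.0],
        [-sin_d * cos_h, sin_d * sin_h, cos_d],
        [cos_d * cos_h, -cos_d * sin_h, sin_d],
    ])


def top2eq_m(ha, dec):
    """Topocentric -> equatorial rotation: the transpose of eq2top_m, written out."""
    sin_h, cos_h = np.sin(ha), np.cos(ha)
    sin_d, cos_d = np.sin(dec), np.cos(dec)
    return np.array([
        [sin_h, -sin_d * cos_h, cos_d * cos_h],
        [cos_h, sin_d * sin_h, -cos_d * sin_h],
        [0.0, cos_d, sin_d],
    ])
```

A quick sanity check is to compare `top2eq_m(ha, dec)` against `np.linalg.inv(eq2top_m(ha, dec))` for a few angles; they should agree to floating-point precision.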

adampbeardsley · 2017-05-05T19:42:13Z

Note to self: before merging this branch, we need to check the phasing outputs against master to be sure we haven't changed them, because we don't have a valid test yet.

bhazelton · 2017-05-05T23:26:42Z

I think that when this PR is merged, we should put out a new pip package, which requires bumping the version number. Any opinions on whether it should be v1.2 or v1.1.1?

also pep8

adampbeardsley · 2017-05-10T17:20:42Z

Because this branch is getting close to merging, I went ahead and did a comparison of phasing with master. It's hard to make an actual unit test, so I just used master to read in a couple of files and did various phasing and unphasing operations, writing output files throughout. I then used this branch to repeat those operations and compared the results with the saved files at each step.

What I learned is that there are a couple places where vectorizing the arithmetic caused intermediate quantities to be cast as float32 instead of float64 (when using uvws, which are float32). This caused small errors in uvws, which were small enough that they passed our tolerance, but translated to unacceptable errors in the data. I was able to make a couple changes locally to make sure everything is cast as float64, and everything works.

This is making me think that we actually want the uvws always stored as float64. I'm going to start a new issue for this, but I don't think it should hold up this PR for the time being. I'll push my local changes, just to match master, but they should go away if we do eventually use float64.
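To illustrate the casting issue described above, here is a small standalone sketch. The uvw values and the scale factor are made up for the example and do not come from the test files. It shows that vectorized arithmetic on a float32 array stays in float32 unless the array is cast first.

```python
import numpy as np

# Hypothetical uvw values (meters), stored as float32 as in the discussion above.
uvw = np.array([[123.456, -78.9, 0.321]], dtype=np.float32)
scale = 1.0 / 3.0  # a plain Python float

# Vectorized arithmetic on a float32 array with a Python float stays float32.
scaled32 = uvw * scale

# Casting first keeps the intermediate result in float64.
scaled64 = uvw.astype(np.float64) * scale

# Difference introduced by doing the arithmetic in single precision.
max_diff = np.max(np.abs(scaled32.astype(np.float64) - scaled64))
print(scaled32.dtype, scaled64.dtype, max_diff)
```

The difference per operation is tiny relative to the uvw values, which is why it can pass a tolerance check on the uvws themselves while still mattering once it feeds into the phase applied to the data.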

aelanman requested review from jpober, bhazelton and adampbeardsley April 21, 2017 01:40

bhazelton reviewed Apr 21, 2017

aelanman and others added 23 commits May 9, 2017 15:29

Faster set_lsts_from_time_array and improvements to read_miriad

aa2adf3

Fixing errors from previous merge

4c515b9

Moved speed of light lookup

595c824

Fixes

c30aae1

Update

6dcb143

t_grid etc. vectorized

4ff4415

Removed double logical_and operation in loop

90b809f

Update

3164172

Improvement in read_miriad speed at cost of increased memory usage.

d7d7090

Faster set_lsts_from_time_array and improvements to read_miriad

7d11281

Moved speed of light lookup

8d1dc62

Fixes

e49c0d1

Update

094c56d

Removed double logical_and operation in loop

c7e39c2

Update

c0bf74c

Improvement in read_miriad speed at cost of increased memory usage.

00cad3d

Update

c8e046d

Slight improvement in phasing and write_miriad

60b432d

Remove profiling tags and identify bottleneck in phase()

9a604c4

Changed how we calculate t_grid etc in read_miriad, fixed some pep8

424437e

Addressing pull request comments.

50fb99a

created more efficient top2eq_m function

8dea9bd

Trying a potentially more efficient loop for phasing and unphasing

1a918ae

aelanman and others added 4 commits May 9, 2017 15:29

Attempt to address last read_miriad bottleneck

fb4de1c

Get the correct indices to sort the data array in read_miriad

93ae946

Adding line profiler results

4106931

Should have indices right now

45a030b

also pep8

adampbeardsley force-pushed the time_profiling branch from 7d33414 to 45a030b Compare May 9, 2017 22:30

Recast uvws to float64 for phasing to match master

4e1a6f9

adampbeardsley approved these changes May 11, 2017

adampbeardsley merged commit 910d76b into master May 11, 2017

bhazelton added the UVData label Jul 31, 2019
