
[WIP] HD 143006 part 2 #80

Merged — 56 commits merged into main from HD143006_Part_2_branch on Jul 31, 2021
Conversation

@iancze (Collaborator) commented Jun 18, 2021

This is a draft pull request to monitor the changes between this branch and the main branch, as well as provide a vehicle for commentary and review.

To update the file contents, continue to push committed changes to the branch, i.e. `git push origin HD143006_Part_2_branch`

RCF42 and others added 30 commits June 16, 2021 17:16
Changed log directory for first set of loss values
Figured out and added directions to examine functions, displayed tensorboard
Fixed error in calling trainable-- getting different error. I suspect it has to do with serialization of model
tune.run switches to a different directory that is created outside of the original working directory. Adjusted where file is created. Switched directory.

# ## Initializing Model with the Dirty Image
#
# We now have our model and data, but before we set out trying to optimize the image we should create a better starting point for our future optimization loops. A good idea for the starting point is the dirty image, since it is already a maximum likelihood fit to the data. The problem with this is that the dirty image contains negative flux pixels, while we impose the requirement that our sources must have all positive flux values. Our solution then is to optimize the RML model to become as close to the dirty image as possible (while retaining image positivity).
Contributor:

A "better" starting point relative to what? A blank/uniform image? I think we could make this clearer by saying we need to pick a starting point, which could be basically anything, and the dirty image is a logical choice. Rather than the current language which sort of implies we are changing the existing starting point to the dirty image instead.

Contributor:

I also think we should be careful with the language used at the end of this line. The current text says "Our solution then is to optimize the RML model to become as close to the dirty image as possible (while retaining image positivity)," which is true in this case because we're only enforcing image positivity, which is a very simple prior. In most cases, though, we're not trying to make an image as close to the dirty image as possible; we're trying to make a better image. I think it would be useful to rephrase, either emphasizing that image positivity is the only prior being considered here, or that we are trying to make the best possible image given the dirty image and the need for positive pixel values.

Contributor:

@briannazawadzki Are there cases when we don't initialize the model with the dirty image before beginning optimization for the best image? My understanding was that this section of the tutorial is initializing the model to the dirty image while retaining the image positivity prior, so we would want the dirty image to closely resemble the model at this point. Then later we take this as the starting point and begin to perform the actual optimization such that the model fits the visibilities but also takes into account any other priors so the model resembles what we'd expect the true image to be.

Contributor:

@hgrzy Oops, I'm sorry, I must not have my GitHub notifications configured correctly. Yes, your understanding is correct. The dirty image is being used as the starting point for the optimization loop. But sure, there could be cases where you don't want to do that. For example, you might want to see if you can still converge on a reasonable solution using the same priors but starting with a uniform blank image. It would probably take more iterations to converge than using the dirty image, since presumably the dirty image is already in the ballpark of what the final result will be, while a blank image needs a lot more adjusting.

My comment was mainly just being nitpicky about language for clarity.

Contributor:

@briannazawadzki No worries! Ah that makes sense. Thanks for bringing up that good point.

#


# To optimize the RML model toward the dirty image, we will create our training loop using a [loss function](../api.html#module-mpol.losses) and an [optimizer](https://pytorch.org/docs/stable/optim.html#module-torch.optim). MPoL and PyTorch both contain many optimizers and loss functions, each suited to different applications. Here we use PyTorch's [mean squared error function](https://pytorch.org/docs/stable/generated/torch.nn.MSELoss.html) between the RML model image pixel fluxes and the dirty image pixel fluxes.
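As a minimal sketch of this initialization step, the loop below fits a stand-in image model to a stand-in dirty image with MSE loss. The `ToyImageModel` class is illustrative only (it is not MPoL's actual model class); it maps learnable pixels through softplus so the rendered image stays strictly positive, mimicking the positivity constraint discussed above.

```python
import torch
from torch import nn

# Stand-in for an RML image model (illustrative; not MPoL's real class):
# learnable "base" pixels are passed through softplus so the rendered
# image is strictly positive, enforcing the positivity constraint.
class ToyImageModel(nn.Module):
    def __init__(self, npix=32):
        super().__init__()
        self.base = nn.Parameter(torch.zeros(npix, npix))

    def forward(self):
        return nn.functional.softplus(self.base)

torch.manual_seed(0)
# Stand-in "dirty image" with some negative pixels, as a real one has
dirty_image = torch.randn(32, 32)

model = ToyImageModel()
optimizer = torch.optim.Adam(model.parameters(), lr=0.5)
loss_fn = nn.MSELoss()

initial_loss = loss_fn(model(), dirty_image).item()
for _ in range(300):
    optimizer.zero_grad()
    loss = loss_fn(model(), dirty_image)
    loss.backward()
    optimizer.step()

# The model image is now as close to the dirty image as positivity allows:
# negative dirty-image pixels are approximated by values near zero.
final = model()
```

Note that the loss cannot reach zero: wherever the dirty image is negative, the best the positive model can do is approach zero flux.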
Contributor:

I would cite the optimization tutorial for more info about the training loop.

return test_score


# Now, with our functions defined, we need to do the critical step of dividing our dataset into training and test datasets. There are many ways to do this; here we split the data radially and azimuthally and withhold chunks. This is visualized in the [Cross Validation tutorial](crossvalidation.html).
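The radial/azimuthal chunking can be sketched with plain NumPy as below. This is illustrative only (MPoL provides a built-in Dartboard utility for this; the bin counts, coordinate ranges, and fold count here are arbitrary assumptions): each visibility is assigned to a polar "dartboard" cell, and whole cells are dealt into K folds so each fold's cells can be withheld as a test set.

```python
import numpy as np

rng = np.random.default_rng(42)

# Hypothetical gridded baseline coordinates (units arbitrary)
u = rng.uniform(-500, 500, 2000)
v = rng.uniform(-500, 500, 2000)

# Polar "dartboard" cells: radial rings x azimuthal wedges
q = np.hypot(u, v)        # baseline length
phi = np.arctan2(v, u)    # azimuth in [-pi, pi]
q_edges = np.linspace(0, 500 * np.sqrt(2), 9)  # 8 radial rings
phi_edges = np.linspace(-np.pi, np.pi, 9)      # 8 azimuthal wedges

# Index of the dartboard cell each visibility falls in
q_idx = np.clip(np.digitize(q, q_edges) - 1, 0, 7)
phi_idx = np.clip(np.digitize(phi, phi_edges) - 1, 0, 7)
cell = q_idx * 8 + phi_idx

# Shuffle the cells and deal them into K folds; each fold's cells
# are withheld as the test set in one cross-validation round
K = 5
unique_cells = rng.permutation(np.unique(cell))
folds = np.array_split(unique_cells, K)
test_masks = [np.isin(cell, f) for f in folds]
```

Withholding whole polar cells (rather than random individual points) removes spatially coherent chunks of the (u,v) plane, which is what makes the test a meaningful check of the model's ability to predict unsampled regions.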
Contributor:

You already cite the cross validation tutorial so maybe this isn't necessary, but maybe mention that MPoL's Dartboard is an easy built-in way to get the polar coord grid, and maybe very briefly mention why we are choosing to do it this way.

# With these set up, we can now make our training function (instead of a loop, we use a function here since the training loop will be run multiple times with different configurations). The hyperparameters, such as `epochs` and `lambda_TV`, are contained under `config`. Most of them are used in the loss functions and can be read about [here](../api.html#module-mpol.losses).
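A simplified stand-in sketch of such a training function is below. It is not the tutorial's actual implementation: `target` replaces the visibility dataset, the data term is plain MSE, and two hand-written stand-ins replace the regularizers in `mpol.losses`. What it does show is how the `lambda_*` entries of `config` weight each prior in the total loss.

```python
import torch
from torch import nn

def sparsity_loss(img):
    return img.abs().sum()  # L1 sparsity stand-in

def tv_loss(img):
    # total variation stand-in: absolute nearest-neighbor differences
    return ((img[1:, :] - img[:-1, :]).abs().sum()
            + (img[:, 1:] - img[:, :-1]).abs().sum())

class ToyImage(nn.Module):
    def __init__(self, npix=16):
        super().__init__()
        self.base = nn.Parameter(torch.zeros(npix, npix))

    def forward(self):
        return nn.functional.softplus(self.base)  # positive image

def train(model, target, optimizer, config):
    for _ in range(config["epochs"]):
        optimizer.zero_grad()
        img = model()
        loss = (nn.functional.mse_loss(img, target)             # data term
                + config["lambda_sparsity"] * sparsity_loss(img)
                + config["lambda_TV"] * tv_loss(img))
        loss.backward()
        optimizer.step()
    return loss.item()

torch.manual_seed(0)
target = torch.rand(16, 16)
model = ToyImage()
optimizer = torch.optim.Adam(model.parameters(), lr=0.1)
config = {"epochs": 200, "lambda_sparsity": 1e-4, "lambda_TV": 1e-4}
final_loss = train(model, target, optimizer, config)
```

Packing the hyperparameters into a single `config` dict is what lets the same function be rerun under many configurations, e.g. by a hyperparameter tuner.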


def train(model, dataset, optimizer, config, writer=None, logevery=50):
Contributor:

I'm a bit confused. Now you are introducing more priors than just the image positivity? I think you need to be more clear about what you're including, because earlier it seemed like you were saying that you were only using an image positivity prior.

Contributor:

I agree with this, I just feel like I don't have the knowledge to explain each of the priors here. Adding the lambda_sparsity prior affects the sparsity of the image, the lr affects how aggressively the model learns, and epochs determines the number of iterations the model trains for. I can assume that prior_intensity affects the strength of the entropy term, but I'm not quite sure what the prior entropy is. If someone doesn't mind explaining this and confirming what I said about the other priors, that would be very helpful.

Contributor:

Sure, I'll give my thoughts. I actually think it's okay if you don't fully explain all of the priors here, but I think it would be useful to explicitly state that now you are expanding beyond the initial optimization shown above, which only required image positivity.

The lambda terms affect the strength of the regularizing terms, but don't affect the form of the regularizer. They are basically just coefficients to weight each term. So sparsity, total variation (TV), and entropy are all different priors/regularizers that we add to the total loss term. I'm working on explaining each of them in my paper, but I haven't written everything yet. I think for the purposes of the tutorial it would be fine to just mention that you're now using multiple priors, and direct the reader to the API for more information on these losses.
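Concretely, the weighted-sum structure described here has the shape below (a sketch; the subscripts mirror the config hyperparameter names, and $L_{\rm nll}$ is the data likelihood term):

$$
L_{\rm total} = L_{\rm nll} + \lambda_{\rm sparsity}\,L_{\rm sparsity} + \lambda_{\rm TV}\,L_{\rm TV} + \lambda_{\rm entropy}\,L_{\rm entropy}
$$

Each $\lambda$ is a scalar prefactor controlling the strength of its regularizer; the functional form of each regularizer is fixed.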

@hgrzy hgrzy marked this pull request as draft July 1, 2021 20:28
@hgrzy hgrzy marked this pull request as ready for review July 2, 2021 20:32
@hgrzy (Contributor) commented Jul 2, 2021

@iancze Ready for another round of review!


# Here is where we define the hyperparameters under the `config` dictionary. Most of these hyperparameters, such as `lambda_TV` and `entropy`, correspond to the scalar prefactors $\lambda$ as described in the [intro to RML](rml_intro.html).

config = (
Collaborator (author):

Where do these config values come from? The text doesn't provide any context to why they might be set to the values that they are.
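For orientation, a dictionary of the shape the tutorial describes might look like the sketch below. This is purely illustrative: the keys follow the hyperparameter names used in the text, but the numerical values are placeholders, not the tuned values from the tutorial.

```python
# Illustrative only: keys follow the hyperparameters named above;
# the numerical values are placeholders, not tuned results.
config = {
    "lr": 0.3,                # optimizer learning rate
    "epochs": 500,            # number of training iterations
    "lambda_sparsity": 1e-4,  # weight of the sparsity prior
    "lambda_TV": 1e-4,        # weight of the total variation prior
    "entropy": 1e-4,          # weight of the entropy prior
    "prior_intensity": 1e-5,  # reference intensity for the entropy term
}
```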

@iancze (Collaborator, author) commented Jul 21, 2021

@trq5014 @hgrzy @RCF42 would it be possible to update the final CV example with some of the better-looking hyperparameter settings that you all found recently? It makes sense to show off our best image :)

After that, I think this looks ready to merge! Thanks for all of the hard work.

@trq5014 (Contributor) commented Jul 21, 2021

Yeah that can be done @iancze ! Do you want us to add the TSV prior in as well? Or should we stick with what we have?

@iancze (Collaborator, author) commented Jul 21, 2021

Might as well add in the TSV prior if it's needed to get the "best" image so far.

@iancze iancze merged commit ee52f09 into main Jul 31, 2021
@iancze iancze deleted the HD143006_Part_2_branch branch July 31, 2021 10:47
@iancze (Collaborator, author) commented Jul 31, 2021

Thanks for all the excellent work, everyone!
