Setting sample_weight in Poisson regression example

#### Describe the issue linked to the documentation


In the [Poisson regression and non-normal loss](https://scikit-learn.org/stable/auto_examples/linear_model/plot_poisson_regression_non_normal_loss.html#sphx-glr-auto-examples-linear-model-plot-poisson-regression-non-normal-loss-py) example, we set the sample weight to the exposure, when we divided the count data by the exposure. We had this discussion regarding this here: https://github.com/scikit-learn/scikit-learn/pull/14300/files#r386066958

When looking at the [reference paper (page 16)](https://papers.ssrn.com/sol3/papers.cfm?abstract_id=3164764) the example was based on, it handles this by using an offset:

```r
glm(formula = ClaimNb ~ VehPowerGLM + VehAgeGLM + DrivAgeGLM +
	BonusMalusGLM + VehBrand + VehGas + DensityGLM + Region +
 	AreaGLM, family = poisson(), data = learn, offset = log(Exposure))
```

Which I think is the same as:

<img width="663" alt="image" src="https://user-images.githubusercontent.com/5402633/89111606-dd4e5c00-d425-11ea-8111-326bde318edc.png">

where `l` is the exposure. In our example, the target has been already divided by the exposure. If we want to match the narrative by the paper, is the `sample_weight` required?

**Edit:** I guess we are treating 4 event in 8 years to have a higher weight than 1 event in 2 years.

CC @lorentzenchr @rth

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Setting sample_weight in Poisson regression example #18059

Describe the issue linked to the documentation

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Uh oh!

Setting sample_weight in Poisson regression example #18059

Description

Describe the issue linked to the documentation

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions