Describe the bug
The `mpol.losses.log_likelihood` routine calculates the likelihood as

$$\ln \mathcal{L} = \frac{1}{2} \sum_i \frac{(d_i - m_i)^2}{\sigma_i^2}$$

when it should actually be closer to

$$\ln \mathcal{L} = -\frac{1}{2} \sum_i \left[ \frac{(d_i - m_i)^2}{\sigma_i^2} + \ln \sigma_i^2 + \ln 2\pi \right]$$
(e.g., Deep Learning: Foundations and Concepts, Eqn 2.66).
There is at least one error: the
$$\sum_i \ln \sigma_i^2$$
term is missing (I think there may also be factors of 2 that remain slightly different between the complex-valued calculations in MPoL and the text, which assumes real-valued data only), and the source code is missing the overall negative sign.
This must have been a case of me being more tired than normal, since I think I've implemented this correctly in other codebases. This also implies the `mpol.losses.log_likelihood_gridded` routine is incorrect, since it calls `mpol.losses.log_likelihood`.
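For concreteness, a minimal NumPy sketch of the corrected quantity, with the $\sum_i \ln \sigma_i^2$ term and the leading negative sign restored. The function signature and the per-visibility bookkeeping (real and imaginary parts independent, each with variance $\sigma_i^2 = 1/w_i$) are assumptions of this sketch, not MPoL's actual API:

```python
import numpy as np

def log_likelihood(model_vis, data_vis, weight):
    """Sketch of the full Gaussian log likelihood for complex visibilities.

    Assumes weight_i = 1 / sigma_i**2, with the real and imaginary parts
    of each visibility independent and sharing variance sigma_i**2 (this
    is where the 'factor of 2' bookkeeping relative to the real-valued
    textbook expression enters).
    """
    resid = data_vis - model_vis
    # chi^2 over real and imaginary parts: sum_i w_i |V_i - M_i|^2
    chi2 = np.sum(weight * np.abs(resid) ** 2)
    n = data_vis.size
    # Each complex datum contributes -ln(2 pi sigma_i^2) of normalization:
    # the sum_i ln sigma_i^2 penalty (here +ln w_i), the ln(2 pi) constant,
    # and the overall negative sign the current code drops.
    return -0.5 * chi2 - n * np.log(2 * np.pi) + np.sum(np.log(weight))
```

With a perfect model and unit weights this reduces to the pure normalization term, $-N \ln 2\pi$, which is a quick sanity check on the sign and the constant.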
Moreover, other loss functions in `mpol.losses` are incorrectly named for the quantity they actually calculate:

- `mpol.losses.nll` does not actually calculate a negative log likelihood; it calculates a 'reduced' $\chi^2$, since it does not include the penalty for the weight values
- the same holds for `mpol.losses.nll_gridded`
Suggested fix
- correct `mpol.losses.log_likelihood` to calculate the correct quantity
- rename `mpol.losses.nll` and `mpol.losses.nll_gridded` to `mpol.losses.reduced_chi_squared` and `mpol.losses.reduced_chi_squared_gridded`, respectively
- add a `mpol.losses.log_likelihood_avg` routine that is the average of `mpol.losses.log_likelihood`. This is useful for cases where the weights may be adjusted (and thus the penalty factor is needed) and we are working with batches of different data sizes.
- recommend in the documentation that `mpol.losses.reduced_chi_squared` and `mpol.losses.reduced_chi_squared_gridded` are the default loss functions for RML imaging, and that the corrected `mpol.losses.log_likelihood` is the proper loss function for inference (e.g., MCMC)
- document the changes in the changelog
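The renamed and proposed routines could look roughly like the following. The names come from the list above; the exact normalization conventions (per complex visibility vs. per real degree of freedom) are assumptions of this sketch, not MPoL's settled behavior:

```python
import numpy as np

def reduced_chi_squared(model_vis, data_vis, weight):
    """Proposed rename of mpol.losses.nll: chi^2 per real degree of freedom.

    No ln sigma_i^2 (weight) penalty, so it is suitable as an RML loss
    but not as a log likelihood. Normalization convention is assumed here.
    """
    chi2 = np.sum(weight * np.abs(data_vis - model_vis) ** 2)
    return chi2 / (2 * data_vis.size)  # 2 real dof per complex visibility

def log_likelihood_avg(model_vis, data_vis, weight):
    """Proposed routine: full log likelihood averaged per visibility.

    Retains the ln sigma_i^2 penalty, so it stays meaningful when weights
    are adjusted, and the averaging makes batches of different sizes
    comparable.
    """
    n = data_vis.size
    chi2 = np.sum(weight * np.abs(data_vis - model_vis) ** 2)
    ln_like = -0.5 * chi2 - n * np.log(2 * np.pi) + np.sum(np.log(weight))
    return ln_like / n
```

For a perfect model, `reduced_chi_squared` returns 0 regardless of dataset size, while `log_likelihood_avg` returns the same per-visibility normalization for any batch size, which is the intercomparability property motivating the addition.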
Additional context
Recommend that we stay away from the `nll` name entirely, since it appears to be inconsistently defined in the broader ML context. Sometimes it is the negative log likelihood (i.e., the negative of Eqn 2.66), but more often than not it is some averaged or normalized quantity that does not include the contribution from the weights. These factors matter when building RML workflows and can make it very tricky to intercompare results from datasets of different sizes.
Downstream updates
When fixed, @briannazawadzki @jeffjennings will need to update their calls from `mpol.losses.nll_gridded` to `mpol.losses.reduced_chi_squared_gridded`.