Add tuples to the language by WardBrian · Pull Request #1100 · stan-dev/stanc3

WardBrian · 2022-01-26T18:06:17Z

This is a port of the changes @rybern made in #675 to master. I tried to first port this while preserving his comments, etc, and then made some changes myself to reflect the new status of things.

This will close stan-dev/stan#2431

Needs

Submission Checklist

Run unit tests
Documentation
- If a user-facing facing change was made, the documentation PR is here:

Release notes

Add Tuples to the language

Copyright and Licensing

By submitting this pull request, the copyright holder is agreeing to
license the submitted work under the BSD 3-clause license (https://opensource.org/licenses/BSD-3-Clause)

Continue work from Ryan PR Remainder of changes from Ryan's pr Small fixes Match existing test output

Projection is a bit more technical, but it mainly creates further distance from 'Indexed' for easier reading

WardBrian · 2022-01-26T18:09:33Z

@SteveBronder I'm handing this off to you. If you run into any questions feel free to ping/email me as I've gotten quite familiar with the state of things.

@rok-cesnovar once we're in the freeze you may want to take a look at what is needed to support tuples in assign()

The first thing that needs to happen is work in the code gen for transformations/IO - currently we hit failwiths there that prevent code generation from proceeding. Once we get most of that sorted we can start iterating more on the rest of the PR. I am pretty sure the front end stuff is in a decent spot.

andrjohns · 2022-02-11T00:25:37Z

Any chance this implementation will support tuples of user-defined functions? Not a request if it's a lot of work or anything, just curious

WardBrian · 2022-02-11T00:26:33Z

Do you mean as a return or argument type? Yes

andrjohns · 2022-02-11T03:53:37Z

As an argument type, so in stan pseudo-code:

real fun_a(real a) {
  ...
}

real fun_b(real b) {
  ...
}

other_function(..., make_tuple(fun_a, fun_b));

I'm prototyping an approach for user-defined gradients, and it assumes that a tuple of arguments and a tuple of gradient functions will be passed:

real value_fun(real x, real y) {
  return x * y;
}
real dx_fun(real x, real y) {
  return y;
}
real dy_fun(real x, real y) {
  return x;
}



user_gradients(make_tuple(a, b), value_fun, make_tuple(dx_fun, dy_fun));

But if a tuple of functions/functors isn't feasible, then I can make it variadic and split the functions out in the c++

WardBrian · 2022-02-11T13:28:12Z

I don’t think that would work completely out of the box, but it would be a minor change to generate a tuple of the functor structs in C++. I see no reason why such a thing wouldn’t be possible for a library function. For user defined functions, that signature would of course be impossible to define until we have a way of writing down a function’s type inside Stan

I have some more questions about your thinking on this but I’ll ask them on slack/not in this PR

WardBrian · 2023-06-16T16:20:23Z

Is that something we'd desire being possible?

I think we would need to have the requirement that if any container is used in the constraint, rightmost index must be included (e.g., you could make lowers2 5,6,3 where the final dimension is just rep_array(value_you_want, 3)). So while the syntax you mentioned may be impossible, I think something semantically equivalent would be implementable

WardBrian · 2023-06-16T16:29:26Z

The remaining uncertainties (marked TUPLE MAYBE) are all in the Memory Patterns optimizations, so I'd appreciate if @SteveBronder could take a closer look at those parts

nhuurre · 2023-06-16T16:43:12Z

Is that something we'd desire being possible?

Probably not but since it wasn't in the design doc it's hard know for sure. If the current requirement were that if any container is used in the constraint, all indexes must be included, that would be consistent with either as a future extension and also currently allow something semantically equivalent.

WardBrian · 2023-06-16T18:19:07Z

I'd be happy to open a PR revising the design doc if you think that is necessary for this PR to proceed. I think that the current implementation does not prevent us from doing (... something semantically equivalent to ..) what you've described in the future. The eventual situation would end up looking something like a two-part rule:

A tuple can have entries which are constrained by the same rules the containing type. So a tuple(real<lower=0>, ...), or tuple(array[10] real<lower=1>, ...), or tuple(array[10] real<lower={...}>, ...)
An array of tuples can have each array element have the same constraints, using the above syntax, or must be given a constraint which has dimensions that are the union of the "outer" array dimensions and any "inner" dimensions, so array[N] tuple(array[M] real<lower=X>, ...) is valid for X as a 1) scalar (currently allowed), 2 1-d array (currently allowed, must be of length M), 3) 2-d array ( not currently supported, but would require size N,M)

Regardless from whether the design would allow for it, I still have some reservations about actually allowing different tuples in the same array to have different internal constraints, but those can be saved for later discussions of that actual feature

nhuurre · 2023-06-16T20:47:17Z

Yes, I think updating the design doc is the best way to move forward.

This argument has been a bit weird because I think we both agree what the eventual future is going to be. Specifically, it is one where all of the following declarations are accepted:

data {
  int N, M, K;
  real L;
  array[K] real Lk;
  array[M,K] real Lmk;
  array[N,M,K] real Lnmk;
}
parameters {
  array[N,M] tuple(array[K] real<lower=L>, real) T;
  array[N,M] tuple(array[K] real<lower=Lk>, real) Tk;
  array[N,M] tuple(array[K] real<lower=Lmk>, real) Tmk;
  array[N,M] tuple(array[K] real<lower=Lnmk>, real) Tnmk;
}

And we also agree that this is not something to implement right now but a plan for the far future, likely requiring a new design doc, or might never even happen.

Where we disagree is what the first concrete step toward that hypothetical future should be.

You think arrays should stay homogeneous and Tk is the only one consistent with that
I think we should steer clear of the potential ambiguity of Tk and instead implement Tnmk

WardBrian · 2023-06-26T13:52:52Z

Design doc changes stan-dev/design-docs#50

I think your summary is fair. I will admit that, besides thinking that it is the right thing to do, the fact that homogeneous elements is easiest to implement (even easier than if we only allowed scalars at first) does weigh in to breaking the tie between the different ideas

WardBrian · 2023-07-18T19:14:52Z

Over this past weekend I did some fuzzing of this PR for the equivalent of 8 days (4 with —O1, 4 without). It found four crashes, 3 of which were the parsing issue fixed in d59e0e3 and the last was unrelated (#1336).

This isn’t the same as testing the generated C++ compiling obvious, but it does seem like we’ve resolved most/all of the internal exceptions which could have occurred (@nhuurre had already spotted a bunch manually before this)

SteveBronder

C++ Code changes look good except for a few little spots.

SteveBronder

Ty! I looked over the rest of the code and read a lot of the C++ output and that looks good to me. @nhuurre unless you have any objections I think we are ready to merge!

nhuurre

No objections.

WardBrian added 8 commits January 26, 2022 12:43

Changes based on Ryans pr

98fe379

Continue work from Ryan PR Remainder of changes from Ryan's pr Small fixes Match existing test output

Rename IndexedTuple to TupleProjection

71eec60

Projection is a bit more technical, but it mainly creates further distance from 'Indexed' for easier reading

Comments and tweaks

955ed35

More tweaks, compile trivial model

1483c40

Update parser.messages for existing tests

c6102ce

Improve typechecking, printing

fe84e11

Add Ryan's test/good folder

c3077ef

Dune promote

6d6a712

WardBrian added the big-exciting-project Large projects that we're excited about but might take a long time and possibly not end up working o label Jan 26, 2022

WardBrian assigned SteveBronder, rok-cesnovar and WardBrian Jan 26, 2022

WardBrian marked this pull request as draft January 26, 2022 18:09

WardBrian linked an issue Feb 3, 2022 that may be closed by this pull request

How should tuples be emitted by the --info option? #820

Closed

WardBrian added 2 commits February 8, 2022 09:32

Merge branch 'master' into tuple-redo

dc8fd48

Add a few comments

14a71fc

WardBrian mentioned this pull request Feb 8, 2022

Clean up pretty printing of array[] #1113

Merged

3 tasks

WardBrian mentioned this pull request Feb 11, 2022

Support std::tuple types in assign() stan-dev/stan#3100

Closed

WardBrian and others added 7 commits February 18, 2022 10:16

Merge branch 'master' into tuple-redo

5c5ebab

Merge branch 'master' into tuple-redo

f9beb87

does data reads for tuples

938489a

Fix some levels of the tuple errors and generate the tests output

21a719d

merge master

1089d58

Merge branch 'master' into tuples-redo

dc3a7fc

Merge branch 'master' into tuple-redo

48f945c

nhuurre reviewed Jun 17, 2023

View reviewed changes

Comment thread src/analysis_and_optimization/Mir_utils.ml Outdated

WardBrian and others added 8 commits June 26, 2023 14:44

Further tweaks to MIR_utils

d5cb0b6

format

6fb9e22

Merge branch 'master' into tuple-redo

a544739

Dune promote

85d2a96

Merge branch 'master' into tuple-redo

004431d

Fix #1334 in tuple case as well

ee56a8c

Fix adtype in function inlining

95a2e7a

Fix crashes found in fuzzing

d59e0e3

WardBrian mentioned this pull request Jul 17, 2023

Handle leading zeros in scientific notation literals #1336

Merged

3 tasks

WardBrian added 4 commits July 18, 2023 15:39

Merge branch 'master' into tuple-redo

bc89150

dune promote

ff150e9

cleanup

4e1f467

Dune promote

6c3037d

SteveBronder requested changes Jul 19, 2023

View reviewed changes

Comment thread test/integration/good/compiler-optimizations/cppO1.expected Outdated

Comment thread test/integration/good/compiler-optimizations/mem_patterns/cpp.expected

Comment thread test/integration/good/compiler-optimizations/cppO1.expected Outdated

WardBrian added 2 commits July 20, 2023 09:53

Remove unnecessary *1 in sizes

ac34648

Selectively strip metadata to prevent current_location statements

2889a6a

WardBrian requested a review from SteveBronder July 20, 2023 15:05

SteveBronder approved these changes Jul 20, 2023

View reviewed changes

nhuurre approved these changes Jul 21, 2023

View reviewed changes

WardBrian mentioned this pull request Jul 21, 2023

Document Tuple types in the language stan-dev/docs#656

Closed

WardBrian merged commit 054d554 into master Jul 21, 2023

WardBrian deleted the tuple-redo branch October 25, 2023 20:36

Uh oh!

Conversation

WardBrian commented Jan 26, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Needs

Submission Checklist

Release notes

Copyright and Licensing

Uh oh!

WardBrian commented Jan 26, 2022

Uh oh!

andrjohns commented Feb 11, 2022

Uh oh!

WardBrian commented Feb 11, 2022

Uh oh!

andrjohns commented Feb 11, 2022

Uh oh!

WardBrian commented Feb 11, 2022

Uh oh!

WardBrian commented Jun 16, 2023

Uh oh!

WardBrian commented Jun 16, 2023

Uh oh!

nhuurre commented Jun 16, 2023

Uh oh!

WardBrian commented Jun 16, 2023

Uh oh!

nhuurre commented Jun 16, 2023

Uh oh!

Uh oh!

WardBrian commented Jun 26, 2023

Uh oh!

WardBrian commented Jul 18, 2023

Uh oh!

SteveBronder left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

SteveBronder left a comment

Choose a reason for hiding this comment

Uh oh!

nhuurre left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

9 participants

WardBrian commented Jan 26, 2022 •

edited

Loading