A full knobs-and-dials buffering strategy transformation #61
Conversation
…ests for custom buffer types
```swift
public actor AsyncLimitBuffer<Element: Sendable>: AsyncBuffer {
  public enum Policy: Sendable {
    case unbounded
```
If this winds up being the easiest choice to make for clients, we could be leading them down the path of unbounded memory growth as well.
I can remove the default for that policy; I was just mirroring AsyncStream.
```swift
case .bufferingNewest(let limit):
  if buffer.count < limit {
    buffer.append(element)
  } else if buffer.count > 0 {
```
This check seems superfluous.
Unless the limit is 0, which could be valid for a buffer, right?
If the limit is 0 (which seems weird to me, see below), then you will never have put anything in the buffer ever, and buffer.count will always be 0. The condition is still superfluous. Mathematically, the only time this condition matters is if both buffer.count and limit could be negative.
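For illustration, here is a minimal standalone sketch of that simplification (the function name and the surrounding storage are assumptions, not the PR's code): with a positive limit, reaching the else branch implies the buffer already holds `limit` elements, so no extra count check is needed.

```swift
// Hypothetical sketch, not the PR's implementation: with limit > 0, the else
// branch can only be reached when the buffer already holds `limit` elements.
func pushNewest<Element>(_ element: Element, into buffer: inout [Element], limit: Int) {
  if buffer.count < limit {
    buffer.append(element)
  } else {
    buffer.removeFirst()   // safe here: buffer.count == limit, and limit > 0
    buffer.append(element)
  }
}
```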
```swift
let policy: Policy

init(policy: Policy) {
  self.policy = policy
```
For bufferingOldest and bufferingNewest we should have a precondition(limit > 0).
Should it be precondition(limit > 0) or precondition(limit >= 0)?
What's supposed to happen with limit == 0? Do all values get dropped? 0 is weird, because .bufferingNewest(0) and .bufferingOldest(0) seem like they would be equivalent.
Yeah, a buffer of 0 seems silly; you should just remove the buffer in that case.
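A minimal sketch of the suggested guard, assuming the Policy shape shown earlier (the surrounding type is hypothetical):

```swift
// Hypothetical sketch of the suggested precondition; mirrors the Policy shape
// shown above but is not the PR's code.
enum Policy: Sendable {
  case unbounded
  case bufferingOldest(Int)
  case bufferingNewest(Int)
}

struct BufferConfiguration {
  let policy: Policy

  init(policy: Policy) {
    switch policy {
    case .bufferingOldest(let limit), .bufferingNewest(let limit):
      // A limit of 0 means every value would be dropped, which is better
      // expressed by not inserting a buffer at all, so reject it up front.
      precondition(limit > 0, "limit must be greater than 0")
    case .unbounded:
      break
    }
    self.policy = policy
  }
}
```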
```swift
  func update(_ apply: @Sendable (inout T) -> Void) async {
    apply(&value)
  }
}
```
This doesn't appear to be used.
Yeah, the whole type is no longer used; I will remove it.
```swift
    break
  }
}
```
We just discussed this in person, but while this looks better on the surface and seems to behave better in your testing, I still have to think very carefully about the location of the suspension points and the possibilities of actor interleaving messing with the state machine in a manner not too dissimilar from the previous ManagedCriticalState approach.
Agreed, it definitely needs more testing to validate it.
Unfortunately, I did find at least one instance where actor interleaving disrupts the state machine as written, putting the sequence into an undesirable state. I'm working on a fix, but it requires careful examination of the current state at every entry point and after every suspension point to the iterator's actor, since suspension can allow another call into the actor to proceed and therefore change the actor's state.
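As a general illustration of the hazard (unrelated to the buffer's actual types): every await inside an actor method is a suspension point, other calls can run against the actor while it is suspended, and any state observed before the await has to be re-checked afterwards.

```swift
// Generic illustration of actor reentrancy; not the PR's code.
actor Counter {
  var value = 0

  func incrementSlowly() async {
    let observed = value
    // Suspension point: other calls into this actor may run here and
    // mutate `value` before this method resumes.
    try? await Task.sleep(nanoseconds: 1_000_000)
    // `value` may no longer equal `observed`; assuming otherwise is exactly
    // the kind of interleaving bug described above.
    value = observed + 1
  }
}
```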
```swift
extension AsyncSequence where Element: Sendable {
  public func buffer<Buffer: AsyncBuffer>(_ createBuffer: @Sendable @escaping () -> Buffer) -> AsyncBufferSequence<Self, Buffer> where Buffer.Input == Element {
    AsyncBufferSequence(self, createBuffer: createBuffer)
  }
```
Overall, I like the shape of this, and the amount of utility available here is very high.
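For example, usage might look like the following (the AsyncLimitBuffer initializer and its access level are assumptions based on the snippets in this thread):

```swift
// Hypothetical usage of the buffer(_:) extension shown above.
func consume() async {
  let base = AsyncStream<Int> { continuation in
    for i in 0..<100 { continuation.yield(i) }
    continuation.finish()
  }

  // Assumes AsyncLimitBuffer exposes an initializer taking a Policy.
  let buffered = base.buffer {
    AsyncLimitBuffer<Int>(policy: .bufferingNewest(10))
  }

  for await value in buffered {
    print(value)
  }
}
```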
```swift
associatedtype Input: Sendable
associatedtype Output: Sendable

func push(_ element: Input) async
```
Something that occurred to me: an async push() function essentially provides a backpressure mechanism, yeah? Could we, for instance, have a type of buffer that accepts a certain number of items, but then when full, stashes away a continuation in push() that only gets resumed on pop()?
Yes, that is possible (a bit more complicated). Are you thinking of it having some sort of limit there too, or does that become unnecessary?
Yeah, I'm thinking about an additional Policy value that is something like .bufferingWithBackpressure(Int) (terrible name) that happily collects upstream values while there's available space, making them quickly available to any less-hot downstream consumers. But instead of dropping oldest or newest values, it uses a continuation to exert backpressure on the Task that is consuming items from the base iterator.
The problem with that is that we would need some sort of external mechanism to push, because if the push is awaiting a continuation, the pop would never be able to enter (IIUC, for the actor model). It is worth exploring as another option.
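A rough sketch of the general shape being discussed, independent of the PR's protocol (whether a suspended push composes cleanly with the actor-based transformation is exactly the open question raised here):

```swift
// Hypothetical sketch of a backpressure-exerting buffer; not the PR's design.
// For simplicity it assumes a single producer, so only one continuation is
// ever parked at a time.
actor BackpressureBuffer<Element: Sendable> {
  private var buffer = [Element]()
  private let limit: Int
  private var parkedPush: CheckedContinuation<Void, Never>?

  init(limit: Int) {
    precondition(limit > 0)
    self.limit = limit
  }

  func push(_ element: Element) async {
    if buffer.count >= limit {
      // Full: suspend the producer until a consumer makes room via pop().
      await withCheckedContinuation { continuation in
        parkedPush = continuation
      }
    }
    buffer.append(element)
  }

  func pop() -> Element? {
    guard !buffer.isEmpty else { return nil }
    let element = buffer.removeFirst()
    if let continuation = parkedPush {
      // Clear state before resuming, then let the producer continue.
      parkedPush = nil
      continuation.resume()
    }
    return element
  }
}
```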
…interleaving issues. Fixed error/termination handling. Validation diagram tests, using “delay next” operators
I think there is still some more work to be done here: more extensive testing, and perhaps new default buffer policies, including "throw when full" and "exert backpressure".
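For the "throw when full" idea, a minimal sketch might look like this (independent of the AsyncBuffer protocol's exact requirements, which are only partially visible in this diff):

```swift
// Hypothetical sketch of a "throw when full" policy; the error type and
// push/pop shape are assumptions, not the PR's API.
struct BufferFullError: Error {}

actor ThrowingLimitBuffer<Element: Sendable> {
  private var buffer = [Element]()
  private let limit: Int

  init(limit: Int) {
    precondition(limit > 0)
    self.limit = limit
  }

  func push(_ element: Element) throws {
    guard buffer.count < limit else {
      // Instead of silently dropping oldest or newest values, surface the
      // overflow to the consumer as an error.
      throw BufferFullError()
    }
    buffer.append(element)
  }

  func pop() -> Element? {
    buffer.isEmpty ? nil : buffer.removeFirst()
  }
}
```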
```swift
switch state {
case .idle(let iterator, let createBuffer):
  let bufferState = AsyncBufferState<Base.Element, Buffer.Output>()
  let buffer = Active(Envelope(iterator), buffer: createBuffer(), state: bufferState)
```
After looking at it, I think the envelope thing is a bad idea; actors really should have Sendable requirements for their initialization parameters. That means the buffer must require the base type's iterator to be Sendable.
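As a generic illustration of that constraint (not the PR's types), requiring Sendable on the value handed to the actor's initializer avoids smuggling a non-Sendable iterator across in a wrapper:

```swift
// Generic illustration only: the actor's initialization parameter carries an
// explicit Sendable requirement rather than being wrapped in an envelope type.
actor IteratorDriver<Iterator: AsyncIteratorProtocol & Sendable> where Iterator.Element: Sendable {
  private var iterator: Iterator

  init(_ iterator: Iterator) {
    self.iterator = iterator
  }

  func next() async throws -> Iterator.Element? {
    try await iterator.next()
  }
}
```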
… conditional Sendable conformances
```swift
for continuation in pending {
  continuation.resume(returning: .success(nil))
}
pending = []
```
You might want to set pending to empty before resuming, by the way; that way we don't run the risk of bad states (granted, here I think you are fine).
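A minimal sketch of the suggested ordering (the continuation's result type is an assumption based on the snippet above):

```swift
// Hypothetical sketch: clear the stored continuations before resuming them,
// so any re-entrant call triggered by a resume sees already-emptied state.
func drainPending(_ pending: inout [CheckedContinuation<Result<Int?, Error>, Never>]) {
  let continuations = pending
  pending = []
  for continuation in continuations {
    continuation.resume(returning: .success(nil))
  }
}
```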
This allows for full customization of buffers with simple actor-based isolation.
It needs considerably more tests, but this approach will allow us to not only offer more customization but also more extensive testing.
Some neat characteristics: the AsyncBuffer protocol can be implemented with an actor if that makes sense, and it participates in rethrowing (so if folks want to throw when the buffer is full they can do so easily). But the sneakiest part here is that the buffer has an input type and an output type. That last part means that we can buffer, say, characters and emit strings, or do things like buffer up strings and concatenate them, etc.
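For instance, a minimal sketch of the input/output idea, mirroring the push/pop shape visible in the diff (the exact protocol requirements are assumed, so this does not declare conformance):

```swift
// Hypothetical sketch of Input != Output: buffer Characters, emit Strings.
// It mirrors the associatedtype Input/Output and async push shape shown in
// the diff, but does not claim to match the protocol's full requirements.
actor LineAssemblingBuffer {
  typealias Input = Character
  typealias Output = String

  private var pending = ""

  func push(_ element: Character) async {
    pending.append(element)
  }

  // Emits a complete line once a newline has been buffered, otherwise nil.
  func pop() async -> String? {
    guard let newlineIndex = pending.firstIndex(of: "\n") else { return nil }
    let line = String(pending[..<newlineIndex])
    pending.removeSubrange(...newlineIndex)
    return line
  }
}
```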