[Channel] re-implement using a state machine #235

twittemb · 2022-11-20T23:45:26Z

Hi

This PR suggests a new implementation for AsyncChannel. The idea was to:

streamline the implementation using the same paradigm (state machine/storage) as the recently remade operators (merge, zip, ...)
share some code between AsyncChannel and AsyncThrowingChannel since the state machine is the same
eventually improve the performance (I've seen a 20% increase in throughput)

Some remarks:

I've added a throughput measurement
I think there is a bug in the old/current implementation of AsyncChannel because when I tried to run it with the new throughput measurement I had a crash from time to time
with this new implementation we could imagine to split the producer from the consumer (same as what @FranzBusch pitched here https://forums.swift.org/t/pitch-convenience-async-throwing-stream-makestream-methods/61030 for AsyncStream). We just have to reference the same storage from an hypothetical AsyncChannel.Continuation and the AsyncChannel it self.

@phausler @FranzBusch what do you think ?

phausler · 2022-11-21T03:32:43Z

The perf sounds fantastic! I will make sure to go over this with a fine tooth comb Monday.

twittemb · 2022-11-21T14:15:15Z

For the record, I've used ad-hoc properties (hidden behind #if DEBUG instructions) so we can test the suspend/resume mechanism more precisely, particularly for task cancellation. I know this is not an ideal solution but I can't see anything better right now.

I guess this is a part of the resolution of #148

FranzBusch · 2022-11-21T19:51:16Z

Currently on vacation but will try to give this a look soon! I think @phausler has some ideas around testing that doesn’t involve scattering debug only code.

eaigner · 2022-12-16T17:55:48Z

Just FYI, your bug is probably that you increment generation += 1. In swift this does not automatically overflow, but will crash if you reach Int.max.

phausler · 2022-12-16T18:03:42Z

Cancellation complicates things a decent amount; accounting for all the states and their cancellation counterparts ends up being roughly similar. The advantage with state machines is that the states are well known about which transition it can be and explicitly calls those out - so for subtile state manipulation that can cross boundaries of tasks it is useful to have a bit more # of lines of code (without much perf impact) to build known state transition clarity.

FranzBusch · 2022-12-18T14:46:03Z

Agree with @phausler here LoC is not a good measurement here. AsyncChannel has some inherent complexities and state machines just surface the various transition edges way better.

FranzBusch

Left some comments here! In general, I like the switch to a state machine it makes it very clear what is going on

Sources/AsyncAlgorithms/Channels/AsyncChannel.swift

Sources/AsyncAlgorithms/Channels/ChannelStorage.swift

Sources/AsyncAlgorithms/Channels/ChannelStateMachine.swift

Sources/AsyncAlgorithms/Channels/ChannelStorage.swift

twittemb · 2022-12-18T15:09:36Z

Left some comments here! In general, I like the switch to a state machine it makes it very clear what is going on

Thanks for your review … I‘ll check it ASAP

twittemb · 2022-12-24T14:50:42Z

Left some comments here! In general, I like the switch to a state machine it makes it very clear what is going on

Hi @FranzBusch I've pushed a version with pretty much all the comments addressed.

FranzBusch

Looks really good already. Left some more comments inline but we are getting close here! Thanks for all the work

Sources/AsyncAlgorithms/Channels/AsyncChannel.swift

Sources/AsyncAlgorithms/Channels/AsyncThrowingChannel.swift

Sources/AsyncAlgorithms/Channels/ChannelStateMachine.swift

FranzBusch · 2023-01-11T10:07:20Z

Sources/AsyncAlgorithms/Channels/ChannelStateMachine.swift

+      case .terminated(.finished):
+        return .resumeConsumer(element: nil)
+
+      case .terminated(.failed(let error)):


I don't think that this is aligned with the rule that next() should throw an error once and after that it should return nil. This code right now throws the same error over and over again. Implementing the expected behaviour is going to get interesting since it means that we have to keep track of each iterator and what they saw. Maybe we can achieve this by creating a single ID for an iterator that we keep inside the iterator and use for every next() call on the state machine instead of generating an ID per next call.

It is not exactly what happens. When the state is .finished(.failure) and there are some suspended consumers then they all receive the failure and the state is set to .terminated(.finished). The subsequent call to next will receive “nil”.

@FranzBusch, @phausler before making any changes I'd like to be sure of the expected behaviour. I think that what I have now is aligned with the previous implementation of AsyncChannel.

Maybe I am misreading the code here as well. If the behaviour is the same then it should be good!

this is the unit test that checks that:

func test_asyncThrowingChannel_resumes_producers_and_discards_additional_elements_when_fail_is_called() async throws { // Given: an AsyncThrowingChannel let sut = AsyncThrowingChannel<Int, Error>() // Given: 2 suspended send operations let task1 = Task { await sut.send(1) } let task2 = Task { await sut.send(2) } // When: failing the channel sut.fail(Failure()) // Then: the send operations are resumed _ = await (task1.value, task2.value) // When: sending an extra value await sut.send(3) // Then: the send operation is resumed // Then: the iteration is resumed with a failure var collected = [Int]() do { for try await element in sut { collected.append(element) } } catch { XCTAssertTrue(collected.isEmpty) XCTAssertEqual(error as? Failure, Failure()) } // When: requesting a next value var iterator = sut.makeAsyncIterator() let pastFailure = try await iterator.next() // Then: the past failure is nil XCTAssertNil(pastFailure) }

yep

func test_asyncThrowingChannel_resumes_consumers_when_fail_is_called() async throws { // Given: an AsyncThrowingChannel let sut = AsyncThrowingChannel<Int, Error>() // Given: 2 suspended iterations let task1 = Task<Int?, Error> { var iterator = sut.makeAsyncIterator() return try await iterator.next() } let task2 = Task<Int?, Error> { var iterator = sut.makeAsyncIterator() return try await iterator.next() } // When: failing the channel sut.fail(Failure()) // Then: the iterations are resumed with the error do { _ = try await (task1.value, task2.value) } catch { XCTAssertEqual(error as? Failure, Failure()) } // When: requesting a next value var iterator = sut.makeAsyncIterator() let pastFailure = try await iterator.next() // Then: the past failure is nil XCTAssertNil(pastFailure) }

[UPDATE] although I could improve it a bit by asserting both task1 and task2 will fail independently ! hold on ...

Hm that's not really testing the same. I would like to see that test but with the same iterator calling next twice. Like this

let task1 = Task<Int?, Error> { var iterator = sut.makeAsyncIterator() try await iterator.next() // Need to catch the values of both next and return them in an array to assert try await iterator.next() }

ok.

Is that ok?

func test_asyncThrowingChannel_resumes_consumer_when_fail_is_called() async throws { // Given: an AsyncThrowingChannel let sut = AsyncThrowingChannel<Int, Error>() // Given: suspended iteration let task = Task<Int?, Error> { var iterator = sut.makeAsyncIterator() do { _ = try await iterator.next() } catch { XCTAssertEqual(error as? Failure, Failure()) } return try await iterator.next() } // When: failing the channel sut.fail(Failure()) // Then: the iterations are resumed with the error and the next element is nil do { let collected = try await task.value XCTAssertNil(collected) } catch { XCTFail("The task should not fail, the past failure element should be nil, not a failure.") } }

I think almost, just one slight change

func test_asyncThrowingChannel_resumes_consumer_when_fail_is_called() async throws { // Given: an AsyncThrowingChannel let sut = AsyncThrowingChannel<Int, Error>() // Given: suspended iteration let task = Task<Int?, Error> { var iterator = sut.makeAsyncIterator() do { _ = try await iterator.next() XCTFail("We expect the above call to throw") } catch { XCTAssertEqual(error as? Failure, Failure()) } return try await iterator.next() } // When: failing the channel sut.fail(Failure()) // Then: the iterations are resumed with the error and the next element is nil do { let collected = try await task.value XCTAssertNil(collected) } catch { XCTFail("The task should not fail, the past failure element should be nil, not a failure.") } }

Would be great if you could add that test

I had to do a few tweaks in the state machine to make it pass ... nice catch.
I've pushed the updated version.

twittemb · 2023-01-11T10:12:47Z

Looks really good already. Left some more comments inline but we are getting close here! Thanks for all the work

Thanks for the review. I'll address the comments later today or tomorrow.

phausler

Overall this looks pretty good; let me know when you are ready to land it.

Sources/AsyncAlgorithms/Channels/ChannelStorage.swift

Sources/AsyncAlgorithms/Channels/ChannelStateMachine.swift

Sources/AsyncAlgorithms/Channels/ChannelStorage.swift

twittemb · 2023-01-11T18:21:25Z

Overall this looks pretty good; let me know when you are ready to land it.

Thanks for the review, I think I'll push something tomorrow.

twittemb · 2023-01-12T09:51:31Z

Overall this looks pretty good; let me know when you are ready to land it.

I've addressed all the comments (minus one for the throwing behaviour).

In the end I can confirm a 20% increase in perfs + a more reliable implementation (the current one crashes from time to time during the throughput measurement)

FranzBusch

One last nit, but looks good to me now! Thanks for all the work!

Tests/AsyncAlgorithmsTests/TestChannel.swift

twittemb · 2023-01-12T12:54:54Z

One last nit, but looks good to me now! Thanks for all the work!

Thanks a lot for your review.

@phausler ready to merge 👍.

twittemb mentioned this pull request Nov 20, 2022

Add a proposal for AsyncChannel #216

Merged

twittemb closed this Dec 16, 2022

twittemb reopened this Dec 16, 2022

FranzBusch reviewed Dec 18, 2022

View reviewed changes

twittemb force-pushed the feature/async-channel branch 4 times, most recently from e4edbf0 to 1fcb631 Compare December 24, 2022 14:41

Keithbreadley approved these changes Jan 4, 2023

View reviewed changes

twittemb force-pushed the feature/async-channel branch from 1fcb631 to de3eb0f Compare January 10, 2023 09:43

twittemb requested review from FranzBusch and phausler and removed request for FranzBusch and phausler January 10, 2023 09:43

FranzBusch requested changes Jan 11, 2023

View reviewed changes

phausler approved these changes Jan 11, 2023

View reviewed changes

twittemb force-pushed the feature/async-channel branch from d7239ba to c52e06d Compare January 11, 2023 19:05

twittemb force-pushed the feature/async-channel branch from c52e06d to c67b934 Compare January 12, 2023 09:41

twittemb force-pushed the feature/async-channel branch from c67b934 to 446e0a2 Compare January 12, 2023 10:28

twittemb requested a review from FranzBusch January 12, 2023 10:53

twittemb force-pushed the feature/async-channel branch from 446e0a2 to 6afcc8c Compare January 12, 2023 11:19

FranzBusch approved these changes Jan 12, 2023

View reviewed changes

Tests/AsyncAlgorithmsTests/TestChannel.swift Outdated Show resolved Hide resolved

channel: re-implement with a state machine

3b66bb6

twittemb force-pushed the feature/async-channel branch from 6afcc8c to 3b66bb6 Compare January 12, 2023 12:54

phausler merged commit 0ebc805 into apple:main Jan 12, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Channel] re-implement using a state machine #235

[Channel] re-implement using a state machine #235

twittemb commented Nov 20, 2022 •

edited

Loading

phausler commented Nov 21, 2022

twittemb commented Nov 21, 2022 •

edited

Loading

FranzBusch commented Nov 21, 2022

eaigner commented Dec 16, 2022

phausler commented Dec 16, 2022

FranzBusch commented Dec 18, 2022

FranzBusch left a comment

twittemb commented Dec 18, 2022

twittemb commented Dec 24, 2022

FranzBusch left a comment

FranzBusch Jan 11, 2023

twittemb Jan 11, 2023

twittemb Jan 12, 2023

FranzBusch Jan 12, 2023

twittemb Jan 12, 2023

twittemb Jan 12, 2023 •

edited

Loading

FranzBusch Jan 12, 2023

twittemb Jan 12, 2023

FranzBusch Jan 12, 2023

twittemb Jan 12, 2023 •

edited

Loading

twittemb commented Jan 11, 2023 •

edited

Loading

phausler left a comment

twittemb commented Jan 11, 2023

twittemb commented Jan 12, 2023 •

edited

Loading

FranzBusch left a comment

twittemb commented Jan 12, 2023 •

edited

Loading

[Channel] re-implement using a state machine #235

[Channel] re-implement using a state machine #235

Conversation

twittemb commented Nov 20, 2022 • edited Loading

phausler commented Nov 21, 2022

twittemb commented Nov 21, 2022 • edited Loading

FranzBusch commented Nov 21, 2022

eaigner commented Dec 16, 2022

phausler commented Dec 16, 2022

FranzBusch commented Dec 18, 2022

FranzBusch left a comment

Choose a reason for hiding this comment

twittemb commented Dec 18, 2022

twittemb commented Dec 24, 2022

FranzBusch left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

twittemb Jan 12, 2023 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

twittemb Jan 12, 2023 • edited Loading

Choose a reason for hiding this comment

twittemb commented Jan 11, 2023 • edited Loading

phausler left a comment

Choose a reason for hiding this comment

twittemb commented Jan 11, 2023

twittemb commented Jan 12, 2023 • edited Loading

FranzBusch left a comment

Choose a reason for hiding this comment

twittemb commented Jan 12, 2023 • edited Loading

twittemb commented Nov 20, 2022 •

edited

Loading

twittemb commented Nov 21, 2022 •

edited

Loading

twittemb Jan 12, 2023 •

edited

Loading

twittemb Jan 12, 2023 •

edited

Loading

twittemb commented Jan 11, 2023 •

edited

Loading

twittemb commented Jan 12, 2023 •

edited

Loading

twittemb commented Jan 12, 2023 •

edited

Loading