pipeline doc - #14414

Rick-Anderson · 2019-09-13T02:33:49Z

Internal review URLs

System.IO.Pipelines

Todo

Hook up TOC
add metadata
UE Pass

Fixes #7516

mairaw

Some quick comments to get started

docs/standard/io/Buffers.md

Rick-Anderson · 2019-09-13T19:23:23Z

@mairaw I haven't started on the buffer doc yet. I've made some changes to pipeline.

Rick-Anderson · 2019-09-13T21:39:07Z

@BillWagner see Pipe basic usage

Would you be able to create a project with the code in this PR (from this sample down)? I'm having trouble getting it to work. I can then do snippet references, and customers can have code they can download/test/modify.

If you're too busy I could ask @KPixel to migrate the code.

davidfowl · 2019-09-13T23:17:20Z

I'll own fixing the code samples, that's not a problem.

davidfowl · 2019-09-13T23:28:40Z

cc @pakrym @jkotalik @halter73

Rick-Anderson · 2019-09-14T00:58:20Z

@davidfowl really fantastic info I'm anxious to get published. If @pakrym | @jkotalik | @halter73 could put the code in a repo that will speed up importing it.

No hurry as I'm doing this after my ASP.NET Core 3.0 migration work.

idg10 · 2019-09-16T06:31:07Z

docs/standard/io/Buffers.md

+}
+```
+
+The above method searches each segment for a specific byte. If we need to keep track of each segment's `SequencePosition` then [`ReadOnlySequence.TryGet`](https://docs.microsoft.com/en-us/dotnet/api/system.buffers.readonlysequence-1.tryget?view=netcore-3.0) would be more appropriate. Let's change the above code to return a `SequencePosition` instead of an integer. This has the added benefit of allowing the caller to avoid a second scan to get the data at a specific index.


It took me a little while to work out what this meant:

"allowing the caller to avoid a second scan"

There are two kinds of scans going on here: the Span<byte>.IndexOf calls, and then the scan through the segments in the sequence. My mental model was that the bulk of the scan-like work here is the IndexOf because that's going to touch every byte until it finds what it's looking for. (Also, so far in my work with ReadOnlySequence<T>, which I've been using in conjunction with pipelines, the majority have been single-segment, with multi-segment examples only appearing as you approach the end of a buffer. I've found my app gets better throughput with largish buffer sizes, ensuring that most of the time I'm in the single-segment case, so I naturally tend to think of that one as the hot path that's most perf-critical.)

So maybe qualify this as "...a second scan through the sequence's segments to get..."?
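
For anyone following this thread, here is a minimal sketch of the `SequencePosition`-returning variant being discussed. It assumes C# 9 top-level statements; `PositionOf` is an illustrative name, not necessarily the one used in the doc. The `IndexOf` call still touches every byte, and what the caller avoids is walking the segments a second time, because the returned position can be passed straight to `Slice`.

```csharp
using System;
using System.Buffers;

// Illustrative helper: returns the position of the first matching byte, or null.
static SequencePosition? PositionOf(in ReadOnlySequence<byte> buffer, byte data)
{
    SequencePosition position = buffer.Start;
    SequencePosition segmentStart = position;

    // TryGet advances 'position' to the start of the next segment on each call.
    while (buffer.TryGet(ref position, out ReadOnlyMemory<byte> memory))
    {
        int index = memory.Span.IndexOf(data);
        if (index != -1)
        {
            // Translate the in-segment index into a position in the sequence.
            return buffer.GetPosition(index, segmentStart);
        }
        segmentStart = position;
    }
    return null;
}

// Single-segment sequence for brevity; the loop handles multiple segments too.
var sequence = new ReadOnlySequence<byte>(new byte[] { 1, 2, 3, 4 });
SequencePosition? found = PositionOf(sequence, 3);

// No second walk over the segments: slice directly from the returned position.
ReadOnlySequence<byte> rest = sequence.Slice(found.Value);
```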

idg10 · 2019-09-16T06:50:48Z

docs/standard/io/Pipelines.md

+
+* `PipeWriter.GetMemory(int)` is called to get memory from the underlying writer.
+* `PipeWriter.Advance(int)` is called to tell the `PipeWriter` how much data was written to the buffer.
+* `PipeWriter.FlushAsync` is called to make the data available to the `PipeReader`.


As a reader, a question I have at this point is: why are these separate operations?

In your example, the call to Advance is followed immediately (as long as an exception didn't occur) by a call to FlushAsync. This suggests that an alternative design was possible, in which these were rolled into a single method. But presumably there's a reason you didn't do that—there must be some scenario in which you might want to call Advance more than once in between calls to FlushAsync? But what is that scenario? As it stands, I don't understand this aspect of the design.

Buffering. You can GetMemory()/GetSpan and Advance(..) over and over then when you have buffered up the appropriate message/amount you call FlushAsync.

An example https://github.com/aspnet/AspNetCore/blob/7b0b2980dc6ffb860f2bcd71868d98d85c7d35b6/src/Http/Http.Abstractions/src/Extensions/HttpResponseWritingExtensions.cs#L125-L143

Makes sense. I don't think this is very discoverable right now, so I'm wondering if it would be helpful to add a short description of when this might be something you'd want to do. I think in the app where I'm using this I don't need to, but it's hard to be certain from the docs: my read loop reads fairly large blocks of data off disk and writes them into the pipe, and I flush every time. But with the socket example you've given, I don't actually know how to judge whether the "flush on every iteration" approach in the code you've shown would be right in any particular scenario. If I happen to know that I'm receiving from a source that sends data in small dribs and drabs (not a hypothetical question for me, by the way), should I be waiting until I've "buffered up the appropriate message/amount"?

It's not useful when bytes are being transferred from one source to the other. In that scenario the writer has no idea what the bytes are and can't make sense of how much should be buffered before flushing.

There are cases like serialization that take in an IBufferWriter<byte> or when you're directly writing via the PipeWriter where you can choose to write an "entire message" not pieces of a message.

Here's an example:

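For a simple length-prefixed message, you could imagine writing code like this:

    async Task WriteMessage(PipeWriter writer, byte[] payload)
    {
        // Write the 4-byte length prefix. The Write extension method calls
        // GetSpan/Advance itself, so no separate Advance call is needed.
        writer.Write(GetLength(payload));

        // Write the payload. It is buffered but not yet visible to the reader.
        writer.Write(payload);

        // A single flush makes the whole message available to the PipeReader.
        await writer.FlushAsync();
    }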
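
To make the "several `Advance` calls, one `FlushAsync`" pattern concrete, here is a small self-contained sketch. It assumes C# 9 top-level statements and a reference to System.IO.Pipelines; `BufferMessage` and the little-endian prefix format are made up for illustration. Each part of the message gets its own `GetSpan`/`Advance` pair, and nothing reaches the reader until the single flush.

```csharp
using System;
using System.Buffers;
using System.Buffers.Binary;
using System.IO.Pipelines;

// Hypothetical helper: buffers one length-prefixed message without flushing.
static void BufferMessage(IBufferWriter<byte> writer, byte[] payload)
{
    // First GetSpan/Advance pair: a 4-byte little-endian length prefix.
    Span<byte> header = writer.GetSpan(4);
    BinaryPrimitives.WriteInt32LittleEndian(header, payload.Length);
    writer.Advance(4);

    // Second pair: the payload itself (skipped for an empty payload).
    if (payload.Length > 0)
    {
        Span<byte> body = writer.GetSpan(payload.Length);
        payload.CopyTo(body);
        writer.Advance(payload.Length);
    }
}

var pipe = new Pipe();
BufferMessage(pipe.Writer, new byte[] { 1, 2, 3 });

// One flush publishes the prefix and the payload together.
await pipe.Writer.FlushAsync();

ReadResult result = await pipe.Reader.ReadAsync();
long available = result.Buffer.Length; // 4-byte prefix + 3-byte payload = 7
pipe.Reader.AdvanceTo(result.Buffer.End);
```

The reader never observes a prefix without its payload here, which is the main reason to delay the flush when the writer knows where message boundaries are.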

jkotalik · 2019-09-17T17:14:02Z

docs/standard/io/Buffers.md

+
+The preceding method requests a buffer of at least 5 bytes from the `IBufferWriter<byte>` using `GetSpan(5)` then writes bytes for the ASCII string "Hello" to the returned `Span<byte>`. It then calls `Advance(written)` to indicate how many bytes were written to the buffer. 
+
+This method of writing uses the `Memory<T>`/`Span<T>` buffer provided by the `IBufferWriter<T>`, but you can also use the [`Write`](https://docs.microsoft.com/en-us/dotnet/api/system.buffers.buffersextensions.write?view=netstandard-2.1) extension method to copy an existing buffer to the `IBufferWriter<T>`. `Write` does the work of calling `GetSpan`/`Advance` as appropriate, so there's no need to call `Advance` after writing.


We should have links to the API doc for GetSpan/GetMemory/Advance. Ex: https://docs.microsoft.com/en-us/dotnet/api/system.buffers.ibufferwriter-1.getmemory?view=netstandard-2.1#System_Buffers_IBufferWriter_1_GetMemory_System_Int32_
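
For context, the two writing styles described in the quoted text can be sketched as below. This assumes C# 9 top-level statements and uses `ArrayBufferWriter<byte>` only as a convenient in-memory `IBufferWriter<byte>`; the method names are illustrative.

```csharp
using System;
using System.Buffers;
using System.Text;

// Style 1: write into the buffer handed out by the writer, then call Advance.
static void WriteHello(IBufferWriter<byte> writer)
{
    // Request at least 5 bytes; the returned span may be larger.
    Span<byte> span = writer.GetSpan(5);
    ReadOnlySpan<char> helloSpan = "Hello".AsSpan();
    int written = Encoding.ASCII.GetBytes(helloSpan, span);

    // Tell the writer how many bytes were actually written.
    writer.Advance(written);
}

// Style 2: copy an existing buffer with the Write extension method.
static void WriteHelloWithExtension(IBufferWriter<byte> writer)
{
    byte[] helloBytes = Encoding.ASCII.GetBytes("Hello");

    // Write calls GetSpan/Advance internally, so Advance is not called here.
    writer.Write(helloBytes);
}

var output = new ArrayBufferWriter<byte>();
WriteHello(output);
WriteHelloWithExtension(output);
string text = Encoding.ASCII.GetString(output.WrittenSpan); // "HelloHello"
```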

PLEASE HOLD COMMENTS UNTIL IT'S READY FOR REVIEW

😆

Yeah, I stopped as soon as I saw that. Oops!

NP. This is taking a back seat to the ASP.NET Core 3.0 release so I'll be slow doing updates. At some point I'll need one of you to get the bulk of the pipe code into a working project on a Git repo. That way I can migrate the code snippets to this doc and not have embedded snippets.

…into ra/buffers/pipelines

docs/standard/io/pipelines.md

halter73 · 2019-10-10T03:17:13Z

Is there any mention of completing Pipes with an exception? Given the overall depth of this doc, that seems like a pretty big omission to me.

davidfowl · 2019-10-10T03:19:55Z

I’ll add it in the future. I want to get this in since it’s the first big conceptual doc. But you’re right. That is missing.

All of the changes that don’t have suggested edits will happen later.

Co-Authored-By: Stephen Halter <[email protected]>

jkotalik · 2019-10-10T05:26:31Z

docs/standard/io/pipelines.md

+
+* The entire message (end of line) might not be received in a single call to `ReadAsync`.
+* It's ignoring the result of `stream.ReadAsync`. `stream.ReadAsync` returns how much data was read.
+* It doesn't handle the case where multiple lines are read in a single `ReadAsync` call.


I'd also consider mentioning that the method is always allocating a byte array each time we read data.

Added

It allocates a byte array with each read.
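
As a point of comparison, here is a rough sketch of a plain `Stream` read loop that avoids the four problems listed above, without using pipelines. It is not the doc's sample: `ReadLinesAsync`, the deliberately tiny 4-byte buffer, and the byte-by-byte scan are assumptions chosen to keep the example short (C# 9 top-level statements assumed).

```csharp
using System;
using System.Collections.Generic;
using System.IO;
using System.Text;
using System.Threading.Tasks;

// Hypothetical helper: splits a stream into '\n'-terminated lines.
static async Task<List<string>> ReadLinesAsync(Stream stream)
{
    var lines = new List<string>();
    var pending = new MemoryStream(); // bytes of a line that is not terminated yet
    byte[] buffer = new byte[4];      // allocated once and reused; tiny to force partial reads

    int bytesRead;
    // Honor the return value: only the first bytesRead bytes are valid.
    while ((bytesRead = await stream.ReadAsync(buffer, 0, buffer.Length)) > 0)
    {
        for (int i = 0; i < bytesRead; i++)
        {
            if (buffer[i] == (byte)'\n')
            {
                // A read may contain several lines, or only part of one.
                lines.Add(Encoding.UTF8.GetString(pending.GetBuffer(), 0, (int)pending.Length));
                pending.SetLength(0);
            }
            else
            {
                pending.WriteByte(buffer[i]);
            }
        }
    }
    return lines;
}

var input = new MemoryStream(Encoding.UTF8.GetBytes("ab\ncdefg\nh\n"));
List<string> parsed = await ReadLinesAsync(input);
```

Even this small version needs its own bookkeeping for partial lines, which is the kind of boilerplate the pipelines API is meant to remove.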

jkotalik · 2019-10-10T05:33:12Z

docs/standard/io/pipelines.md

+* Returns an incomplete `ValueTask<FlushResult>` when the amount of data in the `Pipe` crosses `PauseWriterThreshold`.
+* Completes `ValueTask<FlushResult>` when it becomes lower than `ResumeWriterThreshold`.
+
+Two values are used to prevent rapid cycling, which can occur if one value is used.


I'd give an example of the rapid cycling if the same value is used for the threshold.

@jkotalik please suggest verbiage.
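
One possible illustration, offered as a sketch rather than final wording: with a single threshold of 10 bytes, a writer paused at 10 would resume as soon as the reader consumed one byte (9 buffered), pause again after writing one more byte, and so on, so the two sides would wake each other for tiny amounts of work. With separate values, the writer stays paused until the reader has drained well below the pause point. The thresholds of 10 and 5 below are chosen only for the demo (C# 9 top-level statements and a System.IO.Pipelines reference assumed).

```csharp
using System;
using System.Buffers;
using System.IO.Pipelines;
using System.Threading.Tasks;

var pipe = new Pipe(new PipeOptions(pauseWriterThreshold: 10, resumeWriterThreshold: 5));

// 20 buffered bytes cross the pause threshold, so the flush does not complete.
pipe.Writer.Write(new byte[20]);
ValueTask<FlushResult> flush = pipe.Writer.FlushAsync();
bool pausedAfterFlush = !flush.IsCompleted;

// Consume 12 bytes. 8 remain: below the pause threshold (10) but not below
// the resume threshold (5), so the writer is still held back.
ReadResult read = await pipe.Reader.ReadAsync();
pipe.Reader.AdvanceTo(read.Buffer.GetPosition(12));
bool stillPausedAt8 = !flush.IsCompleted;

// Consume 5 more bytes. 3 remain, which is below the resume threshold.
read = await pipe.Reader.ReadAsync();
pipe.Reader.AdvanceTo(read.Buffer.GetPosition(5));
bool resumedAt3 = flush.IsCompleted;

await flush;
```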

jkotalik · 2019-10-10T05:38:23Z

docs/standard/io/pipelines.md

+
+## Streams
+
+When reading or writing stream data, you typically read data using a de-serializer and write data using a serializer. Most of these read and write stream APIs have a `Stream` parameter. To make it easier to integrate with these existing APIs, `PipeReader` and `PipeWriter` expose an <xref:System.IO.Pipelines.PipeReader.AsStream%2A>.  <xref:System.IO.Pipelines.PipeWriter.AsStream%2A> returns a `Stream` implementation around the `PipeReader` or `PipeWriter`.


Also mention that there are PipeReader.Create and PipeWriter.Create methods to convert a stream into a PipeReader/PipeWriter.

This is on me. I need to expand on the Stream interop with examples. I'll do this in a different PR
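
Until that follow-up lands, a short sketch of both directions of the interop mentioned here may help reviewers. It assumes C# 9 top-level statements and System.IO.Pipelines 4.6 or later; the variable names and the `MemoryStream` source are illustrative only.

```csharp
using System;
using System.IO;
using System.IO.Pipelines;
using System.Text;

// Stream -> PipeReader: wrap an existing stream with PipeReader.Create.
var source = new MemoryStream(Encoding.UTF8.GetBytes("hello"));
PipeReader reader = PipeReader.Create(source);
ReadResult result = await reader.ReadAsync();
long bytesFromStream = result.Buffer.Length;
reader.AdvanceTo(result.Buffer.End);
await reader.CompleteAsync();

// PipeWriter/PipeReader -> Stream: use AsStream with Stream-based APIs.
var pipe = new Pipe();
Stream writeSide = pipe.Writer.AsStream();
byte[] data = Encoding.UTF8.GetBytes("hello");
await writeSide.WriteAsync(data, 0, data.Length);

// Completing the writer lets the reading side observe end of stream.
await pipe.Writer.CompleteAsync();

using var textReader = new StreamReader(pipe.Reader.AsStream());
string text = await textReader.ReadToEndAsync();
```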

docs/standard/io/pipelines.md

halter73 suggestions Co-Authored-By: Stephen Halter <[email protected]>

Rick-Anderson · 2019-10-11T03:40:07Z

@davidfowl should I format the code/comments to minimize the horizontal scroll bars on a tablet? You can simulate a tablet on bigger displays by making the widest window without a left side and right side TOC.
For example, Read a single message

davidfowl · 2019-10-11T03:58:17Z

Yes we can wrap the comments and should align the code by the parameters

davidfowl

LGTM

mairaw · 2019-10-11T23:06:23Z

Feel free to squash and merge this when you're ready, @Rick-Anderson. I imagine you can do the comment wrapping separately, since it's a nice improvement but not blocking, right?

davidfowl · 2019-10-12T00:23:05Z

Where's the real URL?

Rick-Anderson · 2019-10-12T01:20:44Z

@mairaw @BillWagner when do you merge master into live? Let me know so I can tweet this before @davidfowl does.

mairaw · 2019-10-12T01:53:27Z

Usually by the end of my day shift 😆

Rick Anderson added 2 commits September 12, 2019 19:33

Fowlers pipeline doc

c8db8cf

work

68ef48d

mairaw reviewed Sep 13, 2019

work

3d6e9e3

work

e150a39

idg10 reviewed Sep 16, 2019

Rick-Anderson changed the title ~~Fowlers pipeline doc~~ DRAFT: pipeline doc - PLEASE HOLD COMMENTS UNTIL IT'S READY FOR REVIEW Sep 16, 2019

jkotalik reviewed Sep 17, 2019

analogrelay mentioned this pull request Sep 18, 2019

Add doc for what's new in ASP.NET Core 3.0 dotnet/AspNetCore.Docs#14250

Merged

8 tasks

Rick Anderson added 15 commits September 26, 2019 12:37

work

9bce945

work

2a79b2e

work

3a9c607

Delete Pipes.AssemblyInfo.cs

0b60f41

Delete Pipes.AssemblyInfoInputs.cache

9b88ca0

Delete project.assets.json

27824fd

Delete Pipes.csproj.nuget.g.targets

6a57a49

Delete Pipes.csproj.nuget.dgspec.json

9a8c686

Delete Pipes.csproj.nuget.cache

f078039

Delete Pipes.csproj.nuget.g.props

87a242d

work

6536470

Merge branch 'ra/buffers/pipelines' of https://github.com/dotnet/docs …

a72d48e

…into ra/buffers/pipelines

work

9cdaf6f

work

743969c

work

42ade7c

halter73 reviewed Oct 10, 2019

halter73 reviewed Oct 10, 2019

Rick Anderson and others added 3 commits October 9, 2019 17:33

Apply suggestions from code review

33024f9

Co-Authored-By: Stephen Halter <[email protected]>

add halter73 suggestions

e193e70

add halter73 suggestions

075ac6c

jkotalik reviewed Oct 10, 2019

halter73 reviewed Oct 10, 2019

Thraka added the rerun-labels label Oct 10, 2019

dotnet-bot added 📚 Area - .NET Guide and removed rerun-labels labels Oct 10, 2019

Rick Anderson and others added 2 commits October 10, 2019 11:32

Apply suggestions from code review

fb9cc9d

halter73 suggestions Co-Authored-By: Stephen Halter <[email protected]>

add halter73 suggestions

49c96a6

halter73 mentioned this pull request Oct 11, 2019

Pipe Complete() -> CompleteAsync() dotnet/samples#1624

Merged

add halter73 suggestions

59b9a16

davidfowl approved these changes Oct 11, 2019

halter73 approved these changes Oct 11, 2019

Rick-Anderson merged commit a607439 into master Oct 12, 2019

Rick-Anderson deleted the ra/buffers/pipelines branch October 12, 2019 00:05

Rick-Anderson mentioned this pull request Oct 12, 2019

Format pipeline code #15126

Closed

Rick-Anderson mentioned this pull request Nov 6, 2019

Review Memory<T> and Span<T> and pipeline doc dotnet/AspNetCore.Docs#11823

Closed

