K4os.Streams

Description

The need for this library was triggered by a project which used MemoryStream a lot and I was told by memory profiler that is very heavy on memory allocation.

I was aware that RecyclableMemoryStream exists but I wanted something lighter (the question if I succeeded is a different matter, lol).

There are two (so far) stream implementations in this library: ResizingByteBufferStream and ChunkedByteBufferStream. Both of them are using ArrayPool<byte> but ResizingByteBufferStream stores data in one (potentially) large array (the same approach as MemoryStream) while ChunkedByteBufferStream stores data in a list of chunks.

Measuring performance

Measuring performance if form of magic and it is very hard to get objective numbers.

It is hard to measure performance, because lot of it depends on usage patterns.

Are you using small or large streams? Do they stay in memory for long? Do you read/write them in small or large chunks? What are the thresholds for certain actions (like resizing or chunking)? Do you measure it just before threshold or just after?

Let's say we measure a data structure which rebuilds itself around 1024 elements. You measure performance at 1023 and it might the best, you measure at 1025 and it is 20% behind all other competitors.

What I measured was continuous writing (no Seek) of small chunks (1K) and then continuous reading but in bigger chunks (8K). This was based on usage pattern where I was building a json payload from data (small Writes) and then sending them over network (bigger Reads).

Note, I think I already notices that RecyclableMemoryStream prefer larger chunks, so YMMV.

All measurements were done using:

BenchmarkDotNet=v0.13.5, OS=Windows 11 (10.0.22621.1848/22H2/2022Update/SunValley2)
AMD Ryzen 5 3600, 1 CPU, 12 logical and 6 physical cores
.NET SDK=6.0.410
  [Host]     : .NET 6.0.18 (6.0.1823.26907), X64 RyuJIT AVX2
  DefaultJob : .NET 6.0.18 (6.0.1823.26907), X64 RyuJIT AVX2

NOTE: in first column names of streams has been shortened to fit in table:

Name	Actual class
MemoryStream	`MemoryStream` from `System.IO`
RecyclableStream	`RecyclableMemoryStream` from Microsoft.IO.RecyclableMemoryStream
ResizingStream	`ResizingByteBufferStream` from `K4os.Streams`
ChunkedStream	`ChunkedByteBufferStream` from `K4os.Streams`

Small streams (128B - 64KB)

Method	Length	Mean	Ratio	Gen0	Gen1
MemoryStream	128	51.95 ns	1.00	0.0411	-
RecyclableStream 💩	128	278.25 ns	5.36	0.0324	-
ResizingStream 🏆	128	44.52 ns	0.86	0.0401	-
ChunkedStream 👍	128	46.08 ns	0.89	0.0421	-

MemoryStream	1024	101.99 ns	1.00	0.1329	0.0005
RecyclableStream 💩	1024	312.58 ns	3.06	0.0324	-
ResizingStream 🏆	1024	85.31 ns	0.79	0.0067	-
ChunkedStream 👍	1024	90.07 ns	0.88	0.0086	-

MemoryStream 💩	8192	972.6 ns	1.00	1.8539	0.0668
RecyclableStream	8192	627.3 ns	0.64	0.0324	-
ResizingStream 👍	8192	503.8 ns	0.52	0.0067	-
ChunkedStream 🏆	8192	476.3 ns	0.49	0.0086	-

MemoryStream 💩	65336	7,328.8 ns	1.00	15.5029	3.8681
RecyclableStream 🏆	65336	3,460.7 ns	0.47	0.0305	-
ResizingStream 👍	65336	3,664.8 ns	0.50	0.0038	-
ChunkedStream 👍	65336	3,705.0 ns	0.51	0.0076	-

Medium streams (128KB - 8MB)

Method	Length	Mean	Ratio	Gen0	Gen1	Gen2
MemoryStream 💩	131072	60.229 us	1.00	41.6260	41.6260	41.6260
RecyclableStream 🏆	131072	6.554 us	0.11	0.0305	-	-
ResizingStream	131072	7.403 us	0.12	-	-	-
ChunkedStream 👍	131072	6.836 us	0.11	0.0458	-	-

MemoryStream 💩	1048576	770.487 us	1.00	499.0234	499.0234	499.0234
RecyclableStream 👍	1048576	52.645 us	0.07	0.0610	-	-
ResizingStream	1048576	60.258 us	0.08	-	-	-
ChunkedStream 🏆	1048576	46.239 us	0.06	-	-	-

MemoryStream 💩	8388608	7,484.830 us	1.00	742.1875	742.1875	742.1875
RecyclableStream 👍	8388608	439.533 us	0.06	2.4414	-	-
ResizingStream	8388608	1,543.618 us	0.22	-	-	-
ChunkedStream 🏆	8388608	380.532 us	0.05	-	-	-

Large streams (128MB - 512MB)

Method	Length	Mean	Ratio	Gen0	Gen1	Gen2
MemoryStream 💩	134217728	123.99 ms	1.00	4800.0000	4800.0000	4800.0000
RecyclableStream 👍	134217728	28.94 ms	0.23	500.0000	31.2500	-
ResizingStream	134217728	41.55 ms	0.33	-	-	-
ChunkedStream 🏆	134217728	28.85 ms	0.23	125.0000	125.0000	125.0000

MemoryStream 💩	536870912	753.93 ms	1.00	6000.0000	6000.0000	6000.0000
RecyclableStream 👍	536870912	138.75 ms	0.18	8000.0000	800.0000	-
ResizingStream	536870912	163.87 ms	0.20	-	-	-
ChunkedStream 🏆	536870912	136.63 ms	0.18	-	-	-

Observations

ResizingByteBufferStream is the fastest for small streams
ChunkedByteBufferStream is not much worse in small stream range, but shines in medium and large streams
RecyclableMemoryStream has quite a lot of overhead, that's why it's 5x slower than MemoryStream for tiny streams
RecyclableMemoryStream is very good for medium and large stream
MemoryStream is the kind of good for tiny streams, but nothing more
MemoryStream is the worst for large streams
RecyclableMemoryStream has an interesting top performance at 64K - 128K range and will investigate it further
I think, it means that transition between small and medium stream could be improved in ChunkedByteBufferStream

Decision tree

I just roughly scored choosing given stream implementations for certain ranges:

What I would say, the result can be read as: ResizingByteBufferStream is the best for small streams, while ChunkedByteBufferStream is the best all-rounder. MemoryStream is terrible for large streams, while RecyclableMemoryStream is quite bad for small streams.

Size	MemoryStream	ResizingStream	ChunkedStream	RecyclableStream
tiny	B	A* 🏆	A 👍	F 💩
small	D 💩	A* 🏆	A 👍	B
medium	F 💩	B	A* 🏆	A 👍
large	F 💩	C	A* 🏆	A* 🏆

If your streams are always very small, use ResizingByteBufferStream
If your streams are always quite large, use RecyclableMemoryStream or ChunkedByteBufferStream
If you need a compromise, have medium or unpredictable sizes, use ChunkedByteBufferStream

Usage

One very important note is those streams need to be disposed to get the benefit, if you don't dispose them the performance will be roughly the same as MemoryStream.

It is a little bit problematic though as memory is disposed at Dispose so you may not access it .ToArray() after that.

If you need to get data from stream, do it before disposing it!

using var stream = new ChunkedByteBufferStream();
using var writer = new StreamWriter(stream, leaveOpen: true); // NOTE: leaveOpen!
writer.Write("Hello, world!");
writer.Flush();
Console.WriteLine(Encoding.UTF8.GetString(stream.ToArray());

There are some memory specific methods available on both streams allowing quickly access data in them:

class ResizingByteBufferStream: Stream
{
    Span<byte> AsSpan();
    
    int ExportTo(Span<byte> target);
    byte[] ToArray();
}

class ChunkedByteBufferStream: Stream
{
    int ExportTo(Span<byte> target);
    byte[] ToArray();
}

(NOTE: no AsSpan() for ChunkedByteBufferStream because it is not a single block of memory, I may add AsReadOnlySequence one day though).

Other than that it is just a Stream.

Build

build

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
.config		.config
.github/workflows		.github/workflows
.nuke		.nuke
res		res
src		src
.env		.env
.gitignore		.gitignore
CHANGES.md		CHANGES.md
Directory.Build.props		Directory.Build.props
Directory.Build.targets		Directory.Build.targets
Directory.Packages.props		Directory.Packages.props
LICENSE		LICENSE
PublicAssembly.props		PublicAssembly.props
README.md		README.md
build.cmd		build.cmd
build.ps1		build.ps1
build.sh		build.sh
global.json		global.json
nuget.config		nuget.config
nuke.cmd		nuke.cmd
paket		paket
paket.cmd		paket.cmd
paket.dependencies		paket.dependencies
paket.lock		paket.lock
settings.cfg		settings.cfg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

K4os.Streams

Description

Measuring performance

Small streams (128B - 64KB)

Medium streams (128KB - 8MB)

Large streams (128MB - 512MB)

Observations

Decision tree

Usage

Build

About

Uh oh!

Releases

Packages

Languages

License

israellot/K4os.Streams

Folders and files

Latest commit

History

Repository files navigation

K4os.Streams

Description

Measuring performance

Small streams (128B - 64KB)

Medium streams (128KB - 8MB)

Large streams (128MB - 512MB)

Observations

Decision tree

Usage

Build

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages