Skip to content

Commit cc126ce

Browse files
ableegoldmanbbejeck
authored andcommitted
port paragrpah from CP docs (apache#7808)
The AK Streams architecture docs should explain how the maximum parallelism is determined Reviewers: Bill Bejeck <[email protected]>
1 parent 8668436 commit cc126ce

File tree

1 file changed

+8
-0
lines changed

1 file changed

+8
-0
lines changed

docs/streams/architecture.html

Lines changed: 8 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -65,6 +65,14 @@ <h3><a id="streams_architecture_tasks" href="#streams_architecture_tasks">Stream
6565
these record buffers. As a result stream tasks can be processed independently and in parallel without manual intervention.
6666
</p>
6767

68+
<p>
69+
Slightly simplified, the maximum parallelism at which your application may run is bounded by the maximum number of stream tasks, which itself is determined by
70+
maximum number of partitions of the input topic(s) the application is reading from. For example, if your input topic has 5 partitions, then you can run up to 5
71+
applications instances. These instances will collaboratively process the topic’s data. If you run a larger number of app instances than partitions of the input
72+
topic, the “excess” app instances will launch but remain idle; however, if one of the busy instances goes down, one of the idle instances will resume the former’s
73+
work.
74+
</p>
75+
6876
<p>
6977
It is important to understand that Kafka Streams is not a resource manager, but a library that "runs" anywhere its stream processing application runs.
7078
Multiple instances of the application are executed either on the same machine, or spread across multiple machines and tasks can be distributed automatically

0 commit comments

Comments
 (0)