Skip to content

Commit aca6511

Browse files
[3.13] GH-117759: Document incremental GC (GH-123266) (#123395)
GH-117759: Document incremental GC (GH-123266) * Update what's new * Update gc module docs and fix inconsistency in gc.get_objects (cherry picked from commit f49a916) Co-authored-by: Mark Shannon <[email protected]>
1 parent cbcb9e1 commit aca6511

File tree

3 files changed

+69
-26
lines changed

3 files changed

+69
-26
lines changed

Doc/library/gc.rst

+40-15
Original file line numberDiff line numberDiff line change
@@ -40,11 +40,18 @@ The :mod:`gc` module provides the following functions:
4040

4141
.. function:: collect(generation=2)
4242

43-
With no arguments, run a full collection. The optional argument *generation*
43+
Perform a collection. The optional argument *generation*
4444
may be an integer specifying which generation to collect (from 0 to 2). A
45-
:exc:`ValueError` is raised if the generation number is invalid. The sum of
45+
:exc:`ValueError` is raised if the generation number is invalid. The sum of
4646
collected objects and uncollectable objects is returned.
4747

48+
Calling ``gc.collect(0)`` will perform a GC collection on the young generation.
49+
50+
Calling ``gc.collect(1)`` will perform a GC collection on the young generation
51+
and an increment of the old generation.
52+
53+
Calling ``gc.collect(2)`` or ``gc.collect()`` performs a full collection
54+
4855
The free lists maintained for a number of built-in types are cleared
4956
whenever a full collection or collection of the highest generation (2)
5057
is run. Not all items in some free lists may be freed due to the
@@ -53,6 +60,9 @@ The :mod:`gc` module provides the following functions:
5360
The effect of calling ``gc.collect()`` while the interpreter is already
5461
performing a collection is undefined.
5562

63+
.. versionchanged:: 3.13
64+
``generation=1`` performs an increment of collection.
65+
5666

5767
.. function:: set_debug(flags)
5868

@@ -68,13 +78,20 @@ The :mod:`gc` module provides the following functions:
6878

6979
.. function:: get_objects(generation=None)
7080

81+
7182
Returns a list of all objects tracked by the collector, excluding the list
72-
returned. If *generation* is not ``None``, return only the objects tracked by
73-
the collector that are in that generation.
83+
returned. If *generation* is not ``None``, return only the objects as follows:
84+
85+
* 0: All objects in the young generation
86+
* 1: No objects, as there is no generation 1 (as of Python 3.13)
87+
* 2: All objects in the old generation
7488

7589
.. versionchanged:: 3.8
7690
New *generation* parameter.
7791

92+
.. versionchanged:: 3.13
93+
Generation 1 is removed
94+
7895
.. audit-event:: gc.get_objects generation gc.get_objects
7996

8097
.. function:: get_stats()
@@ -101,19 +118,27 @@ The :mod:`gc` module provides the following functions:
101118
Set the garbage collection thresholds (the collection frequency). Setting
102119
*threshold0* to zero disables collection.
103120

104-
The GC classifies objects into three generations depending on how many
105-
collection sweeps they have survived. New objects are placed in the youngest
106-
generation (generation ``0``). If an object survives a collection it is moved
107-
into the next older generation. Since generation ``2`` is the oldest
108-
generation, objects in that generation remain there after a collection. In
109-
order to decide when to run, the collector keeps track of the number object
121+
The GC classifies objects into two generations depending on whether they have
122+
survived a collection. New objects are placed in the young generation. If an
123+
object survives a collection it is moved into the old generation.
124+
125+
In order to decide when to run, the collector keeps track of the number of object
110126
allocations and deallocations since the last collection. When the number of
111127
allocations minus the number of deallocations exceeds *threshold0*, collection
112-
starts. Initially only generation ``0`` is examined. If generation ``0`` has
113-
been examined more than *threshold1* times since generation ``1`` has been
114-
examined, then generation ``1`` is examined as well.
115-
With the third generation, things are a bit more complicated,
116-
see `Collecting the oldest generation <https://devguide.python.org/garbage_collector/#collecting-the-oldest-generation>`_ for more information.
128+
starts. For each collection, all the objects in the young generation and some
129+
fraction of the old generation is collected.
130+
131+
The fraction of the old generation that is collected is **inversely** proportional
132+
to *threshold1*. The larger *threshold1* is, the slower objects in the old generation
133+
are collected.
134+
For the default value of 10, 1% of the old generation is scanned during each collection.
135+
136+
*threshold2* is ignored.
137+
138+
See `Garbage collector design <https://devguide.python.org/garbage_collector>`_ for more information.
139+
140+
.. versionchanged:: 3.13
141+
*threshold2* is ignored
117142

118143

119144
.. function:: get_count()

Doc/whatsnew/3.13.rst

+13
Original file line numberDiff line numberDiff line change
@@ -495,6 +495,19 @@ Incremental garbage collection
495495
The cycle garbage collector is now incremental.
496496
This means that maximum pause times are reduced
497497
by an order of magnitude or more for larger heaps.
498+
499+
There are now only two generations: young and old.
500+
When :func:`gc.collect` is not called directly, the
501+
GC is invoked a little less frequently. When invoked, it
502+
collects the young generation and an increment of the
503+
old generation, instead of collecting one or more generations.
504+
505+
The behavior of :func:`!gc.collect` changes slightly:
506+
507+
* ``gc.collect(1)``: Performs an increment of GC,
508+
rather than collecting generation 1.
509+
* Other calls to :func:`!gc.collect` are unchanged.
510+
498511
(Contributed by Mark Shannon in :gh:`108362`.)
499512

500513

Python/gc.c

+16-11
Original file line numberDiff line numberDiff line change
@@ -1701,20 +1701,25 @@ _PyGC_GetObjects(PyInterpreterState *interp, int generation)
17011701
GCState *gcstate = &interp->gc;
17021702

17031703
PyObject *result = PyList_New(0);
1704-
if (result == NULL) {
1705-
return NULL;
1704+
/* Generation:
1705+
* -1: Return all objects
1706+
* 0: All young objects
1707+
* 1: No objects
1708+
* 2: All old objects
1709+
*/
1710+
if (result == NULL || generation == 1) {
1711+
return result;
17061712
}
1707-
1708-
if (generation == -1) {
1709-
/* If generation is -1, get all objects from all generations */
1710-
for (int i = 0; i < NUM_GENERATIONS; i++) {
1711-
if (append_objects(result, GEN_HEAD(gcstate, i))) {
1712-
goto error;
1713-
}
1713+
if (generation <= 0) {
1714+
if (append_objects(result, &gcstate->young.head)) {
1715+
goto error;
17141716
}
17151717
}
1716-
else {
1717-
if (append_objects(result, GEN_HEAD(gcstate, generation))) {
1718+
if (generation != 0) {
1719+
if (append_objects(result, &gcstate->old[0].head)) {
1720+
goto error;
1721+
}
1722+
if (append_objects(result, &gcstate->old[1].head)) {
17181723
goto error;
17191724
}
17201725
}

0 commit comments

Comments
 (0)