Regular object layout #80

markshannon · 2021-08-04T09:48:45Z

Accessing the __dict__ of an object takes a bit of pointer chasing and computation.
The code to get the address of the __dict__ is as follows:
(assuming we have already checked whether this object can have a dict)

    Py_ssize_t dictoffset = tp->tp_dictoffset;
    if (dictoffset < 0) {
        /* Compute size of object */
        Py_ssize_t tsize = ...
        dictoffset += size;
    }
    dictptr = (PyObject **) ((char *)obj + dictoffset);

Given how often we access instance attributes, this overhead is significant

What we would like is:

    dictptr = (PyObject **) ((char *)obj + CONSTANT;

The text was updated successfully, but these errors were encountered:

markshannon · 2021-08-04T10:11:19Z

Since not all objects have a __dict__ we cannot simply put the dictionary at a fixed offset after the header without leaving holes in small objects like ints and floats.

Since any object that has a dictionary can be part of a cycle, it must have a GC header.
Therefore, we can put the dictionary directly before the GC header and there will be no gap.

Simple object without GC header, e.g. an int.

Object that may be part of cycle, but without `dict`, e.g. a list.

Object with a `dict`

methane · 2021-08-10T03:00:19Z

pymalloc aligns memory blocks with 2words (8byte on 32bit, 16byte on 64bit platform).

Currently, GC header is 2 words so no gap. If we add __dict__ there, we need to add a gap.

markshannon · 2021-08-23T07:36:42Z

If we were to reduce the GC header to a single word, then placing the dict pointer before the header would be even more compelling as it would fill the gap.

markshannon · 2021-09-22T14:44:37Z

Given that allocations are two-word aligned, the pre-header needs to be an even number of words.

Long term we want this layout:

But in the medium term, this gives us fixed offsets for dict and weakref pointers and is as compact as what we have now:

pxeger · 2021-09-22T15:08:11Z

Maybe a noobish and/or off-topic question because I don't know much about CPython memory layout, but why is there anything before what the object pointer points to at all? Why not just keep it all after that?

markshannon · 2021-09-22T15:28:31Z

It needs to be at a fixed offset and allow for variable sized objects and inheritance.

gvanrossum · 2021-09-22T15:32:11Z

In particular, for ABI compatibility we need to keep ob_refcnt and ob_type at the same offset relative to the pointer.

markshannon · 2021-09-30T17:05:23Z

An alternative to the above, which will work well with #72 (comment) is:

markshannon · 2021-10-12T17:01:27Z

Regular object layout would also help the GC traverse and clean objects, as well as simplify code for inheritance of layouts, assigning __dict__ and __class__ attributes.
To that end we also need to consider the layout of an object after the class pointer.

I'm going to gloss over weak references here. If we inline the values, the weakref list can go where the values pointer is now.
In the future, we may maintain the weakrefs to an object externally, both saving memory and allowing weakrefs to any object.

The object can be broken down into five sections:

dict and values pointers
GC bits
BaseObject (class and reference count)
Custom section
Slots

The first three are described above. The custom section is whatever is handled by the custom code for a builtin class. Slots are slots described either by __slots__ in Python code, or in the type spec.

E.g. the following class

class XList(list):
    __slots__ = "a", "b", "__dict__"

would have all five sections.

In contrast, object has just the one section: BaseObject.

Objects without a custom section, are transparent to the GC and VM and will need no custom traverse, etc functions.
For efficiency we might want to insist that object slots are grouped together and precede non-object slots.

Inheritance

The rules for layout inheritance are much as they are now, but slightly simplifies by having the dictionary at a fixed offset.
Single inheritance in Python is always legal. New slots are added at the end.

Multiple inheritance where more than one class has a custom section is prohibited. I think this is the same as it is now: "multiple bases have instance lay-out conflict".

GC operations

The GC needs to traverse objects and clear them. In addition, objects need to be deallocated.
These operations are all very similar to each other, and are also similar across different classes. Yet we have a plethora of different, often buggy, implementations.

For objects without a custom section (and perhaps for some special cases with a custom section) the above layout is transparent to the GC. The traversal and deallocation functions can be inlined leading to faster and more robust memory management.

gvanrossum · 2021-10-12T17:30:28Z

So the slots section is not used for "ordinary attributes" (the ones that go into __dict__), right? Only for things described in the type object (how?) or in __slots__.

For a tuple, would the custom section contain everything, or would the custom section only contain the length so the items would become slots? (That would be handy for namedtuples too.) But this seems to contradict the idea that the layout would be transparent to the GC -- it would have to know to look in the custom section to find how many slots there are.

I worry about backwards compatibility here -- 3rd party type definitions should remain supported (probably for many releases). I also worry specifically about tp_dictoffset, which appears in the public headers, even though one should call _PyObject_GetDictPtr() instead.

markshannon · 2021-10-13T10:59:49Z

"Custom" can contain anything. Fully backwards compatible and opaque to the GC.
Tuples would be "custom". Having custom layouts for things like tuples, lists and dicts is fine. They're special.

Code that sets tp_dictoffset is fine, that just means we have a custom layout.

Code that uses tp_dictoffset is problematic only in the case where we have the __dict__ pointer in the header, and the class has instances of differing size, e.g. a tuple.

Which means that we can't use regular object layout for classes that inherit from tuple, bytes, etc.
I don't think that will be a problem in practice.

For example, this class:

class XTuple(tuple):
    pass

would not be able to have the __dict__ pointer in the header, it would have to go in the "custom" section.

markshannon · 2022-01-13T09:50:38Z

I think this is done. We might want to tweak this later, but that can be its own issue.
FTR, the final layout chosen was #80 (comment)

markshannon mentioned this issue Aug 4, 2021

Object layout #69

Closed

markshannon mentioned this issue Aug 13, 2021

bpo-44889: Specialize LOAD_METHOD with PEP 659 adaptive interpreter python/cpython#27722

Merged

markshannon mentioned this issue Dec 1, 2021

bpo-45947: Place dict and values pointer at fixed (negative) offset just before GC header. python/cpython#29879

Merged

gramster added this to Fancy CPython Board Jan 10, 2022

gramster moved this to Todo in Fancy CPython Board Jan 10, 2022

gramster moved this from Todo to Other in Fancy CPython Board Jan 10, 2022

markshannon moved this from Other to Done in Fancy CPython Board Jan 13, 2022

markshannon closed this as completed Jan 13, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Regular object layout #80

Regular object layout #80

markshannon commented Aug 4, 2021

markshannon commented Aug 4, 2021

methane commented Aug 10, 2021

markshannon commented Aug 23, 2021

markshannon commented Sep 22, 2021

pxeger commented Sep 22, 2021 •

edited

Loading

markshannon commented Sep 22, 2021

gvanrossum commented Sep 22, 2021

markshannon commented Sep 30, 2021

markshannon commented Oct 12, 2021

gvanrossum commented Oct 12, 2021

markshannon commented Oct 13, 2021

markshannon commented Jan 13, 2022

Regular object layout #80

Regular object layout #80

Comments

markshannon commented Aug 4, 2021

markshannon commented Aug 4, 2021

Simple object without GC header, e.g. an int.

Object that may be part of cycle, but without __dict__, e.g. a list.

Object with a __dict__

methane commented Aug 10, 2021

markshannon commented Aug 23, 2021

markshannon commented Sep 22, 2021

pxeger commented Sep 22, 2021 • edited Loading

markshannon commented Sep 22, 2021

gvanrossum commented Sep 22, 2021

markshannon commented Sep 30, 2021

markshannon commented Oct 12, 2021

Inheritance

GC operations

gvanrossum commented Oct 12, 2021

markshannon commented Oct 13, 2021

markshannon commented Jan 13, 2022

Object that may be part of cycle, but without `dict`, e.g. a list.

Object with a `dict`

pxeger commented Sep 22, 2021 •

edited

Loading