Based on lectures from Philip Guo.
A handful of folders of interest in the root of the source code dir:
- Include/ - headers/interfaces; important file: opcode.h
- Python/ - source code for the Python interpreter itself; important file: ceval.c
- Lib/ - library functions, written in Python
- Modules/ - source code for Python modules written in C
Some interesting stuff:
$ cat t.py
x = 1
y = 2
z = x + y
print(z)
we can get a neat disassembly by using the "dis" module (python -m <module> runs <module> as a script, executing its __main__ block)
$ python -m dis t.py
  1           0 LOAD_CONST               0 (1)
              3 STORE_NAME               0 (x)

  2           6 LOAD_CONST               1 (2)
              9 STORE_NAME               1 (y)

  3          12 LOAD_NAME                0 (x)
             15 LOAD_NAME                1 (y)
             18 BINARY_ADD
             19 STORE_NAME               2 (z)

  4          22 LOAD_NAME                2 (z)
             25 PRINT_ITEM
             26 PRINT_NEWLINE
             27 LOAD_CONST               2 (None)
             30 RETURN_VALUE
What this does is:
- push the constant 1 onto the value stack
- pop the top of the stack and store a reference to it against the name "x"
- push the constant 2 onto the stack
- pop the top of the stack and store a reference against the name "y"
- push a reference to "x" onto the stack
- push a reference to "y" onto the stack
- execute the BINARY_ADD opcode, which pops the top two values from the stack and combines them (it's overloaded, so it can do different things depending on the types), then pushes the result
- pop the result and store it against the name "z", then push it back via LOAD_NAME
- call print_item, which pops the top of the stack and prints it to stdout
- then print_newline, which prints a \n to stdout
- finally, push the constant None and return it (every code object returns a value, even a module body)
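The steps above can be simulated with an explicit value stack in Python itself. This is a toy walkthrough, not how CPython actually stores things (it pushes PyObject pointers, not values), but the stack discipline is the same:

```python
# Toy simulation of the value stack for the bytecode above
# (PRINT_ITEM/PRINT_NEWLINE are approximated with a print call).
stack = []
names = {}

stack.append(1)                       # LOAD_CONST 0 (1)
names["x"] = stack.pop()              # STORE_NAME 0 (x)
stack.append(2)                       # LOAD_CONST 1 (2)
names["y"] = stack.pop()              # STORE_NAME 1 (y)
stack.append(names["x"])              # LOAD_NAME 0 (x)
stack.append(names["y"])              # LOAD_NAME 1 (y)
rhs, lhs = stack.pop(), stack.pop()   # BINARY_ADD pops two...
stack.append(lhs + rhs)               # ...and pushes the combined result
names["z"] = stack.pop()              # STORE_NAME 2 (z)
stack.append(names["z"])              # LOAD_NAME 2 (z)
print(stack.pop())                    # PRINT_ITEM + PRINT_NEWLINE
stack.append(None)                    # LOAD_CONST 2 (None)
result = stack.pop()                  # RETURN_VALUE
```

Note that the stack is empty again at the end: every value pushed by one opcode is consumed by a later one.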
Main execution of the Python interpreter is PyEval_EvalFrameEx() in ceval.c - there's an infinite loop wrapping a giant switch on the current opcode:

for (;;) { // line 964
    // ...
    switch (opcode) { // line 1109
    // ...
    case NOP:
        goto fast_next_opcode;
    case LOAD_FAST:
        x = GETLOCAL(oparg);
        if (x != NULL) {
            // ...
    } /* switch */ // line 2823
    // ...
} // line 3021
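The shape of that loop can be sketched in Python. This is a hypothetical miniature, supporting only the few opcodes our t.py example needs, but it uses the real Python 2 opcode numbers (from Include/opcode.h) and the same fetch/decode/dispatch structure:

```python
# Miniature fetch/decode/dispatch loop in the spirit of PyEval_EvalFrameEx().
# Opcode numbers are the real Python 2 values for these opcodes.
LOAD_CONST, BINARY_ADD, RETURN_VALUE = 100, 23, 83
HAVE_ARGUMENT = 90  # opcodes >= this take a 2-byte argument

def run(code, consts):
    stack, pc = [], 0
    while True:                         # for (;;)
        opcode = code[pc]; pc += 1
        oparg = None
        if opcode >= HAVE_ARGUMENT:     # decode little-endian 2-byte arg
            oparg = code[pc] | (code[pc + 1] << 8)
            pc += 2
        if opcode == LOAD_CONST:        # switch (opcode)
            stack.append(consts[oparg])
        elif opcode == BINARY_ADD:
            w = stack.pop()
            v = stack.pop()
            stack.append(v + w)
        elif opcode == RETURN_VALUE:
            return stack.pop()

# 1 + 2, encoded as LOAD_CONST 0; LOAD_CONST 1; BINARY_ADD; RETURN_VALUE
assert run([100, 0, 0, 100, 1, 0, 23, 83], [1, 2]) == 3
```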
If we want to look at the individual bytes in the bytecode instead of the disassembly:
$ cat > t.py << EOF
x = 1
y = 2
z = x + y
print(z)
EOF
$ python
Python 2.7.12rc1 (default, Jun 13 2016, 09:20:59)
[GCC 5.4.0 20160609] on linux2
Type "help", "copyright", "credits" or "license" for more information.
>>> c = compile(open('t.py').read(), 't.py', 'exec')
>>> c.co_code
'd\x00\x00Z\x00\x00d\x01\x00Z\x01\x00e\x00\x00e\x01\x00\x17Z\x02\x00e\x02\x00GHd\x02\x00S'
>>> [ ord(b) for b in c.co_code ]
[100, 0, 0, 90, 0, 0, 100, 1, 0, 90, 1, 0, 101, 0, 0, 101, 1, 0, 23, 90, 2, 0, 101, 2, 0, 71, 72, 100, 2, 0, 83]
Taking this and mapping it back to the disassembly, we get something like this:
100, 0, 0, # LOAD_CONST 0x0000
90, 0, 0, # STORE_NAME 0x0000
100, 1, 0, # LOAD_CONST 0x0001
90, 1, 0, # STORE_NAME 0x0001
101, 0, 0, # LOAD_NAME 0x0000
101, 1, 0, # LOAD_NAME 0x0001
23, # BINARY_ADD
90, 2, 0, # STORE_NAME 0x0002
101, 2, 0, # LOAD_NAME 0x0002
71, # PRINT_ITEM
72, # PRINT_NEWLINE
100, 2, 0, # LOAD_CONST 0x0002
83 # RETURN_VALUE
The first byte is the opcode, and (for opcodes that take an argument) the next two bytes are the index into the co_consts/co_names tuple. The two-byte arg is stored little-endian, least-significant byte first - so [100, 1, 0] is LOAD_CONST 1, [100, 2, 0] is LOAD_CONST 2, and [100, 0, 1] is LOAD_CONST 256.
If we have more than 65536 items we obviously cannot represent every index in the two-byte space we have for each arg. This is where the EXTENDED_ARG opcode comes in: its own two-byte arg supplies the high 16 bits of the next instruction's arg. So for example, to load the constant and the name at index 65536 (0x10000) we'd issue the following:

145, 1, 0,   # EXTENDED_ARG 0x0001
100, 0, 0,   # LOAD_CONST 0x0000 (= 0x00010000, including EXTENDED_ARG)
145, 1, 0,   # EXTENDED_ARG 0x0001
101, 0, 0,   # LOAD_NAME 0x0000 (= 0x00010000, including EXTENDED_ARG)

That gives a 4-byte arg space in total; exhausting it would mean billions of names in the same frame, which tbh is an insane situation we don't ever want to encounter.
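The EXTENDED_ARG folding can be sketched as a small decoder. This is an illustrative sketch of the Python 2 scheme (the `decode` helper is made up; only EXTENDED_ARG's opcode number, 145, is from opcode.h):

```python
# Decode a list of Python 2 bytecode byte values into (opcode, arg) pairs,
# folding EXTENDED_ARG's 2-byte arg into the high 16 bits of the next arg.
EXTENDED_ARG = 145
HAVE_ARGUMENT = 90  # opcodes below this take no argument

def decode(code):
    """Yield (opcode, arg) pairs; arg is None for argless opcodes."""
    pc, ext = 0, 0
    while pc < len(code):
        opcode = code[pc]; pc += 1
        arg = None
        if opcode >= HAVE_ARGUMENT:
            # little-endian 2-byte arg, plus any pending EXTENDED_ARG bits
            arg = (code[pc] | (code[pc + 1] << 8)) + ext
            pc += 2
        ext = 0
        if opcode == EXTENDED_ARG:
            ext = arg << 16   # stash high bits for the next instruction
            continue
        yield opcode, arg

# EXTENDED_ARG 1 followed by LOAD_CONST 0 decodes to LOAD_CONST 65536
print(list(decode([145, 1, 0, 100, 0, 0])))
```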
Bytecode
- a sequence of bytes encoding operations and their args
Code
- a collection of bytecode
- has some semantics attached, like constants (co_consts) and names (co_names)
Frame
- has a code object, and an "environment"
- a runtime representation of one execution of a code object - created on every call, it holds the value stack, the locals and the instruction pointer
Function
- actually a Closure
- has a Code object
- but also has an "environment": the globals, default args and closure cells captured when the def statement executed
- so a Function is a Code object bundled with its definition-time environment, while a Frame is the state of one call of that code in progress
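We can poke at the Code/Function distinction from Python itself. The attribute spellings below are the modern ones (Python 2 also has func_code/func_closure); the point is that two functions can share one code object while carrying different environments:

```python
# Two closures made from the same def share a single code object,
# but each carries its own environment (a closure cell holding n).
def make_adder(n):
    def add(x):
        return x + n   # n comes from the enclosing environment
    return add

add1 = make_adder(1)
add5 = make_adder(5)

# Same compiled code...
assert add1.__code__ is add5.__code__
# ...different captured environments:
assert add1.__closure__[0].cell_contents == 1
assert add5.__closure__[0].cell_contents == 5
assert add1(41) == 42
assert add5(41) == 46
```

Each call to add1 or add5 then creates a fresh frame that executes that shared code object against the function's captured environment.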
Every object you deal with in CPython "derives" from a PyObject type (C has no inheritance, so in practice every object struct begins with the PyObject fields and you cast it to (PyObject *) before using it). Some interesting stuff:
When we have the following:

x = 1
y = 2
z = x + y

the x + y addition is compiled as a pair of LOAD_NAME instructions to push x and y onto the stack, followed by a BINARY_ADD. Here's the BINARY_ADD case in the main loop of PyEval_EvalFrameEx():
case BINARY_ADD:
    w = POP();
    v = TOP();
    if (PyInt_CheckExact(v) && PyInt_CheckExact(w)) {
        /* INLINE: int + int */
        register long a, b, i;
        a = PyInt_AS_LONG(v);
        b = PyInt_AS_LONG(w);
        /* cast to avoid undefined behaviour
           on overflow */
        i = (long)((unsigned long)a + b;
        if ((i^a) < 0 && (i^b) < 0)
            goto slow_add;
        x = PyInt_FromLong(i);
    }
    else if (PyString_CheckExact(v) &&
             PyString_CheckExact(w)) {
        x = string_concatenate(v, w, f, next_instr);
        /* string_concatenate consumed the ref to v */
        goto skip_decref_vx;
    }
    else {
      slow_add:
        x = PyNumber_Add(v, w);
    }
    Py_DECREF(v);
  skip_decref_vx:
    Py_DECREF(w);
    SET_TOP(x);
    if (x != NULL) continue;
    break;
(shite, what I was going to say is actually wrong as someone decided to optimise the integer case - assume that /* INLINE: int + int */ section isn't there). If the two arguments are both integers (PyInt_CheckExact() returns true for both) then, with that inline fast path out of the picture, we end up calling PyNumber_Add() in abstract.c:
PyObject *
PyNumber_Add(PyObject *v, PyObject *w)
{
    PyObject *result = binary_op1(v, w, NB_SLOT(nb_add));
    if (result == Py_NotImplemented) {
        PySequenceMethods *m = v->ob_type->tp_as_sequence;
        Py_DECREF(result);
        if (m && m->sq_concat) {
            return (*m->sq_concat)(v, w);
        }
        result = binop_type_error(v, w, "+");
    }
    return result;
}
This immediately hands off to binary_op1() - inside this we retrieve the functions handling the addition operator for both types using the NB_BINOP() macro. I think this is so we can attempt the addition with one type's handler and, if it returns NotImplemented, fall back to the other's. If both functions are the same we set one of them to NULL so we don't try it twice. Anyway, in our case we end up inside int_add():
static PyObject *
int_add(PyIntObject *v, PyIntObject *w)
{
    register long a, b, x;
    CONVERT_TO_LONG(v, a);
    CONVERT_TO_LONG(w, b);
    /* casts in the line below avoid undefined behaviour on overflow */
    x = (long)((unsigned long)a + b);
    if ((x^a) >= 0 || (x^b) >= 0)
        return PyInt_FromLong(x);
    return PyLong_Type.tp_as_number->nb_add((PyObject *)v, (PyObject *)w);
}
CONVERT_TO_LONG is a macro which extracts the underlying value from a PyIntObject and stores it in a long. We perform the addition, then perform a neat overflow check: ((x^a) >= 0 || (x^b) >= 0) is true when the result has the same sign bit as at least one operand, which means no overflow occurred, and in that case we create a PyIntObject from the result. In the case where the addition would overflow the long type we end up calling a different, slower routine (PyLong's nb_add) that I haven't even looked at yet.
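The xor sign trick can be demonstrated from Python by simulating fixed-width two's-complement addition (Python's own ints are arbitrary-precision, so we have to wrap the sum by hand; the helper names here are made up):

```python
# Demonstrate int_add()'s overflow check by simulating 64-bit signed longs.
BITS = 64
MASK = (1 << BITS) - 1

def to_signed(u):
    """Interpret a 64-bit pattern as a signed two's-complement value."""
    return u - (1 << BITS) if u >> (BITS - 1) else u

def add_overflows(a, b):
    x = to_signed((a + b) & MASK)   # wrapped 64-bit sum, like the C cast dance
    # (x^a) >= 0 means x and a share a sign bit; overflow flips the result's
    # sign away from BOTH operands, so no-overflow iff either test passes.
    return not ((x ^ a) >= 0 or (x ^ b) >= 0)

assert add_overflows(2**63 - 1, 1)          # LONG_MAX + 1 wraps negative
assert add_overflows(-(2**63), -1)          # LONG_MIN - 1 wraps positive
assert not add_overflows(2**62, 2**62 - 1)  # just fits
assert not add_overflows(-5, 3)             # mixed signs never overflow
```

Python's xor on negative ints behaves as if they had infinitely many sign bits, so the `>= 0` sign comparison works exactly as it does on C longs.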