.. _embedding: Embedding the interpreter ######################### While pybind11 is mainly focused on extending Python using C++, it's also possible to do the reverse: embed the Python interpreter into a C++ program. All of the other documentation pages still apply here, so refer to them for general pybind11 usage. This section will cover a few extra things required for embedding. Getting started =============== A basic executable with an embedded interpreter can be created with just a few lines of CMake and the ``pybind11::embed`` target, as shown below. For more information, see :doc:`/compiling`. .. code-block:: cmake cmake_minimum_required(VERSION 3.15...4.0) project(example) find_package(pybind11 REQUIRED) # or `add_subdirectory(pybind11)` add_executable(example main.cpp) target_link_libraries(example PRIVATE pybind11::embed) The essential structure of the ``main.cpp`` file looks like this: .. code-block:: cpp #include // everything needed for embedding namespace py = pybind11; int main() { py::scoped_interpreter guard{}; // start the interpreter and keep it alive py::print("Hello, World!"); // use the Python API } The interpreter must be initialized before using any Python API, which includes all the functions and classes in pybind11. The RAII guard class ``scoped_interpreter`` takes care of the interpreter lifetime. After the guard is destroyed, the interpreter shuts down and clears its memory. No Python functions can be called after this. Executing Python code ===================== There are a few different ways to run Python code. One option is to use ``eval``, ``exec`` or ``eval_file``, as explained in :ref:`eval`. Here is a quick example in the context of an executable with an embedded interpreter: .. code-block:: cpp #include namespace py = pybind11; int main() { py::scoped_interpreter guard{}; py::exec(R"( kwargs = dict(name="World", number=42) message = "Hello, {name}! The answer is {number}".format(**kwargs) print(message) )"); } Alternatively, similar results can be achieved using pybind11's API (see :doc:`/advanced/pycpp/index` for more details). .. code-block:: cpp #include namespace py = pybind11; using namespace py::literals; int main() { py::scoped_interpreter guard{}; auto kwargs = py::dict("name"_a="World", "number"_a=42); auto message = "Hello, {name}! The answer is {number}"_s.format(**kwargs); py::print(message); } The two approaches can also be combined: .. code-block:: cpp #include #include namespace py = pybind11; using namespace py::literals; int main() { py::scoped_interpreter guard{}; auto locals = py::dict("name"_a="World", "number"_a=42); py::exec(R"( message = "Hello, {name}! The answer is {number}".format(**locals()) )", py::globals(), locals); auto message = locals["message"].cast(); std::cout << message; } Importing modules ================= Python modules can be imported using ``module_::import()``: .. code-block:: cpp py::module_ sys = py::module_::import("sys"); py::print(sys.attr("path")); For convenience, the current working directory is included in ``sys.path`` when embedding the interpreter. This makes it easy to import local Python files: .. code-block:: python """calc.py located in the working directory""" def add(i, j): return i + j .. code-block:: cpp py::module_ calc = py::module_::import("calc"); py::object result = calc.attr("add")(1, 2); int n = result.cast(); assert(n == 3); Modules can be reloaded using ``module_::reload()`` if the source is modified e.g. by an external process. This can be useful in scenarios where the application imports a user defined data processing script which needs to be updated after changes by the user. Note that this function does not reload modules recursively. .. _embedding_modules: Adding embedded modules ======================= Embedded binary modules can be added using the ``PYBIND11_EMBEDDED_MODULE`` macro. Note that the definition must be placed at global scope. They can be imported like any other module. .. code-block:: cpp #include namespace py = pybind11; PYBIND11_EMBEDDED_MODULE(fast_calc, m) { // `m` is a `py::module_` which is used to bind functions and classes m.def("add", [](int i, int j) { return i + j; }); } int main() { py::scoped_interpreter guard{}; auto fast_calc = py::module_::import("fast_calc"); auto result = fast_calc.attr("add")(1, 2).cast(); assert(result == 3); } Unlike extension modules where only a single binary module can be created, on the embedded side an unlimited number of modules can be added using multiple ``PYBIND11_EMBEDDED_MODULE`` definitions (as long as they have unique names). These modules are added to Python's list of builtins, so they can also be imported in pure Python files loaded by the interpreter. Everything interacts naturally: .. code-block:: python """py_module.py located in the working directory""" import cpp_module a = cpp_module.a b = a + 1 .. code-block:: cpp #include namespace py = pybind11; PYBIND11_EMBEDDED_MODULE(cpp_module, m) { m.attr("a") = 1; } int main() { py::scoped_interpreter guard{}; auto py_module = py::module_::import("py_module"); auto locals = py::dict("fmt"_a="{} + {} = {}", **py_module.attr("__dict__")); assert(locals["a"].cast() == 1); assert(locals["b"].cast() == 2); py::exec(R"( c = a + b message = fmt.format(a, b, c) )", py::globals(), locals); assert(locals["c"].cast() == 3); assert(locals["message"].cast() == "1 + 2 = 3"); } ``PYBIND11_EMBEDDED_MODULE`` also accepts :func:`py::mod_gil_not_used()`, :func:`py::multiple_interpreters::per_interpreter_gil()`, and :func:`py::multiple_interpreters::shared_gil()` tags just like ``PYBIND11_MODULE``. See :ref:`misc_subinterp` and :ref:`misc_free_threading` for more information. Interpreter lifetime ==================== The Python interpreter shuts down when ``scoped_interpreter`` is destroyed. After this, creating a new instance will restart the interpreter. Alternatively, the ``initialize_interpreter`` / ``finalize_interpreter`` pair of functions can be used to directly set the state at any time. Modules created with pybind11 can be safely re-initialized after the interpreter has been restarted. However, this may not apply to third-party extension modules. The issue is that Python itself cannot completely unload extension modules and there are several caveats with regard to interpreter restarting. In short, not all memory may be freed, either due to Python reference cycles or user-created global data. All the details can be found in the CPython documentation. .. warning:: Creating two concurrent ``scoped_interpreter`` guards is a fatal error. So is calling ``initialize_interpreter`` for a second time after the interpreter has already been initialized. Use :class:`scoped_subinterpreter` to create a sub-interpreter. See :ref:`subinterp` for important details on sub-interpreters. Do not use the raw CPython API functions ``Py_Initialize`` and ``Py_Finalize`` as these do not properly handle the lifetime of pybind11's internal data. .. _subinterp: Embedding Sub-interpreters ========================== A sub-interpreter is a separate interpreter instance which provides a separate, isolated interpreter environment within the same process as the main interpreter. Sub-interpreters are created and managed with a separate API from the main interpreter. Beginning in Python 3.12, sub-interpreters each have their own Global Interpreter Lock (GIL), which means that running a sub-interpreter in a separate thread from the main interpreter can achieve true concurrency. pybind11's sub-interpreter API can be found in ``pybind11/subinterpreter.h``. pybind11 :class:`subinterpreter` instances can be safely moved and shared between threads as needed. However, managing multiple threads and the lifetimes of multiple interpreters and their GILs can be challenging. Proceed with caution (and lots of testing)! The main interpreter must be initialized before creating a sub-interpreter, and the main interpreter must outlive all sub-interpreters. Sub-interpreters are managed through a different API than the main interpreter. The :class:`subinterpreter` class manages the lifetime of sub-interpreters. Instances are movable, but not copyable. Default constructing this class does *not* create a sub-interpreter (it creates an empty holder). To create a sub-interpreter, call :func:`subinterpreter::create()`. .. warning:: Sub-interpreter creation acquires (and subsequently releases) the main interpreter GIL. If another thread holds the main GIL, the function will block until the main GIL can be acquired. Sub-interpreter destruction temporarily activates the sub-interpreter. The sub-interpreter must not be active (on any threads) at the time the :class:`subinterpreter` destructor is called. Both actions will re-acquire any interpreter's GIL that was held prior to the call before returning (or return to no active interpreter if none was active at the time of the call). Each sub-interpreter will import a separate copy of each ``PYBIND11_EMBEDDED_MODULE`` when those modules specify a ``multiple_interpreters`` tag. If a module does not specify a ``multiple_interpreters`` tag, then Python will report an ``ImportError`` if it is imported in a sub-interpreter. pybind11 also has a :class:`scoped_subinterpreter` class, which creates and activates a sub-interpreter when it is constructed, and deactivates and deletes it when it goes out of scope. Activating a Sub-interpreter ^^^^^^^^^^^^^^^^^^^^^^^^^^^^ Once a sub-interpreter is created, you can "activate" it on a thread (and acquire its GIL) by creating a :class:`subinterpreter_scoped_activate` instance and passing it the sub-intepreter to be activated. The function will acquire the sub-interpreter's GIL and make the sub-interpreter the current active interpreter on the current thread for the lifetime of the instance. When the :class:`subinterpreter_scoped_activate` instance goes out of scope, the sub-interpreter GIL is released and the prior interpreter that was active on the thread (if any) is reactivated and it's GIL is re-acquired. When using ``subinterpreter_scoped_activate``: 1. If the thread holds any interpreter's GIL: - That GIL is released 2. The new sub-interpreter's GIL is acquired 3. The new sub-interpreter is made active. 4. When the scope ends: - The sub-interpreter's GIL is released - If there was a previous interpreter: - The old interpreter's GIL is re-acquired - The old interpreter is made active - Otherwise, no interpreter is currently active and no GIL is held. Example: .. code-block:: cpp py::initialize_interpreter(); // Main GIL is held { py::subinterpreter sub = py::subinterpreter::create(); // Main interpreter is still active, main GIL re-acquired { py::subinterpreter_scoped_activate guard(sub); // Sub-interpreter active, thread holds sub's GIL { py::subinterpreter_scoped_activate main_guard(py); // Sub's GIL was automatically released // Main interpreter active, thread holds main's GIL } // Back to sub-interpreter, thread holds sub's GIL again } // Main interpreter is active, main's GIL is held } GIL API for sub-interpreters ^^^^^^^^^^^^^^^^^^^^^^^^^^^^ :class:`gil_scoped_release` and :class:`gil_scoped_acquire` can be used to manage the GIL of a sub-interpreter just as they do for the main interpreter. They both manage the GIL of the currently active interpreter, without the programmer having to do anything special or different. There is one important caveat: .. note:: When no interpreter is active through a :class:`subinterpreter_scoped_activate` instance (such as on a new thread), :class:`gil_scoped_acquire` will acquire the **main** GIL and activate the **main** interpreter. Full Sub-interpreter example ^^^^^^^^^^^^^^^^^^^^^^^^^^^^ Here is an example showing how to create and activate sub-interpreters: .. code-block:: cpp #include #include #include namespace py = pybind11; PYBIND11_EMBEDDED_MODULE(printer, m, py::multiple_interpreters::per_interpreter_gil()) { m.def("which", [](const std::string& when) { std::cout << when << "; Current Interpreter is " << py::subinterpreter::current().id() << std::endl; }); } int main() { py::scoped_interpreter main_interp; py::module_::import("printer").attr("which")("First init"); { py::subinterpreter sub = py::subinterpreter::create(); py::module_::import("printer").attr("which")("Created sub"); { py::subinterpreter_scoped_activate guard(sub); try { py::module_::import("printer").attr("which")("Activated sub"); } catch (py::error_already_set &e) { std::cerr << "EXCEPTION " << e.what() << std::endl; return 1; } } py::module_::import("printer").attr("which")("Deactivated sub"); { py::gil_scoped_release nogil; { py::subinterpreter_scoped_activate guard(sub); try { { py::subinterpreter_scoped_activate main_guard(py::subinterpreter::main()); try { py::module_::import("printer").attr("which")("Main within sub"); } catch (py::error_already_set &e) { std::cerr << "EXCEPTION " << e.what() << std::endl; return 1; } } py::module_::import("printer").attr("which")("After Main, still within sub"); } catch (py::error_already_set &e) { std::cerr << "EXCEPTION " << e.what() << std::endl; return 1; } } } } py::module_::import("printer").attr("which")("At end"); return 0; } Expected output: .. code-block:: text First init; Current Interpreter is 0 Created sub; Current Interpreter is 0 Activated sub; Current Interpreter is 1 Deactivated sub; Current Interpreter is 0 Main within sub; Current Interpreter is 0 After Main, still within sub; Current Interpreter is 1 At end; Current Interpreter is 0 .. warning:: In Python 3.12 sub-interpreters must be destroyed in the same OS thread that created them. Failure to follow this rule may result in deadlocks or crashes when destroying the sub-interpreter on the wrong thread. This constraint is not present in Python 3.13+. Best Practices for sub-interpreter safety ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ - Never share Python objects across different interpreters. - :class:`error_already_set` objects contain a reference to the Python exception type, and :func:`error_already_set::what()` acquires the GIL. So Python exceptions must **never** be allowed to propagate past the enclosing :class:`subinterpreter_scoped_activate` instance! (So your try/catch should be *just inside* the scope covered by the :class:`subinterpreter_scoped_activate`.) - Avoid global/static state whenever possible. Instead, keep state within each interpreter, such as within the interpreter state dict, which can be accessed via ``subinterpreter::current().state_dict()``, or within instance members and tied to Python objects. - Avoid trying to "cache" Python objects in C++ variables across function calls (this is an easy way to accidentally introduce sub-interpreter bugs). In the code example above, note that we did not save the result of :func:`module_::import`, in order to avoid accidentally using the resulting Python object when the wrong interpreter was active. - Avoid moving or disarming RAII objects managing GIL and sub-interpreter lifetimes. Doing so can lead to confusion about lifetimes. (For example, accidentally extending a :class:`subinterpreter_scoped_activate` past the lifetime of it's :class:`subinterpreter`.) - While sub-interpreters each have their own GIL, there can now be multiple independent GILs in one program so you need to consider the possibility of deadlocks caused by multiple GILs and/or the interactions of the GIL(s) and your C++ code's own locking. - When using multiple threads to run independent sub-interpreters, the independent GILs allow concurrent calls from different interpreters into the same C++ code from different threads. So you must still consider the thread safety of your C++ code. Remember, in Python 3.12 sub-interpreters must be destroyed on the same thread that they were created on. - Familiarize yourself with :ref:`misc_concurrency`.