
Loky backend doesn't cleanup worker processes #945

Open
rbedi opened this issue Oct 5, 2019 · 5 comments

@rbedi

rbedi commented Oct 5, 2019

Python 3.7.2, macOS 10.13.3 and Ubuntu 18.04

I notice that when using the Loky backend, joblib doesn't clean up after itself, even when explicitly calling _terminate_backend(). Here's a minimal example:

from joblib import Parallel, delayed
from multiprocessing import active_children

def f(x): return x**2
par_loky = Parallel(n_jobs=32, backend="loky")
par_loky(delayed(f)(i) for i in range(10))

print(len(active_children())) # Prints 32
par_loky._terminate_backend()
print(len(active_children()))  # Prints 32

Same effect if I use the context manager to construct the pool:

from joblib import Parallel, delayed
from multiprocessing import active_children

def f(x): return x**2

with Parallel(n_jobs=32, backend="loky") as par_loky:
    par_loky([delayed(f)(i) for i in range(10)])

print(len(active_children())) # Prints 32

However, with the multiprocessing backend, it works as expected:

from joblib import Parallel, delayed
from multiprocessing import active_children

def f(x): return x**2
par_mp = Parallel(n_jobs=32, backend="multiprocessing")
par_mp(delayed(f)(i) for i in range(10))
print(len(active_children())) # prints 0, as expected
@tomMoral
Contributor

tomMoral commented Oct 9, 2019

Hi,

thanks for reporting. This is actually the expected behavior for loky. The loky backend relies on spawn to start the new processes for the pool of workers. As this method may take up to a few tenths of a second, starting a new pool each time you call Parallel can be quite costly. To that end, loky manages a reusable pool of workers that is shared across multiple calls to Parallel, and is therefore not terminated even once the Parallel object is cleaned up with _terminate_backend.

If they are not reused, the worker processes are cleaned up once they time out (the default is 300s, and apparently there is no way to modify that yet). If you need to force the clean-up of these processes, you can call:

from joblib.externals.loky import get_reusable_executor
get_reusable_executor().shutdown(wait=True)

Let me know if this solves your problem. As for the API, would it be better if you could control the timeout delay for the workers, or would you need to clean up the processes directly with an imperative instruction?
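
For reference, a minimal sketch combining the repro above with the explicit shutdown (assuming no other child processes are running, so the worker counts in the comments are exact):

from joblib import Parallel, delayed
from joblib.externals.loky import get_reusable_executor
from multiprocessing import active_children

def f(x): return x**2

# Run a loky-backed Parallel call; the reusable executor keeps its workers alive.
Parallel(n_jobs=4, backend="loky")(delayed(f)(i) for i in range(10))
print(len(active_children()))  # 4: workers are kept around for reuse

# Explicitly shut down the shared executor and wait for the workers to exit.
get_reusable_executor().shutdown(wait=True)
print(len(active_children()))  # 0: workers have been terminated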

@ogrisel
Contributor

ogrisel commented Oct 17, 2019

Maybe we should improve the joblib documentation.

@dishkakrauch

(Quoting @tomMoral's reply above.)

Thank you for this note; it helped me solve an issue with using joblib as part of an Apache Airflow Python task, where I was left with many daemonic processes after DAG execution.

@secsilm

secsilm commented Nov 4, 2022

(Quoting @tomMoral's reply above.)

What if I use multiprocessing as the backend? Can I still use from joblib.externals.loky import get_reusable_executor?

@durgeksh

durgeksh commented Nov 9, 2023

I want to read all the sheets in an xlsx file using joblib in parallel mode. Polars is the library used for processing the xlsx file.
When I use loky as the backend, it needs to recreate the same object, otherwise it raises a pickling error. If I use threading, it works, but there is no parallel execution at all. Below is my code.

import os
from io import StringIO

import polars as pl
import xlsx2csv
from joblib import Parallel, cpu_count, delayed, parallel_config
from joblib.externals.loky import get_reusable_executor


def read_custom_csv(sheet_list, xlsx2csv_options, source="test.xlsx"):
    # Force a fresh worker pool before starting.
    get_reusable_executor().shutdown(wait=True)
    parser = xlsx2csv.Xlsx2csv(source, **xlsx2csv_options)
    read_csv_options = {"infer_schema_length": 0, "truncate_ragged_lines": True}
    read_csv_options_wo_header = {
        "infer_schema_length": 0,
        "truncate_ragged_lines": True,
        "skip_rows": 1,
        "has_header": False,
    }
    excluded_sheets = ["A", "B", "C", "D"]

    core_count = cpu_count()
    # Environment variables are strings, so cast to int.
    n_jobs = int(os.environ.get("THREAD_COUNT", core_count // 2))
    print(f"Using {n_jobs} processes for loading sheets in parallel.")
    args = []
    for sheet in sheet_list:
        args.append(
            {
                "parser": parser,
                "sheet_name": sheet,
                "read_csv_options": read_csv_options_wo_header if sheet in excluded_sheets else read_csv_options,
            }
        )

    with parallel_config(backend="loky", n_jobs=n_jobs):
        results = Parallel(return_as="generator")(delayed(_read_excel_sheet)(**a) for a in args)
        # Consume the generator while the backend configuration is active.
        return {sheet[0]: sheet[1] for sheet in results}


def _read_excel_sheet(parser, sheet_name, read_csv_options):
    # Returns (sheet_name, DataFrame), or None if the sheet is empty.
    csv_buffer = StringIO()
    parser.convert(outfile=csv_buffer, sheetname=sheet_name)

    if csv_buffer.tell() != 0:
        csv_buffer.seek(0)
        return sheet_name, pl.read_csv(csv_buffer, **read_csv_options)
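
One possible workaround sketch, assuming the pickling error comes from sending the Xlsx2csv parser object to the loky workers: pass the file path and options instead, and rebuild the parser inside the worker so nothing unpicklable crosses the process boundary (_read_excel_sheet_picklable below is a hypothetical variant of _read_excel_sheet above):

import xlsx2csv
import polars as pl
from io import StringIO

def _read_excel_sheet_picklable(source, xlsx2csv_options, sheet_name, read_csv_options):
    # Recreate the parser inside the worker process instead of pickling it.
    parser = xlsx2csv.Xlsx2csv(source, **xlsx2csv_options)
    csv_buffer = StringIO()
    parser.convert(outfile=csv_buffer, sheetname=sheet_name)
    if csv_buffer.tell() != 0:
        csv_buffer.seek(0)
        return sheet_name, pl.read_csv(csv_buffer, **read_csv_options)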
