RFE: Move transaction processing to a subprocess #1526

DemiMarie · 2021-02-03T00:46:59Z

As described in #1483, performing an RPM transaction from a multithreaded process will very likely result in Undefined Behavior. Furthermore, if RPM performs any database operations with an altered root directory, this will also result in Undefined Behavior, as SQLite will use an incorrect WAL.

This can be fixed by moving all transaction processing to a subprocess. Due to POSIX restrictions on fork() in a multi-threaded process, this subprocess would need to be a separate binary, and would use stdio to communicate with the parent.

The text was updated successfully, but these errors were encountered:

Conan-Kudo · 2021-02-03T03:15:15Z

Subprocessing a transaction would make this much more brittle, since it would expose RPM to weaknesses in POSIX itself wrt system software data replacement. Think for example if rpm is upgrading rpm: having a separate binary means that we need to design a complex method to handle that the rpm binaries are being replaced, rather than relying on the in-memory program data footprint that comes from DNF using librpm and holding it in memory while it works through everything.

RPM already has locking semantics to implement a "write once, read many" setup, so I'm not sure we actually need to do much more than beef this up with the SQLite database backend.

DemiMarie · 2021-02-03T04:35:27Z

If a single subprocess is used for the entire transaction, then I imagine those problems would go away.

Conan-Kudo · 2021-02-04T11:45:12Z

That gets us back to square one, though. Making this MT safe is effectively pointless since we're still constrained to one process no matter what.

DemiMarie · 2021-02-04T18:09:50Z

Not really. The difference is that the subprocess would be created and managed by librpm itself. That means that librpm itself is thread-safe, which is a hard requirement for embedding librpm in certain scenarios.

Conan-Kudo · 2021-02-04T18:11:25Z

What scenario do you want to embed librpm that requires this that would also do transactions? Because pretty much all MT-safe operations would generally not require doing transactions...

DemiMarie · 2021-02-04T18:13:43Z

I (and rpm-ostree) want to be able to run a transaction from a multi-threaded parent process. This is only possible if the actual transaction is done in a child process managed by librpm.

DemiMarie · 2021-02-04T18:19:07Z

Also, Rust, Java, .NET, glib, and several other languages, runtimes, and frameworks require that all code must be thread-safe, full stop. A Java or .NET VM will always have multiple threads running, and a GTK or QT application must assume that it will. Using the RPM transaction APIs from such a process is currently Undefined Behavior. Rust programs are not all multi-threaded, but Rust libraries are required to work in multithreaded programs, which means that the librpm.rs bindings are probably unsound.

Conan-Kudo · 2021-02-04T18:21:15Z

Using the RPM transaction APIs from such a process is currently Undefined Behavior.

All such environments you listed also provide a way to constrain threading behavior when you need to, because it's unrealistic to actually mandate that at the layers below it. Even Python, Perl, and Ruby have this. I know Java definitely does.

The phrase "undefined behavior" (in title case or no) isn't enough in itself to justify breaking the librpm architecture.

DemiMarie · 2021-02-04T18:25:41Z

Using the RPM transaction APIs from such a process is currently Undefined Behavior.

All such environments you listed also provide a way to constrain threading behavior when you need to, because it's unrealistic to actually mandate that at the layers below it. Even Python, Perl, and Ruby have this. I know Java definitely does.

The phrase "undefined behavior" (in title case or no) isn't enough in itself to justify breaking the librpm architecture.

Java, at least, does not support programs that call chdir(), much less chroot(). #1483 (comment) is an example of this being a problem in the real world.

pmatilai · 2021-02-12T12:17:51Z

This comes up every now and then. Running the transaction in a sub-process would of course be the sane thing to do, but within the existing rpm architecture it's quite impossible to do in rpm itself. We have no plans to work on this.

DemiMarie · 2021-02-12T20:52:11Z

@pmatilai what about RPMv6? Asking because RPMv6 can break backwards compat.

Right now anyone wanting to run RPM transactions from a multithreaded process needs to do this themselves.

DemiMarie · 2021-02-19T18:27:25Z

This comes up every now and then. Running the transaction in a sub-process would of course be the sane thing to do, but within the existing rpm architecture it's quite impossible to do in rpm itself. We have no plans to work on this.

Would you mind explaining? I am a bit confused what you mean by “rpm architecture” here. Could the RPM CLI be expanded to do everything that can be done via the API?

pmatilai closed this as completed Feb 12, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

RFE: Move transaction processing to a subprocess #1526

RFE: Move transaction processing to a subprocess #1526

DemiMarie commented Feb 3, 2021 •

edited

Conan-Kudo commented Feb 3, 2021

DemiMarie commented Feb 3, 2021

Conan-Kudo commented Feb 4, 2021

DemiMarie commented Feb 4, 2021

Conan-Kudo commented Feb 4, 2021

DemiMarie commented Feb 4, 2021

DemiMarie commented Feb 4, 2021

Conan-Kudo commented Feb 4, 2021

DemiMarie commented Feb 4, 2021

pmatilai commented Feb 12, 2021

DemiMarie commented Feb 12, 2021

DemiMarie commented Feb 19, 2021

RFE: Move transaction processing to a subprocess #1526

RFE: Move transaction processing to a subprocess #1526

Comments

DemiMarie commented Feb 3, 2021 • edited

Conan-Kudo commented Feb 3, 2021

DemiMarie commented Feb 3, 2021

Conan-Kudo commented Feb 4, 2021

DemiMarie commented Feb 4, 2021

Conan-Kudo commented Feb 4, 2021

DemiMarie commented Feb 4, 2021

DemiMarie commented Feb 4, 2021

Conan-Kudo commented Feb 4, 2021

DemiMarie commented Feb 4, 2021

pmatilai commented Feb 12, 2021

DemiMarie commented Feb 12, 2021

DemiMarie commented Feb 19, 2021

DemiMarie commented Feb 3, 2021 •

edited