Dump BDDs from `dd.cudd.BDD` to a JSON file, and load them too. A two-stage approach seems most promising:

1. dump the BDD to a `shelve` file, and
2. iteratively load the shelf file, and dump it iteratively to a JSON file.
The first step uses the shelf as a cache for the depth-first traversal (typically implemented recursively) of the BDD, to find which nodes to dump. We cannot know which nodes to dump without first finding those reachable from the roots we want to dump. Naively, we could traverse the BDD and dump each reachable node as we encounter it. Due to a BDD's structure as a DAG with shared subgraphs, this can result in an exponential amount of duplicate work.
This is why visited nodes are memoized, which corresponds to maintaining a set of "visited" nodes in a graph search. In other algorithmic applications, one would simply mark each visited node as visited. In CUDD, there is no spare space in a node for such "marking". We could instead add the visited nodes to a separate set in main memory. But in demanding use cases CUDD fills most of main memory, so this isn't possible: it would essentially duplicate, within main memory, the BDD we want to save.
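The memoized traversal can be sketched as follows, here with a plain in-memory store (exactly what main memory cannot afford in the demanding cases above, but it isolates the idea; the `(level, low, high)` node encoding is hypothetical, not CUDD's representation):

```python
def collect_nodes(root, visited):
    """Depth-first traversal of a BDD-like DAG.

    Without the `visited` check, a subgraph shared by several
    parents would be re-traversed once per path that reaches it,
    which is exponential in the worst case.

    Nodes are hypothetical `(level, low, high)` tuples;
    terminal nodes are represented by `None` children.
    """
    if root is None or id(root) in visited:
        return
    visited[id(root)] = root  # memoize: each node is handled once
    _level, low, high = root
    collect_nodes(low, visited)
    collect_nodes(high, visited)

# a tiny DAG with sharing: `shared` is reachable via both branches
shared = (2, None, None)
root = (1, shared, shared)
visited = {}
collect_nodes(root, visited)
assert len(visited) == 2  # `shared` is collected only once
```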
I think that DDDMP approaches this problem by removing and adding nodes to the unique table. I consider this undesirable, because it affects the existing cache (hashing information, repeatability, etc.). A dumping operation is extraneous to the BDD manager, so it shouldn't have side effects on the manager.
Main memory cannot serve for storing "visited" information during traversal without interfering with CUDD, but there's the disk. Nowadays, the disk is vastly larger than main memory. So, why not use the disk to store the "visited" status of each node? (For example, the enumerative model checker TLC uses the disk to store the state space.)
Even better, since all we want to do is dump to the disk, why not use the target file itself as the store of "visited" information? The only challenge is a `dict`-like interface for quickly checking containment of nodes in the file. The `shelve` module from Python's standard library provides exactly this interface.
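A sketch of this idea, with a `shelve` file serving both as the dump target and as the on-disk "visited" store (the string keys and the `(level, low, high)` node encoding are hypothetical, not `dd`'s actual format):

```python
import os
import shelve
import tempfile

def dump_nodes(root, shelf):
    """Traverse a BDD-like DAG, using the shelf itself as the
    set of visited nodes: containment checks and insertions go
    to the file on disk, not to a structure in main memory.

    Nodes are hypothetical `(level, low, high)` tuples;
    terminals have `None` children. Shelf keys must be strings.
    """
    if root is None:
        return
    key = str(id(root))
    if key in shelf:  # dict-like containment check, backed by the file
        return
    shelf[key] = root  # record the node in the target file itself
    _level, low, high = root
    dump_nodes(low, shelf)
    dump_nodes(high, shelf)

tmpdir = tempfile.mkdtemp()
path = os.path.join(tmpdir, 'bdd_shelf')
shared = (2, None, None)
root = (1, shared, shared)
with shelve.open(path) as shelf:
    dump_nodes(root, shelf)
    n_nodes = len(shelf)
assert n_nodes == 2  # the shared node was dumped only once
```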
The second step is just a conversion of the entire shelf file to a JSON file. In other words, the first step identifies and isolates the information that we want to store, and the second step puts this information in the target file format.
`ijson` seems suitable for step 2. Compared to `json-streamer`, `ijson` is preferred.
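Step 2 can be sketched as a streaming conversion: iterate over the shelf entries and write one JSON object member per node, so memory use stays bounded by a single node rather than by the BDD's size (`ijson` would play the symmetric streaming role when loading the JSON file back; the node encoding below is hypothetical):

```python
import json
import os
import shelve
import tempfile

def shelf_to_json(shelf_path, json_path):
    """Iteratively convert a shelf of BDD nodes to a JSON file.

    Entries are written one at a time, so the whole node table
    is never materialized in main memory.
    """
    with shelve.open(shelf_path) as shelf, \
            open(json_path, 'w') as f:
        f.write('{')
        first = True
        for key, node in shelf.items():
            if not first:
                f.write(',')
            first = False
            f.write(json.dumps(key))
            f.write(':')
            f.write(json.dumps(node))
        f.write('}')

tmpdir = tempfile.mkdtemp()
shelf_path = os.path.join(tmpdir, 'nodes_shelf')
json_path = os.path.join(tmpdir, 'nodes.json')
with shelve.open(shelf_path) as shelf:
    # hypothetical encoding: level, then keys of low/high successors
    shelf['1'] = [1, '2', '2']
    shelf['2'] = [2, None, None]
shelf_to_json(shelf_path, json_path)
with open(json_path) as f:
    loaded = json.load(f)
assert loaded == {'1': [1, '2', '2'], '2': [2, None, None]}
```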
Addressed in 61b60bf, specifically by the functions `dump_json` and `load_json`. The shelve file is used as the cache during the traversal that dumps nodes from the BDD manager to the JSON file.