[Question] Customize the number of input/output variables in generated graphs #117

ntcmp2u · 2023-06-06T15:11:42Z

Hi. I am currently learning to use nnsmith. I would like to know if I can customize the generated graph as a graph with only a single input variable and a single output variable.

Is there any way to quickly implement this?

ganler · 2023-06-06T16:55:59Z

Do you want to only generate a chain of operators? Or you hope at model-wise you only need to create one input and compare one output?

Both of these need to be implemented. Just let me know :)

lazycal · 2023-06-06T17:06:06Z

Hi @ntcmp2u , thanks for your interest in nnsmith. I assume you mean "model-wise" single input output.

For single input, one quick way you could try is to set the forward_prob to 1 here:

nnsmith/nnsmith/graph_gen.py

Line 53 in 1031c85

self.forward_prob = 0.5 if forward_prob is None else forward_prob

, though this might hurt some diversity in the graph structures (since no backward_gen any more).
For single output there might be no easy way and need some proper reimplementation. Probably we need to change this function

nnsmith/nnsmith/graph_gen.py

Line 337 in 1031c85

def pick_var_group(

, which is used to pick after what operator should we insert the next one.

ntcmp2u · 2023-06-06T17:31:51Z

@ganler @lazycal Thank you so much for quick response. Yes, what I mean is the "model-wise" single input output. Although this may impact the diversity of generated graphs, I believe that it could be valuable to simulate a real-world senario -- most of time our trained model is a single input model. I will investigate the pick_var_group function to implement it.

Thank you again for your assist.

ganler · 2023-06-06T20:34:39Z

There are two ways to make a model produce only one output. The first way is what I said to make only one leaf node in the graph. Another simple way is to "cheat" by only marking one value in the graph as output (others could be cut by DCE).

Because you are talking about real-world scenarios, I bet you are talking about the first one. For that pick_var_groups is not the right place to work on and it is completely unnecessary to hack the codebase (which is hard too), as I made it essentially for grouping connections to rank-dtype compatible variables/placeholders while your desired constraint is on model topology. The easiest way IMO to achieve the topological constraint is to (i) generate a large model; and (ii) picking an intermediate value, marking it alive and then do a use-def analysis + only preserve its usee chain (this is similar to clipping a subgraph from a large one such that the subgraph is single i/o). This is very easy in NNSmith as the GraphIR is an SSA and has built-in use-def analysis support.

That being said, let me know if you want me to quickly implement that for you and then you shall be able to go for it with a commandline flag. Of course you are always welcome to try it on your own and even upstream the patch.

NB: for real-world-like models or whatever structure in user intents, we are building a DSL for describing any desired model patterns. It is not going to ship very recently but yeah stay tuned :)

ntcmp2u · 2023-06-08T12:16:26Z

let me know if you want me to quickly implement that for you and then you shall be able to go for it with a commandline flag. Of course you are always welcome to try it on your own and even upstream the patch.

@ganler I tried to implement it for a few hours but failed to do it in a right way. Perhaps I still need time to get familiar with NNSmith's implementation. If possible, could you implement that and export a command line flag? Thank you so much for assisting.

ganler · 2023-06-08T17:04:18Z

@ntcmp2u No worries at all. I will find a time this weekend. -- And don't feel frustrated as it is hard to extend on big and weakly documented codebase in the beginning (enhancing the doc is a longer-term plan... since it is currently maintained/developed by a very small team). Meanwhile feel free to post any questions regarding the implementation if you are interested. Thanks.

ntcmp2u · 2023-07-16T08:02:02Z

@ganler Hi, sorry for bothering you. Just want to know if the one-leaf generation is implemented. If you can't find a time, perhaps I can try to understand the use-def analysis and implement one.

ganler · 2023-07-18T08:38:05Z

@ntcmp2u Sorry for the delay (I totally forgot that... I need to update my TODO list more timely lol).

Once #119 is merged you can try:

python nnsmith/cli/model_gen.py model.type=torch backend.type="torchjit" debug.viz=1 mgen.method="single-io-cinit" mgen.max_nodes=10

It shall get you something like:

ntcmp2u · 2023-07-21T12:57:32Z

@ganler Thank you so much for the assist!

ntcmp2u closed this as completed Jun 6, 2023

ganler mentioned this issue Jul 18, 2023

feat: impl single input-output gen via backward-cut #119

Merged

ganler mentioned this issue Apr 10, 2024

[Help Wanted] How to only generate sequential models #138

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Question] Customize the number of input/output variables in generated graphs #117

[Question] Customize the number of input/output variables in generated graphs #117

ntcmp2u commented Jun 6, 2023

ganler commented Jun 6, 2023

lazycal commented Jun 6, 2023

ntcmp2u commented Jun 6, 2023

ganler commented Jun 6, 2023

ntcmp2u commented Jun 8, 2023 •

edited

Loading

ganler commented Jun 8, 2023

ntcmp2u commented Jul 16, 2023

ganler commented Jul 18, 2023 •

edited

Loading

ntcmp2u commented Jul 21, 2023

[Question] Customize the number of input/output variables in generated graphs #117

[Question] Customize the number of input/output variables in generated graphs #117

Comments

ntcmp2u commented Jun 6, 2023

ganler commented Jun 6, 2023

lazycal commented Jun 6, 2023

ntcmp2u commented Jun 6, 2023

ganler commented Jun 6, 2023

ntcmp2u commented Jun 8, 2023 • edited Loading

ganler commented Jun 8, 2023

ntcmp2u commented Jul 16, 2023

ganler commented Jul 18, 2023 • edited Loading

ntcmp2u commented Jul 21, 2023

ntcmp2u commented Jun 8, 2023 •

edited

Loading

ganler commented Jul 18, 2023 •

edited

Loading