How to describe and use Network #1315

Closed
wangkuiyi opened this issue Feb 11, 2017 · 9 comments

@wangkuiyi
Collaborator

We had thought that a DL framework should implement concepts like model and cost, but we realized that these are not flexible enough to describe deep learning problems. Instead, we need the concept of a network. For more about this derivation, please refer to #1311.

In this issue, we are going to figure out how we should build a network and its parameters, and how we can train a network and use part of it (the model) for inference/serving.

@wangkuiyi wangkuiyi self-assigned this Feb 11, 2017
@wangkuiyi
Collaborator Author

wangkuiyi commented Feb 11, 2017

Here is a summary of an idea from @helinwang and @emailweixu that changes the concepts listed in #1297 into the following:

  1. No concept of Model; instead, we introduce Network. The reason is listed here.
  2. A Network consists of a topology and parameters, but Network itself is not the essential concept; the topology and parameters are.
  3. Layers in the same network might share parameters; an example is shown here, and
  4. Layers of different networks might share parameters too, as the GAN example presented later will show.

For how to describe networks and how to use them for convenient training, testing, and inference/serving, please see the following comments.
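A minimal sketch of point 2, the separation of topology and parameters, using the API names proposed in the examples below; the layer names and some_reader are only illustrative:

x = paddle.layer.data(input_name="x")
y = paddle.layer.fc(x, parameter_name="w")        # the topology describes computation only
parameters = paddle.parameters.create(y)          # parameters are created and stored separately
paddle.train(y, parameters, reader=some_reader)   # training binds a topology to a parameter set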

@wangkuiyi wangkuiyi changed the title How to express and use Network How to describe and use Network Feb 11, 2017
@wangkuiyi
Collaborator Author

wangkuiyi commented Feb 12, 2017

Example 1. Sharing Parameters between Layers

We use the 3-branch ranking model in this example. For your convenience, I copy-and-paste the model's topology as follows:

A -> f -\
Q -> f --> cost
B -> f -/

The following program trains the topology including the cost, and then uses a sub-network of the trained topology for inference:

def f(x):
    e = paddle.layer.embedding(x, parameter_name="embedding")
    o = paddle.layer.softmax(e, parameter_name="semantic")
    return o

# Create 3 topologies (subnets); they share parameters because all
# corresponding layers have the same parameter names.
fA = f(paddle.layer.data(input_name="A"))
fB = f(paddle.layer.data(input_name="B"))
fQ = f(paddle.layer.data(input_name="Q"))

topology = paddle.layer.less_than(
               paddle.layer.cross_entropy(fA, fQ),
               paddle.layer.cross_entropy(fB, fQ))

# Derive the parameters required by the topology and create them.
parameters = paddle.parameters.create(topology)

# Estimate parameters used in topology from data.
paddle.train(topology, parameters, reader=read_ranking_model_data)

# Inference using fA (or fB or fQ, as they share their parameters).
[testA, testB, testQ] = read_ranking_model_data()
print "The sematic-vector of testA: ", paddle.infer(fA, parameters, testA)

@wangkuiyi
Collaborator Author

wangkuiyi commented Feb 13, 2017

Example 2. Sharing Parameters between "Models"

We use GAN in this example. In the following example program, d0 and d1 correspond to the two training topologies of the GAN:

def G(x):
    # over-simplified example, as G has only one layer:
    return paddle.layer.fc(x, parameter_name="G")

def D(x, parameters_mutable):
    # again, over-simplified:
    return paddle.layer.fc(x, parameter_name="D",
                           parameters_mutable=parameters_mutable)

# Construct the first topology, which contains both D and G.
# By learning this topology, we update parameters of G.
d0 = paddle.layer.should_be_false(
         D(G(paddle.layer.data()),
           False))  # Don't update the parameters of D here.

# Construct a second topology d1, which contains only D. By
# training this topology, we update the parameters of D. Note
# that d1 shares parameters with d0.
d1 = paddle.layer.should_be_true(D(paddle.layer.data()))

# Create parameters from a list of multiple topologies (models) so that
# parameters can be shared among these topologies.
parameters = paddle.parameters.create([d0, d1])

# Iterative training of GAN.
for ...:
    train(d0, parameters, reader=read_from_rng)
    train(d1, parameters, reader=read_from_realistic_images)

# Use d1 for inference:
print "D thinks a batch of images are realistic ", infer(d1, parameters, read_mnist_images)

@reyoung
Collaborator

reyoung commented Feb 13, 2017

Maybe a parameter pool (parameters in the above code) plus network topologies is a good abstraction?

Neural network to be trained = a parameter pool + a training network topology.
Inference neural network = the same parameter pool + an inference network topology.

Is the Model or NeuralNetwork an important concept?
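Read against the examples above, this abstraction might look like the following sketch, which reuses the paddle.parameters.create / paddle.train / paddle.infer names from the earlier comments; train_topology, infer_topology, train_reader, and test_data are only illustrative:

# Sketch: one shared parameter pool, two topologies.
pool = paddle.parameters.create([train_topology, infer_topology])

# To-be-trained neural network = pool + training topology.
paddle.train(train_topology, pool, reader=train_reader)

# Inference neural network = the same pool + inference topology.
result = paddle.infer(infer_topology, pool, test_data)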

@helinwang
Contributor

helinwang commented Feb 13, 2017

Maybe instead of specifying which parameters not to update here:

d0 = paddle.layer.should_be_false(
         D(G(paddle.layer.data()),
           False)) # Don't update the parameter of D here.

We could specify in train which parameters to update, and by default update all of them.
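A sketch of what that could look like; the update_parameters argument is hypothetical and not an existing API, and D here is assumed to no longer need the parameters_mutable flag:

# Hypothetical sketch: choose the trainable parameters at train time instead
# of marking D immutable inside the topology.
d0 = paddle.layer.should_be_false(D(G(paddle.layer.data())))

paddle.train(d0, parameters,
             reader=read_from_rng,
             update_parameters=["G"])  # hypothetical argument; default would be to update all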

@reyoung
Collaborator

reyoung commented Feb 13, 2017

Inside the train function, add an event_handler callback.

Attached is the code from our earlier discussion:

def train_reader():
    yield {'pixel': pixels, 'label': labels}  # returns a data batch.

# The observe callback is used for plotting or logging the training process.
# The event parameter can be of various types; intermediate training results
# are carried in the event instance.
def callback(event):
    if isinstance(event, FinishTrainOneBatch):
        print event.pass_id, event.batch_id, "Cost = ", event.cost, "Error Rate = ", event.metric[0]
        print "output layer's output is ", event.activation['output']
        if event.batch_id % 1000 == 0:  # We could even save a checkpoint in the callback.
            with open('check_point_%d' % event.batch_id, 'w') as stream:
                optimizer.check_point(stream)
    else:
        pass

optimizer.train(train_reader=train_reader,
                test_reader=None,  # The test reader shares the same format as the train
                                   # reader; it could be None if there is no test data.
                cost=CrossEntropy(input=model.topology.output_layer,  # the network's output layer.
                                  label=DataReader("label")),  # The label comes from the data reader's 'label' field.
                metric=[ErrorRateMetric(input=model.topology.output_layer, label=DataReader("label"))],  # same logic as above
                observe_callback=callback)

@helinwang
Contributor

Added issue for separating updater and trainer: #1319

@jacquesqiao
Member

Maybe we need to put cost-related layers in a special namespace, like

paddle.layer.cost.cross_entropy
paddle.layer.cost.less_than
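For instance, the ranking cost from Example 1 would then be written as follows (illustrative only):

# Illustrative: Example 1's cost rewritten with the proposed cost namespace.
topology = paddle.layer.cost.less_than(
               paddle.layer.cost.cross_entropy(fA, fQ),
               paddle.layer.cost.cross_entropy(fB, fQ))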

@wangkuiyi
Collaborator Author

@reyoung Regarding your comment: it seems that given the event_handler mechanism, we don't need to pass metrics to the train function; instead, we can calculate those metrics in the event_handler and plot them when necessary?
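A minimal sketch of that idea, reusing the event fields from the callback example above; the running-average bookkeeping and the event_handler argument name are assumptions:

# Sketch: compute metrics inside the event handler instead of passing a
# metric list to train().  event.cost and FinishTrainOneBatch follow the
# callback example above; the event_handler argument name is an assumption.
recent_costs = []

def event_handler(event):
    if isinstance(event, FinishTrainOneBatch):
        recent_costs.append(event.cost)
        if event.batch_id % 100 == 0:
            print "batch", event.batch_id, "average cost =", sum(recent_costs) / float(len(recent_costs))
            del recent_costs[:]

paddle.train(topology, parameters, reader=read_ranking_model_data,
             event_handler=event_handler)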

wangkuiyi added a commit that referenced this issue Feb 14, 2017
Update API design doc according to discussions in issue #1315