When execute command `seed:run`, insert all data to a table in a bulk. #1148

kenzo0107 · 2017-08-15T10:34:27Z

When execute command seed:run, insert all data to a table in a bulk.
It take much time to insert data to a table one by one until now.

An example is shown below:

$table = $this->table('users');
$table->bulk()
      ->insert($data)
      ->save();

It became more than triple as fast by Bulk Insert when inserting 1,000 records.
It is very effective when many data samples are required.

However, it is assumed that the keys of data to be set are unified.
The method bulk() are not available in the following example:

$data = array(
    array(
        'column1' => 'value1',
        'column2' => 1,
    ),
    array(
        'column1' => 'value2',
        'column3' => 'foo',
    ),
    array(
        'column1' => 'value3',
        'column2' => 3,
        'column3' => 'foo',
    )
);

$table = $this->table('users');
$table->bulk()
  ->insert($data)
  ->save();

The function of dynamically checking the key of the set data is not added,
because it will cause performance degradation.

It take much time to insert data to a table one by one until now. An example is shown below: ``` $table = $this->table('users'); $table->bulk() ->insert($data) ->save(); ``` It became more than triple as fast by Bulk Insert when inserting 1,000 records. It is very effective when many data samples are required.

lorenzo

Is there a way you can implement this without adding the bulk flag? I'm not sure it makes for a good API

lorenzo · 2017-08-16T12:00:54Z

src/Phinx/Db/Table.php

@@ -75,6 +75,11 @@ class Table
    protected $data = array();

    /**
+     * @var boolean
+     */
+    protected $bulk = false;


I don't understand what this flag is for

This flag is prepared to judge whether to use the bulk insert function.

Why not always use it?

I see.
There is a way I can implement this without adding the bulk flag.
Thx.

chinpei215 · 2017-08-16T17:50:24Z

src/Phinx/Db/Adapter/PdoAdapter.php

+        $keys = array_keys($current);
+        $sql .= "(". implode(', ', array_map(array($this, 'quoteColumnName'), $keys)) . ") VALUES";
+
+        $objTmp = (object) array('aFlat' => array());


aFlat

Typo?

Oh, typo !
fix it !

chinpei215 · 2017-08-16T17:51:49Z

src/Phinx/Db/Adapter/PdoAdapter.php

+        $sql .= "(". implode(', ', array_map(array($this, 'quoteColumnName'), $keys)) . ") VALUES";
+
+        $objTmp = (object) array('aFlat' => array());
+        array_walk_recursive($rows, create_function('&$v, $k, &$t', '$t->flat[] = $v;'), $objTmp);


Why don't you use a Closure? Phinx seems to require PHP >= 5.4. And if you use a closure, $objTmp is unnecessary, as it can inherit variables form the parent scope by reference. And $rows would be a two-dimensional array. You need not handle it recursively. If I were you, I would use foreach in this case:

$vals = []; foreach ($rows as $row) { foreach ($row as $v) { $vals[] = $v; } }

Also, I think you should make sure that all rows have the same keys. I think it doesn't make a big impact on performance, but dose prevent users' mistakes.

Thank you for your review !

I tried to simply coded in one line.
But there was not enough consideration to Phinx.

Fix it !

Thx.

fallaten values in data simply. And typo, sorry.

set prefix adapter to a table when bulk insert.

By checking the keys in data to be inserted, and when all the keys are matched, execute bulk insert.

… into feature/bulkinsert

kenzo0107 · 2017-08-17T06:45:43Z

@lorenzo

I could implement Bulk Insert without adding the bulk flag 🤓
Please review.

lorenzo · 2017-08-21T07:45:16Z

I think the only thing this is missing is tests showing that the method actually works

kenzo0107 · 2017-08-21T08:04:41Z

@lorenzo

Thank you for review !

I added to test the new method with the following method.

tests/Phinx/Db/Adapter/TablePrefixAdapterTest::testInsertData
tests/Phinx/Db/TableTest::testInsertSaveData
tests/Phinx/Migration/AbstractMigrationTest::testInsert

Is it necessary elsewhere ?

lorenzo · 2017-08-30T19:44:53Z

@kenzo0107 I didn't see any tests where the method was actually used. Only mocked tests were used, so the actual implementation is not tested

add test of method bulkinsert to each adaptor.

kenzo0107 · 2017-08-31T06:30:59Z

@lorenzo Thank you for your review. I added some tests where the method is actually used.
Please review again.

lorenzo · 2017-08-31T08:27:57Z

Thanks @kenzo0107 !

lorenzo · 2017-08-31T08:28:27Z

@kenzo0107 Would you mind sending another pull request with a short example for the docs?

kenzo0107 · 2017-08-31T08:38:24Z

@lorenzo Thank you 😄 I will send another pull request soon.

lorenzo requested changes Aug 16, 2017

View reviewed changes

kenzo0107 force-pushed the feature/bulkinsert branch from 2c27f6a to 7368363 Compare August 16, 2017 16:41

chinpei215 reviewed Aug 16, 2017

View reviewed changes

mpg-kenzo-tanaka added 5 commits August 17, 2017 12:29

[Update] flatten values in data by using Closure

7741a86

fallaten values in data simply. And typo, sorry.

[Update] set prefix adapter when bulk insert

2c570f2

set prefix adapter to a table when bulk insert.

[Update] determine automatically bulk insert or normal insert

c67436c

By checking the keys in data to be inserted, and when all the keys are matched, execute bulk insert.

[Update] determine automatically bulk insert or normal insert

1e6e8df

By checking the keys in data to be inserted, and when all the keys are matched, execute bulk insert.

Merge branch 'feature/bulkinsert' of https://github.com/kenzo0107/phinx…

d3761af

… into feature/bulkinsert

[Update] Add tests of method bulkinsert.

468e0f9

add test of method bulkinsert to each adaptor.

lorenzo merged commit 17ec982 into cakephp:master Aug 31, 2017

kenzo0107 mentioned this pull request Sep 26, 2017

I want to add docs about bulk insert with a short example. #1194

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

When execute command `seed:run`, insert all data to a table in a bulk. #1148

When execute command `seed:run`, insert all data to a table in a bulk. #1148

kenzo0107 commented Aug 15, 2017

lorenzo left a comment

lorenzo Aug 16, 2017

kenzo0107 Aug 16, 2017

lorenzo Aug 16, 2017

kenzo0107 Aug 17, 2017

chinpei215 Aug 16, 2017

kenzo0107 Aug 17, 2017

chinpei215 Aug 16, 2017

kenzo0107 Aug 17, 2017

kenzo0107 commented Aug 17, 2017 •

edited

lorenzo commented Aug 21, 2017

kenzo0107 commented Aug 21, 2017 •

edited

lorenzo commented Aug 30, 2017

kenzo0107 commented Aug 31, 2017

lorenzo commented Aug 31, 2017

lorenzo commented Aug 31, 2017

kenzo0107 commented Aug 31, 2017

When execute command seed:run, insert all data to a table in a bulk. #1148

When execute command seed:run, insert all data to a table in a bulk. #1148

Conversation

kenzo0107 commented Aug 15, 2017

lorenzo left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

kenzo0107 commented Aug 17, 2017 • edited

lorenzo commented Aug 21, 2017

kenzo0107 commented Aug 21, 2017 • edited

lorenzo commented Aug 30, 2017

kenzo0107 commented Aug 31, 2017

lorenzo commented Aug 31, 2017

lorenzo commented Aug 31, 2017

kenzo0107 commented Aug 31, 2017

When execute command `seed:run`, insert all data to a table in a bulk. #1148

When execute command `seed:run`, insert all data to a table in a bulk. #1148

kenzo0107 commented Aug 17, 2017 •

edited

kenzo0107 commented Aug 21, 2017 •

edited