docs/en/connector-v2/sink/FtpFile.md
Lines changed: 7 additions & 0 deletions
@@ -39,6 +39,7 @@ By default, we use 2PC commit to ensure `exactly-once`
 | is_partition_field_write_in_file | boolean | no | false |
 | sink_columns | array | no | When this parameter is empty, all fields are sink columns |
 | is_enable_transaction | boolean | no | true |
+| batch_size | int | no | 1000000 |
 | common-options || no | - |
 
 ### host [string]
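The new table row declares `batch_size` as an optional `int` with a default of 1000000. A minimal sketch of that fallback follows, assuming the sink options arrive as a plain `Map`; the real connector reads its configuration differently, and `BatchSizeOptionSketch`, `resolveBatchSize`, and the positive-value check are hypothetical names and behavior introduced only for illustration.

```java
import java.util.HashMap;
import java.util.Map;

/** Hypothetical helper for illustration; not part of the SeaTunnel code base. */
public class BatchSizeOptionSketch {
    public static final String BATCH_SIZE_KEY = "batch_size";
    // Default taken from the options table in the docs.
    public static final int DEFAULT_BATCH_SIZE = 1_000_000;

    /** Returns the configured batch size, or the documented default when the key is absent. */
    public static int resolveBatchSize(Map<String, Object> sinkOptions) {
        Object raw = sinkOptions.get(BATCH_SIZE_KEY);
        if (raw == null) {
            return DEFAULT_BATCH_SIZE;
        }
        int value = Integer.parseInt(raw.toString().trim());
        // Assumption: a non-positive row limit is treated as a configuration error.
        if (value <= 0) {
            throw new IllegalArgumentException("batch_size must be positive, got " + value);
        }
        return value;
    }

    public static void main(String[] args) {
        Map<String, Object> options = new HashMap<>();
        System.out.println(resolveBatchSize(options)); // key absent: falls back to the default
        options.put(BATCH_SIZE_KEY, 5000);
        System.out.println(resolveBatchSize(options)); // explicit value wins
    }
}
```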
@@ -127,6 +128,10 @@ Please note that, If `is_enable_transaction` is `true`, we will auto add `${tran
 
 Only support `true` now.
 
+### batch_size [int]
+
+The maximum number of rows in a file. For SeaTunnel Engine, the number of rows in a file is determined jointly by `batch_size` and `checkpoint.interval`. If `checkpoint.interval` is large enough, the sink writer keeps writing rows to a file until the number of rows in the file exceeds `batch_size`. If `checkpoint.interval` is small, the sink writer creates a new file each time a new checkpoint is triggered.
+
 ### common options
 
 Sink plugin common parameters, please refer to [Sink Common Options](common-options.md) for details.
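As a reading aid for the `batch_size` description in this hunk, here is a minimal Java sketch of how the two triggers might interact. It is not the connector's code: the class `BatchRollSketch`, its methods, and the `rowsPerCheckpoint` parameter (a stand-in for `checkpoint.interval`, expressed in rows rather than time) are hypothetical, and it assumes a file is closed once it holds `batch_size` rows.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical model of a sink writer that rolls files on batch_size or on checkpoint. */
public class BatchRollSketch {
    private final int batchSize;
    private final List<Integer> closedFileRowCounts = new ArrayList<>();
    private int rowsInCurrentFile = 0;

    public BatchRollSketch(int batchSize) {
        this.batchSize = batchSize;
    }

    /** Writes one row; starts a new file first if the current one is already full. */
    public void write() {
        if (rowsInCurrentFile >= batchSize) {
            roll();
        }
        rowsInCurrentFile++;
    }

    /** A checkpoint closes the current file even if it is not full. */
    public void checkpoint() {
        roll();
    }

    /** Records the row count of the current file (if non-empty) and resets the counter. */
    private void roll() {
        if (rowsInCurrentFile > 0) {
            closedFileRowCounts.add(rowsInCurrentFile);
            rowsInCurrentFile = 0;
        }
    }

    /**
     * Writes totalRows rows, triggering a checkpoint every rowsPerCheckpoint rows
     * (must be > 0) and once more at the end, then returns the rows per closed file.
     */
    public static List<Integer> simulate(int batchSize, int totalRows, int rowsPerCheckpoint) {
        BatchRollSketch writer = new BatchRollSketch(batchSize);
        for (int i = 1; i <= totalRows; i++) {
            writer.write();
            if (i % rowsPerCheckpoint == 0) {
                writer.checkpoint();
            }
        }
        writer.checkpoint(); // final flush of any partially filled file
        return writer.closedFileRowCounts;
    }

    public static void main(String[] args) {
        // Rare checkpoints: batch_size decides the file boundaries.
        System.out.println(simulate(3, 10, 100)); // [3, 3, 3, 1]
        // Frequent checkpoints: files are cut before batch_size is ever reached.
        System.out.println(simulate(100, 10, 4)); // [4, 4, 2]
    }
}
```

In the first call the checkpoint never fires before the data ends, so files fill up to the row limit; in the second the limit is never reached and every checkpoint produces a smaller file.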
@@ -173,3 +178,5 @@ FtpFile {
 - When field from upstream is null it will throw NullPointerException
 - Sink columns mapping failed
 - When restore writer from states getting transaction directly failed
+
+-[Improve] Support setting batch size for every file ([3625](https://github.com/apache/incubator-seatunnel/pull/3625))
docs/en/connector-v2/sink/HdfsFile.md
Lines changed: 8 additions & 1 deletion
@@ -38,6 +38,7 @@ In order to use this connector, You must ensure your spark/flink cluster already
 | is_partition_field_write_in_file | boolean | no | false |
 | sink_columns | array | no | When this parameter is empty, all fields are sink columns |
 | is_enable_transaction | boolean | no | true |
+| batch_size | int | no | 1000000 |
 | common-options || no | - |
 
 ### fs.defaultFS [string]
@@ -115,6 +116,10 @@ Please note that, If `is_enable_transaction` is `true`, we will auto add `${tran
 
 Only support `true` now.
 
+### batch_size [int]
+
+The maximum number of rows in a file. For SeaTunnel Engine, the number of rows in a file is determined jointly by `batch_size` and `checkpoint.interval`. If `checkpoint.interval` is large enough, the sink writer keeps writing rows to a file until the number of rows in the file exceeds `batch_size`. If `checkpoint.interval` is small, the sink writer creates a new file each time a new checkpoint is triggered.
+
 ### common options
 
 Sink plugin common parameters, please refer to [Sink Common Options](common-options.md) for details
@@ -195,4 +200,6 @@ HdfsFile {
 -[BugFix] Fixed the following bugs that failed to write data to files ([3258](https://github.com/apache/incubator-seatunnel/pull/3258))
 - When field from upstream is null it will throw NullPointerException
 - Sink columns mapping failed
-- When restore writer from states getting transaction directly failed
+- When restore writer from states getting transaction directly failed
+
+-[Improve] Support setting batch size for every file ([3625](https://github.com/apache/incubator-seatunnel/pull/3625))
docs/en/connector-v2/sink/LocalFile.md
Lines changed: 7 additions & 0 deletions
@@ -35,6 +35,7 @@ By default, we use 2PC commit to ensure `exactly-once`
 | is_partition_field_write_in_file | boolean | no | false |
 | sink_columns | array | no | When this parameter is empty, all fields are sink columns |
 | is_enable_transaction | boolean | no | true |
+| batch_size | int | no | 1000000 |
 | common-options || no | - |
 
 ### path [string]
@@ -108,6 +109,10 @@ Please note that, If `is_enable_transaction` is `true`, we will auto add `${tran
 
 Only support `true` now.
 
+### batch_size [int]
+
+The maximum number of rows in a file. For SeaTunnel Engine, the number of rows in a file is determined jointly by `batch_size` and `checkpoint.interval`. If `checkpoint.interval` is large enough, the sink writer keeps writing rows to a file until the number of rows in the file exceeds `batch_size`. If `checkpoint.interval` is small, the sink writer creates a new file each time a new checkpoint is triggered.
+
 ### common options
 
 Sink plugin common parameters, please refer to [Sink Common Options](common-options.md) for details.
@@ -186,3 +191,5 @@ LocalFile {
 - When field from upstream is null it will throw NullPointerException
 - Sink columns mapping failed
 - When restore writer from states getting transaction directly failed
+
+-[Improve] Support setting batch size for every file ([3625](https://github.com/apache/incubator-seatunnel/pull/3625))
docs/en/connector-v2/sink/OssFile.md
Lines changed: 8 additions & 1 deletion
@@ -42,6 +42,7 @@ By default, we use 2PC commit to ensure `exactly-once`
 | is_partition_field_write_in_file | boolean | no | false |
 | sink_columns | array | no | When this parameter is empty, all fields are sink columns |
 | is_enable_transaction | boolean | no | true |
+| batch_size | int | no | 1000000 |
 | common-options || no | - |
 
 ### path [string]
@@ -131,6 +132,10 @@ Please note that, If `is_enable_transaction` is `true`, we will auto add `${tran
 
 Only support `true` now.
 
+### batch_size [int]
+
+The maximum number of rows in a file. For SeaTunnel Engine, the number of rows in a file is determined jointly by `batch_size` and `checkpoint.interval`. If `checkpoint.interval` is large enough, the sink writer keeps writing rows to a file until the number of rows in the file exceeds `batch_size`. If `checkpoint.interval` is small, the sink writer creates a new file each time a new checkpoint is triggered.
+
 ### common options
 
 Sink plugin common parameters, please refer to [Sink Common Options](common-options.md) for details.
@@ -224,4 +229,6 @@ For orc file format
 -[BugFix] Fixed the following bugs that failed to write data to files ([3258](https://github.com/apache/incubator-seatunnel/pull/3258))
 - When field from upstream is null it will throw NullPointerException
 - Sink columns mapping failed
-- When restore writer from states getting transaction directly failed
+- When restore writer from states getting transaction directly failed
+
+-[Improve] Support setting batch size for every file ([3625](https://github.com/apache/incubator-seatunnel/pull/3625))
docs/en/connector-v2/sink/S3File.md
Lines changed: 7 additions & 1 deletion
@@ -43,6 +43,7 @@ By default, we use 2PC commit to ensure `exactly-once`
 | is_partition_field_write_in_file | boolean | no | false |
 | sink_columns | array | no | When this parameter is empty, all fields are sink columns |
 | is_enable_transaction | boolean | no | true |
+| batch_size | int | no | 1000000 |
 | common-options || no | - |
 
 ### path [string]
@@ -137,6 +138,10 @@ Please note that, If `is_enable_transaction` is `true`, we will auto add `${tran
 
 Only support `true` now.
 
+### batch_size [int]
+
+The maximum number of rows in a file. For SeaTunnel Engine, the number of rows in a file is determined jointly by `batch_size` and `checkpoint.interval`. If `checkpoint.interval` is large enough, the sink writer keeps writing rows to a file until the number of rows in the file exceeds `batch_size`. If `checkpoint.interval` is small, the sink writer creates a new file each time a new checkpoint is triggered.
+
 ### common options
 
 Sink plugin common parameters, please refer to [Sink Common Options](common-options.md) for details.
@@ -229,4 +234,5 @@ For orc file format
 -[Feature] Support S3A protocol ([3632](https://github.com/apache/incubator-seatunnel/pull/3632))
 - Allow user to add additional hadoop-s3 parameters
 - Allow the use of the s3a protocol
-- Decouple hadoop-aws dependencies
+- Decouple hadoop-aws dependencies
+-[Improve] Support setting batch size for every file ([3625](https://github.com/apache/incubator-seatunnel/pull/3625))
docs/en/connector-v2/sink/SftpFile.md
Lines changed: 8 additions & 1 deletion
@@ -39,6 +39,7 @@ By default, we use 2PC commit to ensure `exactly-once`
 | is_partition_field_write_in_file | boolean | no | false |
 | sink_columns | array | no | When this parameter is empty, all fields are sink columns |
 | is_enable_transaction | boolean | no | true |
+| batch_size | int | no | 1000000 |
 | common-options || no | - |
 
 ### host [string]
@@ -127,6 +128,10 @@ Please note that, If `is_enable_transaction` is `true`, we will auto add `${tran
 
 Only support `true` now.
 
+### batch_size [int]
+
+The maximum number of rows in a file. For SeaTunnel Engine, the number of rows in a file is determined jointly by `batch_size` and `checkpoint.interval`. If `checkpoint.interval` is large enough, the sink writer keeps writing rows to a file until the number of rows in the file exceeds `batch_size`. If `checkpoint.interval` is small, the sink writer creates a new file each time a new checkpoint is triggered.
+
 ### common options
 
 Sink plugin common parameters, please refer to [Sink Common Options](common-options.md) for details.
@@ -165,4 +170,6 @@ SftpFile {
 -[BugFix] Fixed the following bugs that failed to write data to files ([3258](https://github.com/apache/incubator-seatunnel/pull/3258))
 - When field from upstream is null it will throw NullPointerException
 - Sink columns mapping failed
-- When restore writer from states getting transaction directly failed
+- When restore writer from states getting transaction directly failed
+
+-[Improve] Support setting batch size for every file ([3625](https://github.com/apache/incubator-seatunnel/pull/3625))
seatunnel-connectors-v2/connector-file/connector-file-base/src/main/java/org/apache/seatunnel/connectors/seatunnel/file/config/BaseSinkConfig.java
seatunnel-connectors-v2/connector-file/connector-file-base/src/main/java/org/apache/seatunnel/connectors/seatunnel/file/config/BaseTextFileConfig.java
Lines changed: 4 additions & 1 deletion
@@ -40,6 +40,7 @@ public class BaseTextFileConfig implements DelimiterConfig, CompressConfig, Seri
seatunnel-connectors-v2/connector-file/connector-file-base/src/main/java/org/apache/seatunnel/connectors/seatunnel/file/sink/BaseFileSinkWriter.java
seatunnel-connectors-v2/connector-file/connector-file-base/src/main/java/org/apache/seatunnel/connectors/seatunnel/file/sink/state/FileSinkState.java