NIO based file transport implementation #1

isururanawaka · 2019-08-16T01:27:29Z

No description provided.

DImuthuUpe

Hi @isururanawaka this is really great and well thought implementation. I have few comments though.

DImuthuUpe · 2019-08-16T03:39:15Z

transport/s3-transport/src/main/java/org/apache/airavata/mft/transport/s3/S3Constants.java

+/**
+ * Includes constants related to S3 SDK
+ */
+public interface S3Constants {


Can't we use a constant class for this instead of an Interface and use static imports? Using an interface to keep constants seems like a violation of class hierarchy
https://veerasundar.com/blog/2012/04/java-constants-using-class-interface-static-imports/

Yes. We can use a constant class.

DImuthuUpe · 2019-08-16T03:53:15Z

core/src/main/java/org/apache/airavata/mft/core/bufferedImpl/ChannelUtils.java

+    public static void copyData(ReadableByteChannel src,  WritableByteChannel dest) throws IOException {
+        final ByteBuffer buffer = ByteBuffer.allocateDirect(Constants.BUFFER_SIZE);
+        int count = 0;
+        while ((count =src.read(buffer)) != -1) {


What happens if the src is slow and bytes are not coming as fast as expected? Should the loop wait for some time or continuously retry?

Thread will wait until data is available on the channel. If there is an error e.g connection is closed then it will return with error.

DImuthuUpe · 2019-08-16T03:54:39Z

core/src/main/java/org/apache/airavata/mft/core/bufferedImpl/PassthroughMediator.java

+        ReadableByteChannel rChannel = (ReadableByteChannel) src.getChannel();
+        WritableByteChannel dChannel = (WritableByteChannel) dst.getChannel();
+        try {
+            if (rChannel instanceof FileChannel) {


Are there the only 3 cases? What if both src and dst channels are local file channels?

That case also handled. It only requires one to be FileChannel. It will transfer from first case.

DImuthuUpe · 2019-08-16T03:59:47Z

core/src/main/java/org/apache/airavata/mft/core/bufferedImpl/AbstractConnector.java

+     * @param channel
+     * @param obj
+     */
+    public void cacheChannel(Channel channel, Object obj) {


What is the main intention of caching the channel? Do you think that the all channels will be implemented using NIO Channels?

DImuthuUpe · 2019-08-16T04:00:24Z

core/src/main/java/org/apache/airavata/mft/core/bufferedImpl/AbstractConnector.java

+     * @param channel
+     * @return
+     */
+    public Object getConnectorChannel(Channel channel) {


What could be the edge cases of channels getting invalidated / timed out?

DImuthuUpe · 2019-08-16T04:00:55Z

core/src/main/java/org/apache/airavata/mft/core/api/SinkConnector.java

+ * This represents the output connector, where to write
+ * data from the application.
+ */
+public interface SinkConnector extends Connector{


Connector {

DImuthuUpe · 2019-08-16T04:01:06Z

core/src/main/java/org/apache/airavata/mft/core/api/SourceConnector.java

+ * This represents the input connector, where to read data
+ * from the application
+ */
+public interface SourceConnector extends Connector{


Connector {

DImuthuUpe · 2019-08-16T04:05:48Z

transport/s3-transport/src/main/java/org/apache/airavata/mft/transport/s3/S3SinkConnector.java

+        expTimeMillis += S3Constants.CONNECTION_EXPIRE_TIME;
+        expiration.setTime(expTimeMillis);
+
+        GeneratePresignedUrlRequest generatePresignedUrlRequest = new GeneratePresignedUrlRequest


Why can't we use s3client.putObject method instead of this approach?

With pre signed URL method. we can provide IAM Roles and it may reduce security issues
https://medium.com/@aidan.hallett/securing-aws-s3-uploads-using-presigned-urls-aa821c13ae8d

Other reason is PutObject method requires Inputstream. For that we need to copy data from Outputstream to Input stream, using this method avoids that.

DImuthuUpe · 2019-08-16T04:11:24Z

...sport/s3-transport/src/main/java/org/apache/airavata/mft/transport/s3/S3SourceConnector.java

+    private AmazonS3 s3Client;
+
+    @Override
+    public boolean initiate(ConnectorConfig connectorConfig) {


I understand that you are trying to make the APIs generic as possible. But in some cases, it might cause the API to less readable. Fro an outsider, it would be hard to understand what are the properties that he should pass into connectorConfig as it contains generic key value pairs. Instead of using that one, why don't we come up configs specific for each connector with clearly defined properties. Say S3ConnectorConfig. Then the user exactly knows what to pass. That will reduce the reusability a bit but will improve the clarity of the API. What do you think?

Yes I agree. What I thought was Connector implementation is specific to its transport attributes and local to that. Hence, anyone who tries to read code should know the transport specific attributes. So I thought it won't be a problem. But it makes Connector decouple from implementation details.

DImuthuUpe · 2019-08-22T16:41:50Z

Looks good to me. I'm going to merge this. Thanks @isururanawaka

GCS transport implementation

isururanawaka added 4 commits August 15, 2019 13:24

Adding NIO based file transferring initial impl

c7b1eb4

Remove senstive data

eda7872

Remove Channel from interface Connector

c16da98

remove sensitive data

10fc560

DImuthuUpe requested changes Aug 16, 2019

View reviewed changes

isururanawaka added 8 commits August 19, 2019 11:00

Improve code

7a951d2

Adding local transport

5c0a238

Change permission

df53974

Add intial implementation of TCP File Server

3b15709

TCP transport implementation

4d773e4

TCP transport implementation

1d373eb

TCP transport implementation

1635fe5

TCP transport implementation

e331ff7

DImuthuUpe merged commit c2f432e into apache:master Aug 22, 2019

DImuthuUpe pushed a commit that referenced this pull request Apr 23, 2020

Merge pull request #1 from pokearu/GCSTransport

4fb6939

GCS transport implementation

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

NIO based file transport implementation #1

NIO based file transport implementation #1

isururanawaka commented Aug 16, 2019

DImuthuUpe left a comment

DImuthuUpe Aug 16, 2019

isururanawaka Aug 16, 2019

DImuthuUpe Aug 16, 2019

isururanawaka Aug 16, 2019

DImuthuUpe Aug 16, 2019

isururanawaka Aug 16, 2019

DImuthuUpe Aug 16, 2019

DImuthuUpe Aug 16, 2019

DImuthuUpe Aug 16, 2019

DImuthuUpe Aug 16, 2019

DImuthuUpe Aug 16, 2019

isururanawaka Aug 16, 2019 •

edited

Loading

DImuthuUpe Aug 16, 2019

isururanawaka Aug 16, 2019

DImuthuUpe commented Aug 22, 2019

NIO based file transport implementation #1

NIO based file transport implementation #1

Conversation

isururanawaka commented Aug 16, 2019

DImuthuUpe left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

isururanawaka Aug 16, 2019 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

DImuthuUpe commented Aug 22, 2019

isururanawaka Aug 16, 2019 •

edited

Loading