Enable HTTP log reporting #149

Superskyyy · 2021-08-10T14:19:13Z

This is a work in progress.

Reporting logs in JSON through HTTP to http://oap/v3/logs now works.

However, I'm not sure whether the oap/v3/logs endpoint is meant for this kind of usage (it seems designed for fluent-bit batch reporting?). Reporting logs one by one through HTTP may not be ideal in terms of performance. I'm not sure whether the Java agent intentionally implements only a gRPC reporter for this reason.

Please advise.

Signed-off-by: YihaoChen <Superskyyy@outlook.com>

tom-pytel · 2021-08-10T18:33:48Z

> However, I'm not sure whether the oap/v3/logs endpoint is meant for this kind of usage (it seems designed for fluent-bit batch reporting?). Reporting logs one by one through HTTP may not be ideal in terms of performance. I'm not sure whether the Java agent intentionally implements only a gRPC reporter for this reason.

I'm not sure what the intended usage of that endpoint was, but the same inefficiency of sending items individually applies to how spans are currently sent to /v3/segment instead of being batched to /v3/segments; this should be optimized at some point. The hit is not that bad, though, since the HTTP protocol uses requests.Session, which should keep connections persistent, as per https://docs.python-requests.org/en/master/user/advanced/:

"The Session object allows you to persist certain parameters across requests. It also persists cookies across all requests made from the Session instance, and will use urllib3’s connection pooling. So if you’re making several requests to the same host, the underlying TCP connection will be reused, which can result in a significant performance increase (see HTTP persistent connection)."

In any case, if it works but may not yet be optimal, it is a step forward.
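The connection-reuse point can be illustrated with a minimal sketch. It does not use requests; it uses the standard library's http.client against a throwaway local server as a stand-in for what a pooled session does, and every name in it (`ConnectionCountingHandler`, `post_log`, `count_connections`) is hypothetical. Only the `/v3/logs` path comes from this thread.

```python
import http.client
import json
import threading
from http.server import BaseHTTPRequestHandler, ThreadingHTTPServer


class ConnectionCountingHandler(BaseHTTPRequestHandler):
    protocol_version = "HTTP/1.1"  # lets the server keep connections alive

    def setup(self):
        super().setup()
        # One handler instance is created per accepted TCP connection.
        self.server.connections_opened += 1

    def do_POST(self):
        length = int(self.headers.get("Content-Length", 0))
        self.rfile.read(length)  # drain the JSON body
        self.send_response(200)
        self.send_header("Content-Length", "0")
        self.end_headers()

    def log_message(self, format, *args):
        pass  # keep the demo quiet


def post_log(conn, log):
    """POST a single log record as a one-element JSON array."""
    conn.request("POST", "/v3/logs", body=json.dumps([log]),
                 headers={"Content-Type": "application/json"})
    response = conn.getresponse()
    response.read()  # finish reading so the connection can be reused
    return response.status


def count_connections(reuse, n=3):
    """Send n logs and return how many TCP connections the server accepted."""
    server = ThreadingHTTPServer(("127.0.0.1", 0), ConnectionCountingHandler)
    server.connections_opened = 0
    threading.Thread(target=server.serve_forever, daemon=True).start()
    host, port = server.server_address
    try:
        shared = http.client.HTTPConnection(host, port) if reuse else None
        for i in range(n):
            conn = shared if reuse else http.client.HTTPConnection(host, port)
            post_log(conn, {"body": f"log {i}"})
            if not reuse:
                conn.close()
        if reuse:
            shared.close()
        return server.connections_opened
    finally:
        server.shutdown()
        server.server_close()
```

With a reused connection, the three requests share one TCP connection; without reuse, each request opens its own. The per-request HTTP overhead remains either way, which is why batching is still worth doing.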

tom-pytel · 2021-08-10T18:40:01Z

skywalking/client/http.py

+    def report(self, generator):
+        for log_data in generator:
+            json_string = json_format.MessageToJson(log_data)
+            res = self.session.post(self.url_report, json=[json.loads(json_string)])


Actually, looking at this suggests that the /v3/logs endpoint can take an array of logs, so a batch send could be done. Change to:

    def report(self, generator):
        # MessageToJson returns a string, so parse it back into a dict
        # before sending the whole batch as a single JSON array.
        log_batch = [json.loads(json_format.MessageToJson(log_data)) for log_data in generator]
        res = self.session.post(self.url_report, json=log_batch)

Your call whether you want to do this now.

Superskyyy · 2021-08-11

Should I add a new config entry that lets the user choose whether to batch or not?

tom-pytel · 2021-08-11

No, batch is the right way to go (assuming this endpoint does take arrays, which is what you should check).

Superskyyy · 2021-08-11

Got it, it does take arrays.
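The batching agreed on here can be sketched in isolation as follows. This is only an illustration under stated assumptions, not the agent's actual code: `RecordingSession` is a hypothetical stand-in for requests.Session, `HttpLogReportClient` and its `to_dict` parameter are made-up names, and skipping empty batches is an assumption rather than something decided in this thread. In the real client the conversion would go through `json_format.MessageToJson` as in the diff above; here plain JSON strings stand in for the protobuf messages.

```python
import json


class RecordingSession:
    """Hypothetical stand-in for requests.Session that records each POST."""

    def __init__(self):
        self.posts = []

    def post(self, url, json=None):
        self.posts.append((url, json))


class HttpLogReportClient:
    """Sketch of a batching reporter; not the agent's actual class."""

    def __init__(self, session, url_report, to_dict):
        self.session = session
        self.url_report = url_report
        self.to_dict = to_dict  # converts one log record into a plain dict

    def report(self, generator):
        # Drain the generator into one list so a single request carries all logs.
        log_batch = [self.to_dict(log_data) for log_data in generator]
        if log_batch:  # assumption: send nothing when there is nothing to report
            self.session.post(self.url_report, json=log_batch)
        return len(log_batch)


# Usage: two queued records result in exactly one POST with a two-element array.
session = RecordingSession()
client = HttpLogReportClient(session, "http://oap/v3/logs", to_dict=json.loads)
sent = client.report(iter(['{"body": "first"}', '{"body": "second"}']))
```

Injecting the session and the converter keeps the sketch testable without a running OAP server or the protobuf package.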

kezhenxu94 · 2021-08-11T01:01:08Z

> However, I'm not sure whether the oap/v3/logs endpoint is meant for this kind of usage (it seems designed for fluent-bit batch reporting?).

Yes, that's the correct usage. The HTTP protocol is provided for languages that don't support the gRPC protocol (or for which supporting it is hard).

> Reporting logs one by one through HTTP may not be ideal in terms of performance. I'm not sure whether the Java agent intentionally implements only a gRPC reporter for this reason.

Yes. The Java agent can eliminate all possible side effects by shading the gRPC libs, but for Python we still need the HTTP protocol in case users' applications use a different (incompatible) gRPC package version. We decided to implement the HTTP protocol in the Python agent from day one to give users a secondary choice.

Humbertzhang · 2021-08-11T16:34:01Z

So far looks good to me, looking forward to your Kafka part, thank you.

tom-pytel · 2021-08-11T17:32:16Z

I would say merge this first then do Kafka as a separate PR.

kezhenxu94 · 2021-08-12T00:15:01Z

> I would say merge this first then do Kafka as a separate PR.

I agree. @Superskyyy let's do one thing at a time, in a single PR.

Superskyyy · 2021-08-12T03:40:00Z

> I would say merge this first then do Kafka as a separate PR.

> I agree. @Superskyyy let's do one thing at a time, in a single PR.

No problem, but let me add a commit for the batch reporting first. Then let's merge.

Superskyyy · 2021-08-12T03:57:41Z

Oops, messed up a bit.

kezhenxu94 · 2021-08-12T03:58:13Z

> Oops, messed up a bit.

It's OK, we will squash the commits into one when merging.

kezhenxu94 · 2021-08-12T04:17:36Z

@Superskyyy ping me when it's ready to merge

Superskyyy · 2021-08-12T06:16:36Z

@kezhenxu94 Checks done, ready to merge.

kezhenxu94 · 2021-08-12T06:17:28Z

> @kezhenxu94 Checks done, ready to merge.

Thank you @Superskyyy very much 🙇🏻 , excellent work!

WIP Try support HTTP protocol

6fcb29f

Signed-off-by: YihaoChen <Superskyyy@outlook.com>

kezhenxu94 added the feature New feature label Aug 10, 2021

kezhenxu94 added this to the 0.7.0 milestone Aug 10, 2021

kezhenxu94 requested review from kezhenxu94 and tom-pytel August 10, 2021 15:28

tom-pytel approved these changes Aug 10, 2021

kezhenxu94 requested a review from Humbertzhang August 11, 2021 01:01

Superskyyy changed the title ~~Enable HTTP Kafka log reporting~~ Enable HTTP log reporting Aug 12, 2021

Superskyyy added 2 commits August 12, 2021 11:48

Merge branch 'master' into HTTP-Kafka-logging

879319c

Merge branch 'apache:master' into HTTP-Kafka-logging

203303b

Superskyyy closed this Aug 12, 2021

Superskyyy deleted the HTTP-Kafka-logging branch August 12, 2021 03:54

Superskyyy restored the HTTP-Kafka-logging branch August 12, 2021 03:56

Superskyyy reopened this Aug 12, 2021

Support batch reporting

1b7fabd

Signed-off-by: YihaoChen <Superskyyy@outlook.com>

Superskyyy marked this pull request as ready for review August 12, 2021 05:40

kezhenxu94 merged commit 8039d8b into apache:master Aug 12, 2021

Superskyyy deleted the HTTP-Kafka-logging branch August 12, 2021 06:22
