Add s3 output to filebeat #18158

thenewguy · 2020-05-02T19:58:02Z

It would be very helpful to allow filebeat to output to s3 directly.

Currently, if one wants to store logs on s3, logstash is required.

When using a service like aws elastic beanstalk, it is very handy to push logs to s3 for persistence. I can imagine 100 other usecases but this one is the current one that would have simplified my life.

The only alternative is rather complicated - you have to configure the aws ECS agent to support gelf logging and then use gelf logging to logstash. And then push the logs to s3 via logstash output.

Major downsite: There is a race-condition with this approach where you will lose the initial logs from containers that start before the logstash container.

One big downside: you cannot use docker logs for quick inspection anymore because aws doesn't offer dual logger output. The json-file supported by file beat would work out of the box here. It would certainly be easier to use filebeat when just getting started

Another issue is that you must run logstash on each application instance plus the ones you need for ingestion into elasticsearch.

elasticmachine · 2020-05-03T09:39:33Z

Pinging @elastic/integrations (Team:Integrations)

holisticode · 2020-06-27T00:23:19Z

Any statement from the maintainers? Will this be considered?

ktham · 2020-09-25T18:46:17Z

I'm hoping S3 Output for Filebeats gets considered soon. We are looking to replace fluentd with fluent-bit as soon as fluent-bit adds S3 output (fluent/fluent-bit#2583), but I would prefer to stay within the Elastic ecosystem if possible.

(We want to be running something lightweight which is why we're looking to move off fluentd. And logstash is not lightweight)

lambda-9 · 2020-11-16T22:38:19Z

We would also like to consider various beats, and elastic agent/fleet management as a replacement for fluentd and fluent-bit. We are unable to consider beats or elastic agent due to lack of flexibility in outputs. We can use logstash as an output, but this fails in any scenario where elastic agent or the fleet manager is to be considered.

We do not consider Elasticsearch a primary data store for logs and events. Rather, it is a secondary data store for analysis and search. We must persist the primary data store for 7 years in most cases and we don't feel we can keep indices around that long on a reliable, performant, or cost-effective basis. It would be great to be able to output events to S3 as the primary data store and have logstash or something else read events from there, or to have beats output to both S3 and Elasticsearch simultaneously.

botelastic · 2021-10-17T23:04:59Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

thenewguy · 2021-10-17T23:14:09Z

bleh - auto robots with tags on real issues is obnoxious =/

bryanjacobsos · 2021-11-25T20:44:52Z

I'm building a pipeline for retrying failed messages when kafka or other components in our ingestion pipelines fail. Having a filebeat that can move data from log to s3 would be extremely helpful.

At this point my only option is fluentd.

ktham · 2022-01-05T18:00:01Z

At this point, we've given up on considering filebeats for S3 output and we've adopted fluent-bit (https://github.com/fluent/fluent-bit) for shipping logs to S3. (It's written in C, vs. Ruby in fluentd)

They've added support for S3 output over a year ago in fluent/fluent-bit#2583, and it working quite well for us.

The project is very active and that would be my recommendation if you need S3 output.

botelastic · 2023-01-05T18:11:18Z

Hi!
We just realized that we haven't looked into this issue in a while. We're sorry!

We're labeling this issue as Stale to make it hit our filters and make sure we get back to it as soon as possible. In the meantime, it'd be extremely helpful if you could take a look at it as well and confirm its relevance. A simple comment with a nice emoji will be enough :+1.
Thank you for your contribution!

ktham · 2023-01-05T19:00:00Z

No problem, we've migrated to Vector already in https://vector.dev/docs/reference/configuration/sinks/aws_s3/, so no need for this anymore 🙁

Z4ck404 · 2023-03-13T16:40:40Z

Having is extremely useful .. is there any plans to add s3/gcs/azure output to filebeat ?

botelastic · 2024-03-12T17:46:18Z

Hi!
We just realized that we haven't looked into this issue in a while. We're sorry!

We're labeling this issue as Stale to make it hit our filters and make sure we get back to it as soon as possible. In the meantime, it'd be extremely helpful if you could take a look at it as well and confirm its relevance. A simple comment with a nice emoji will be enough :+1.
Thank you for your contribution!

jakauppila · 2024-03-12T18:18:20Z

This would still be useful

botelastic bot added the needs_team Indicates that the issue/PR needs a Team:* label label May 2, 2020

andresrc added enhancement Team:Integrations Label for the Integrations team labels May 3, 2020

botelastic bot removed the needs_team Indicates that the issue/PR needs a Team:* label label May 3, 2020

andresrc added the [zube]: Team Triage label May 3, 2020

betatim mentioned this issue Oct 3, 2020

Persist repository build logs for later access jupyterhub/binderhub#1156

Open

botelastic bot added the Stalled label Oct 17, 2021

botelastic bot removed the Stalled label Oct 17, 2021

botelastic bot added the Stalled label Jan 5, 2023

botelastic bot removed the Stalled label Jan 5, 2023

botelastic bot added the Stalled label Mar 12, 2024

botelastic bot removed the Stalled label Mar 12, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add s3 output to filebeat #18158

Add s3 output to filebeat #18158

thenewguy commented May 2, 2020 •

edited

Loading

elasticmachine commented May 3, 2020

holisticode commented Jun 27, 2020

ktham commented Sep 25, 2020

lambda-9 commented Nov 16, 2020

botelastic bot commented Oct 17, 2021

thenewguy commented Oct 17, 2021

bryanjacobsos commented Nov 25, 2021

ktham commented Jan 5, 2022

botelastic bot commented Jan 5, 2023

ktham commented Jan 5, 2023

Z4ck404 commented Mar 13, 2023 •

edited

Loading

botelastic bot commented Mar 12, 2024

jakauppila commented Mar 12, 2024

Add s3 output to filebeat #18158

Add s3 output to filebeat #18158

Comments

thenewguy commented May 2, 2020 • edited Loading

elasticmachine commented May 3, 2020

holisticode commented Jun 27, 2020

ktham commented Sep 25, 2020

lambda-9 commented Nov 16, 2020

botelastic bot commented Oct 17, 2021

thenewguy commented Oct 17, 2021

bryanjacobsos commented Nov 25, 2021

ktham commented Jan 5, 2022

botelastic bot commented Jan 5, 2023

ktham commented Jan 5, 2023

Z4ck404 commented Mar 13, 2023 • edited Loading

botelastic bot commented Mar 12, 2024

jakauppila commented Mar 12, 2024

thenewguy commented May 2, 2020 •

edited

Loading

Z4ck404 commented Mar 13, 2023 •

edited

Loading