
Support request for massive log lines #100

Closed
everesio opened this issue Jun 29, 2020 · 17 comments

@everesio

When the log contains a line longer than MaxScanTokenSize (64 * 1024), the error "Error scanning stderr: bufio.Scanner: token too long" is reported. All subsequent requests (when using the logger) are blocked.

Expected Behaviour

Long lines should be logged

Current Behaviour

bufio.NewScanner is initialized with MaxScanTokenSize = 64 * 1024. If a log line is longer than this limit,
bufio.Scanner: token too long is returned. On that error the scanner.Scan loop in bindLoggingPipe exits and there is no
pipe reader any more.
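
For illustration, a minimal sketch of the affected pattern, assuming a structure similar to (but not identical with) the actual bindLoggingPipe in the of-watchdog source:

package main

import (
	"bufio"
	"fmt"
	"io"
	"log"
)

// Approximation of the logging pipe loop described above; the function
// name matches the issue text, but the body is illustrative only.
func bindLoggingPipe(name string, pipe io.Reader, output io.Writer) {
	go func() {
		scanner := bufio.NewScanner(pipe) // default limit: bufio.MaxScanTokenSize = 64 * 1024
		for scanner.Scan() {
			fmt.Fprintf(output, "%s: %s\n", name, scanner.Text())
		}
		if err := scanner.Err(); err != nil {
			// A line longer than 64 KiB makes Scan return false with
			// bufio.ErrTooLong; the goroutine exits and nothing drains
			// the pipe any more, so the writing process eventually blocks.
			log.Printf("Error scanning %s: %v", name, err)
		}
	}()
}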

Possible Solution

NewScanner can be replaced by NewReaderSize.
I will rework #99
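
A minimal sketch of that idea, assuming a bufio.Reader is used instead of a Scanner (illustrative only; the actual change is in PR #99):

package main

import (
	"bufio"
	"fmt"
	"io"
	"log"
)

// pipeLines is a hypothetical replacement for the Scanner-based loop:
// bufio.Reader.ReadString is not capped by MaxScanTokenSize, so lines
// of any length are returned intact.
func pipeLines(name string, pipe io.Reader, output io.Writer) {
	reader := bufio.NewReaderSize(pipe, 64*1024) // initial buffer size only, not a line limit
	for {
		line, err := reader.ReadString('\n')
		if len(line) > 0 {
			fmt.Fprintf(output, "%s: %s", name, line) // line keeps its trailing newline
		}
		if err != nil {
			if err != io.EOF {
				log.Printf("Error reading %s: %v", name, err)
			}
			return
		}
	}
}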

Steps to Reproduce (for bugs)

token-too-long.zip

  1. unzip token-too-long.zip
  2. make build run
  3. make test
  4. make LENGTH=66000 test

After that, all subsequent requests are not processed, e.g.:
  5. make test

Context

This behavior is a blocker, as in my use case the logged events can be longer than 64 KB.

Your Environment

  • Docker version (e.g. Docker 17.0.05):

  • Are you using Docker Swarm or Kubernetes (FaaS-netes)?
    Kubernetes

  • Operating System and version (e.g. Linux, Windows, MacOS):
    Linux

  • Link to your project or a code example to reproduce issue:
    token-too-long.zip

@alexellis
Member

Hi, can you put your project on GitHub in its own repo so that we can check it out and see the source code?

I also wanted to know which mode of the watchdog you are using and why you feel it necessary to stream a massive file into the container logs? There may be alternatives which do not require this proposed solution, so I wanted to get a better idea of the exact use-case, can you help us out with that?

Regards,

Alex

@everesio
Author

Hi Alex,
I attached the token-too-long.zip with the test function and Makefile for building and testing, but if you want I can create a repo for this.

Please check out the code:

  1. template python3-http-debian - openfaas/of-watchdog:0.7.7 as watchdog
  2. test handler
import logging
logger = logging.getLogger()
logger.setLevel(logging.INFO)

def handle(event, context):

    if event.body:
        logger.info('event body length %d', len(event.body))
        logger.info('body %s', str(event.body, "utf-8"))

    return {
        "statusCode": 200,
        "body": "Hello from OpenFaaS!"
    } 
  3. POST event which breaks the function
LENGTH=66000 
cat /dev/urandom | tr -dc 'a-zA-Z0-9' | fold -w ${LENGTH} | head -n 1 | curl -X POST --data "@-" localhost:8080

Kind regards,
Michal

@everesio everesio changed the title bufio.Scanner: token too long [BLOCKER] bufio.Scanner: token too long Jul 1, 2020
@ghostbody

ghostbody commented Jul 30, 2020

+1

When trying to log something longer than 65536 bytes, "bufio.Scanner: token too long" occurs.

Here is my problem:


In scan.go

    // Is the buffer full? If so, resize.
    if s.end == len(s.buf) {
        if len(s.buf) >= s.maxTokenSize {
            s.setErr(ErrTooLong)
            return false
        }
        newSize := len(s.buf) * 2
        if newSize > s.maxTokenSize {
            newSize = s.maxTokenSize
        }
        newBuf := make([]byte, newSize)
        copy(newBuf, s.buf[s.start:s.end])
        s.buf = newBuf
        s.end -= s.start
        s.start = 0
        continue
    }

What's more, the daemon process is dead after this error.
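
The stdlib behaviour quoted above can be reproduced in isolation; the following standalone program (independent of the watchdog) hits the same error with a single 70,000-byte line:

package main

import (
	"bufio"
	"fmt"
	"strings"
)

func main() {
	// One line of 70,000 bytes, above the default 64 * 1024 token limit.
	input := strings.Repeat("a", 70000) + "\n"

	scanner := bufio.NewScanner(strings.NewReader(input))
	for scanner.Scan() {
		fmt.Println("read line of length", len(scanner.Text()))
	}
	if err := scanner.Err(); err != nil {
		fmt.Println("scan error:", err) // prints: scan error: bufio.Scanner: token too long
	}
}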

@alexellis
Member

/set title: Support request for massive log lines

@derek derek bot changed the title [BLOCKER] bufio.Scanner: token too long Support request for massive log lines Jul 30, 2020
@alexellis
Member

Can I ask why you are logging such huge amounts of data to the container logs? This doesn't seem like a normal use case.

@ghostbody

@alexellis

Yes, you are right. This is not a normal use case.

But there are several scenarios:

  1. Debugging: some users need to debug their function on the OpenFaaS platform, and they turn on debug logs which can be huge.

  2. Error tracking with logs in ELK (Elasticsearch, Logstash, Kibana): some users prefer to send logs to ELK through the log stream. The best practice is one log event per line, and a Java/Python traceback or exception can produce a very large stack.

  3. Faults or bugs in handling: some logs are not intentionally huge; they just receive some big parameter.

The huge log works fine in the handler function, but the error occurs in the watchdog and the handling process then dies. It seems better to take care of this problem.

@LucasRoesler
Member

I think we can consider increasing the buffer size, or even allowing it to be controlled via an env variable. Even Logstash (in ELK) has a max buffer size: https://www.elastic.co/guide/en/logstash/current/plugins-inputs-udp.html#plugins-inputs-udp-buffer_size

Looking through several versions:

The default buffer size is:

  • 65536 bytes on Logstash >= 5.1
  • 8192 bytes on Logstash >= 2.0
  • 4096 bytes on older versions.

I suspect all log systems will have some kind of limit.

If your request body is very big, I would not recommend logging it. An absurd case like a 10 GB binary file as log output is not going to be a good idea, and it is certainly not something that I consider a "log".
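
A sketch of the env-variable idea, assuming a hypothetical MAX_LOG_LINE_SIZE variable (the name and default are illustrative, not an existing of-watchdog setting); Scanner.Buffer lets the caller raise the maximum token size:

package main

import (
	"bufio"
	"io"
	"log"
	"os"
	"strconv"
)

// maxLogLineSize reads the hypothetical MAX_LOG_LINE_SIZE environment
// variable and falls back to the bufio default of 64 * 1024.
func maxLogLineSize() int {
	if v, err := strconv.Atoi(os.Getenv("MAX_LOG_LINE_SIZE")); err == nil && v > 0 {
		return v
	}
	return bufio.MaxScanTokenSize
}

// scanWithLimit logs each line from the pipe, allowing tokens up to the
// configured limit instead of the fixed 64 KiB default.
func scanWithLimit(pipe io.Reader) {
	scanner := bufio.NewScanner(pipe)
	scanner.Buffer(make([]byte, 64*1024), maxLogLineSize())
	for scanner.Scan() {
		log.Println(scanner.Text())
	}
	if err := scanner.Err(); err != nil {
		log.Printf("scan error: %v", err)
	}
}

Note that lines above the configured limit would still trip bufio.ErrTooLong, so this raises the ceiling rather than removing it; the ReadLine approach discussed later removes the cap entirely.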

@everesio
Author

everesio commented Aug 8, 2020

If a log line longer than 64K stops a service from operating, it is a serious problem,
and it is irrelevant whether log collectors can deal with such lines or not.

I find it really amusing that the title of the issue was renamed from "bufio.Scanner: token too long" to "Support request for massive log lines", which implies a feature request and not a bug.

As a comparison, please check hashicorp/terraform#20325.
This is a similar issue, and the Terraform maintainer did label it as a "bug".

@LucasRoesler
Member

@everesio I agree, a long log line should not stop the watchdog from working. My main point above is that it seems reasonable to have a buffer and to allow control over the size of that.

Of course, logging shouldn't cause anything to crash, that goes without saying.

@LucasRoesler
Member

I just read hashicorp/terraform#20325; it looks like something we could replicate in https://github.com/openfaas/of-watchdog/blob/master/executor/logging.go

@everesio
Author

everesio commented Aug 8, 2020

Exactly, the scanner.Scan method is the source of the problem.
More than a month ago I provided PR #99 with a fix.
You can rework it or fix it in a different way. I am currently discouraged from making any source code contributions to this project.

@LucasRoesler
Member

I started adapting the Terraform patch, and it is definitely a bit messy because of the internal use of the logger. I have an idea for how to work around it and will post it once I have tested it.

LucasRoesler added a commit to LucasRoesler/of-watchdog that referenced this issue Aug 8, 2020
**What**
- Use the bufio ReadLine method so that we can check if the log line
  has been broken into several parts due to the internal buffering.
  This implementation has been adapted from
  hashicorp/go-plugin#98

  The implementation also preserves the timestamp prefix and is
  fully backwards compatible.

  Resolves openfaas#100

Signed-off-by: Lucas Roesler <roesler.lucas@gmail.com>
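
A sketch of the ReadLine pattern the commit message describes, adapted loosely from the referenced go-plugin change (not the exact code in PR #107): ReadLine reports isPrefix == true while a line is still being split by the internal buffer, so the fragments can be stitched back together and emitted as one log line.

package main

import (
	"bufio"
	"bytes"
	"io"
	"log"
)

// logLines reassembles lines that bufio.Reader.ReadLine splits because
// of its internal buffer. Illustrative sketch only.
func logLines(pipe io.Reader, logger *log.Logger) {
	reader := bufio.NewReader(pipe)
	var pending bytes.Buffer
	for {
		chunk, isPrefix, err := reader.ReadLine()
		pending.Write(chunk)
		if err != nil {
			if pending.Len() > 0 {
				logger.Println(pending.String()) // flush a final, unterminated line
			}
			if err != io.EOF {
				logger.Printf("error reading pipe: %v", err)
			}
			return
		}
		if isPrefix {
			continue // line longer than the buffer; keep accumulating fragments
		}
		logger.Println(pending.String())
		pending.Reset()
	}
}
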
@LucasRoesler
Member

@everesio give this a try #107

@Kanshiroron

Kanshiroron commented Mar 15, 2023

@LucasRoesler @alexellis we faced the same issue internally, causing the function to hang because its stdout is locked (as it's piped, but not read).
I looked at the PR @LucasRoesler created, and it seems that it should solve the issue; any chance of having it merged?
Thanks for the help.

@alexellis
Member

alexellis commented Mar 15, 2023

Hi @Kanshiroron

As far as I was aware, we'd already done the work for this in #126 - can you confirm you've tried the new settings we have documented?

See log_buffer_size in the README

If you still have issues please raise your own Issue here.

Alex

@alexellis
Member

/close: resolved

@alexellis
Member

/lock

@derek derek bot locked and limited conversation to collaborators Mar 15, 2023