Add follow_only_path option, linked to https://github.com/jordansissel/ruby-filewatch/pull/32 #50

manusfreedom · 2015-06-10T06:48:15Z

To use all features of:
jordansissel/ruby-filewatch#32

manusfreedom/ruby-filewatch@92e660f

purbon · 2015-10-27T09:52:13Z

please jenkins, test this.

elasticsearch-release · 2015-10-27T09:56:01Z

Jenkins standing by to test this. If you aren't a maintainer, you can ignore this comment. Someone with commit access, please review this and clear it for Jenkins to run; then say 'jenkins, test it'.

purbon · 2015-10-27T09:56:24Z

please jenkins, test this.

guyboertje · 2018-05-03T08:30:59Z

Filewatch code has now been copied into this plugin code base and been extensively refactored. The changes mentioned above did not make it into the copy.

Our preferred approach is to use fingerprinting which allows us to evaluate whether content has been seen before regardless of path or inode.

On discovery, fingerprints, one way hashes, are taken of a chunk of bytes in two well known offsets in the file. On file discovery, we try to match this file with one we have seen already in the sincedb collection. We try to find a match on the first fingerprint and, if found, verify against the second.
One big challenge is when the discovered file is very small but growing, we have to delay the fingerprint taking until later before we can match. For real tail cases, for a rotated file, the content is new but in read cases where the same content is accidentally copied in we need to build fingerprints before we can match and determine whether new unread content exists beyond where we last read on the previously seen content.

- Add follow_only_path option

527f53c

manusfreedom/ruby-filewatch@92e660f

manusfreedom changed the title ~~- Add follow_only_path option~~ Add follow_only_path option, linked to https://github.com/jordansissel/ruby-filewatch/pull/32 Jun 10, 2015

splashx mentioned this pull request Nov 2, 2015

Files on NFS volume vs sincedb #45

Open

shoggeh mentioned this pull request Jan 26, 2016

[file input] re-using inodes leads to missing/corrupted data - please implement periodic cleanup of sincedb entries elastic/logstash#4566

Closed

guyboertje closed this May 3, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add follow_only_path option, linked to https://github.com/jordansissel/ruby-filewatch/pull/32 #50

Add follow_only_path option, linked to https://github.com/jordansissel/ruby-filewatch/pull/32 #50

Uh oh!

manusfreedom commented Jun 10, 2015

Uh oh!

purbon commented Oct 27, 2015

Uh oh!

elasticsearch-release commented Oct 27, 2015

Uh oh!

purbon commented Oct 27, 2015

Uh oh!

guyboertje commented May 3, 2018

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Add follow_only_path option, linked to https://github.com/jordansissel/ruby-filewatch/pull/32 #50

Add follow_only_path option, linked to https://github.com/jordansissel/ruby-filewatch/pull/32 #50

Uh oh!

Conversation

manusfreedom commented Jun 10, 2015

Uh oh!

purbon commented Oct 27, 2015

Uh oh!

elasticsearch-release commented Oct 27, 2015

Uh oh!

purbon commented Oct 27, 2015

Uh oh!

guyboertje commented May 3, 2018

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants