New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
cascading counters in ESTap #87
Comments
Can you expand on this? As far as I know, Cascading already does this through the HadoopTupleEntrySchemeCollector (they are stored under cascading.flow.SliceCounters) - which ESTap uses automatically (in case of Hadoop). |
I was not aware of that, but we have both Hadoop ES Taps and Local ES Taps (for smaller/faster indices). Are there corresponding slice counters for Local Mode too? Also do they Slice Counters correspond to # of docs that were indexed? What about failures, etc? I can't seem to find mention of Slice Counters in Cascading docs, I will look them up when I get a second in Cascading source code to see whether they support what I need, but I wanted to give you a quick feedback. |
Can't comment on the local support but I assume that is supported as well. Quickly browsing through the source indicates that SliceCounter and StepCounters are used by the I'd be happy to provide some support for it but generally speaking, monitoring happens best close to the source (i.e. within Hadoop and/or Cascading). |
dump typoinfo in favor of object inspector as the type does not change but its format can causing issue. the oi variant should be just as fast and also provide better interoperability as we're using the provider code instead of guessing its format fix elastic#87
Hi. I've added stats for Hadoop (see #141) and I'm currently looking into provided dedicated stats for Cascading as well. Should be shortly in master. |
Done. In Hadoop mode, reporting happens through the Hadoop infrastructure while in local mode, they are reported directly to Cascading. |
Can we add counters (FlowProcesss.increment) to ESTap to indicate how many tuples were attempted, how many were successfully added to index, how many timed out, how many failed, etc?
The text was updated successfully, but these errors were encountered: