New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
processing pipeline example #381
Comments
@fergiemcdowall, could you explain a little how a module would fit into the processing pipeline? It could take a URL key/value and duplicate it to also become an ID key with the same value. |
Then I'll make some examples and create some documentation. |
Yes, you could do something like: fs.createReadStream('myData')
.pipe(myAmazingProcessingStage)
.pipe(index.feed()) |
I'll test. myAmazingProcessingStage doesn't need to be written streamy, or do it? I guess I have no idea what I'm heading into =) Espen |
Its not toooo ridiculous- they are just transform streams. Here are some examples: https://github.com/fergiemcdowall/docproc/tree/master/pipeline |
Thanks, I'll test based on the IngestDoc.js. It seems doable. |
Or, maybe Spy.js is the easiest to base it on, @fergiemcdowall ? |
Yes, and you would of course do something with |
Thanks. I can do this =) |
I've added this to the list: #458 |
See if I can get the Chinese tokenizer working in the add pipeline.
The text was updated successfully, but these errors were encountered: