- As we still have all the chunk data, we can re-run extraction and post-processing.
- The open question is what happens to the existing entities: do we keep them, remove them, or merge the results of the two processing runs?
- The answer probably also depends on whether the schema change is additive or subtractive.
- Should we store the schema that was used on the document node, as we already do for the model and other parameters?
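To make the additive vs. subtractive distinction concrete, here is a minimal sketch of how a schema change could be classified before deciding what to do with existing entities. The `Schema` class, its fields, and `classify_change` are hypothetical names for illustration only. They assume a schema is simply a set of node labels plus a set of relationship types, which may not match the real schema format.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Schema:
    """Hypothetical schema: allowed node labels and relationship types."""
    labels: frozenset
    rel_types: frozenset


def classify_change(old: Schema, new: Schema) -> str:
    """Compare the stored schema with the new one and name the kind of change."""
    # Anything present in the new schema but not in the old one was added.
    added = bool((new.labels - old.labels) or (new.rel_types - old.rel_types))
    # Anything present in the old schema but not in the new one was removed.
    removed = bool((old.labels - new.labels) or (old.rel_types - new.rel_types))

    if added and removed:
        return "mixed"
    if added:
        return "additive"
    if removed:
        return "subtractive"
    return "unchanged"


# Example: the new schema adds a label and keeps everything else.
old_schema = Schema(frozenset({"Person", "Company"}), frozenset({"WORKS_AT"}))
new_schema = Schema(frozenset({"Person", "Company", "Product"}), frozenset({"WORKS_AT"}))
change = classify_change(old_schema, new_schema)
```

A comparison like this only works if the previously used schema is available, which is one argument for storing it on the document node. What each outcome should trigger (keep, remove, or merge existing entities) is still the open question above.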