-
Notifications
You must be signed in to change notification settings - Fork 360
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Allow Analysis While Importing Separate Data #237
Comments
This change will affect what will happen if RITA were to crash/ be killed during import. Currently, if an import run crashes, the data that has been imported can be used for analysis. However, it is very likely that data is missing due to the crash. This change will prevent users from analyzing data that didn't come from a clean import session. |
Currently, if you were to open two terminals, each containing an instance of RITA, and ran both
rita import
andrita analyze
, therita analyze
command will pick up on the databases being populated byrita import
and try to analyze them . The analysis step uses the imported data at several points. If the data changes in between these steps, RITA will produce corrupt results. In order to prevent this from occurring, I propose we add aimport_finished
flag to the MetaDatabase.We can implement a ready to analyze flag by adding the field
import_finished
to RITA's MetaDatabase database records.Current MetaDB Database schema:
How to Alter the Import Process
import_finished
flag is set totrue
How to Alter the Analyze Process
import_finished == false
), remove it from considerationAdditionally, this feature will help support streaming importers as they constantly feed data to the RITA system. If
rita analyze
is run at any time with a streaming importer, RITA will produce corrupt results. With the addition of this field, a streaming importer can make a guarantee that it won't insert any more records into a database, and RITA can use that guarantee to safely analyze a database.The text was updated successfully, but these errors were encountered: