This is already being used in Serval to generate warnings for developers and Serval admins. We could integrate this into silnlp as a standalone script or maybe as part of `extract_corpora`. (See https://github.com/sillsdev/machine.py/pull/245 for code updates).