
DEVELOP-1437: Test data and automatic tests #46

Merged

Conversation

matrulda (Collaborator)

This PR adds test data and automatic tests for the pipeline.

In summary, this has been done:

  • The folder test_data has been added with the files needed to run a complete integration test of the pipeline. Among other things, it contains a run folder (210510_M03910_0104_000000000-JHGJL) from Illumina's public Demo space. The majority of files in this PR originate from this folder and do not need a detailed inspection.
  • The complete integration test can be run using the new test profile, like this:
nextflow run main.nf -profile dev,test,singularity
  • The folder tests contains:
    • Integration tests: In these tests the pipeline is run with the test profile and the output is then validated. Validation includes verifying that reports exist and that they contain the sections they should (a minimal sketch of this pattern follows this list).
    • Unit tests: These are unit tests for the Python scripts in bin that we use in the pipeline.
  • A GitHub Actions workflow has been added so that the tests run every time code is pushed to the repo.
  • All Python code has been formatted with black, and this is now enforced through a check in the test workflow.
  • I've added instructions on how to run the tests locally.
  • Paths to images have been updated so that we use images built for Singularity instead of Docker. Singularity can convert Docker images, but this became an issue when running the tests on GitHub's servers. Using native Singularity images also makes the pipeline faster and more efficient. Related to this change, I had to bump some of the software we use to new versions in order to find publicly hosted images. My assessment is that these upgrades (minor and patch) are safe.
  • The configs in config have been sorted into the subfolders nextflow_config and tools_config.
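
For illustration, a minimal sketch of what such an output check could look like. It assumes a session-scoped result_dir fixture that has already run the pipeline with the test profile; the report file name and section title below are placeholders, not the pipeline's actual output.

import os


def test_report_contains_expected_section(result_dir):
    # Placeholder report name and section title, for illustration only.
    report = os.path.join(str(result_dir), "multiqc_report.html")
    assert os.path.isfile(report)
    with open(report) as handle:
        content = handle.read()
    assert "General Statistics" in content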

GitEdvard (Contributor) left a comment:

Good work! I appreciate that the commits were relatively restricted in size and had good explanations. As you can see, I dived into the structure of the tests a bit. Let's see what you think about that! Aside from that, I think it's important to check whether the test directory is wiped between runs or not (if it's not, I think you should implement it).

Edit: I've checked now; the test directory is handled by tmpdir_factory, so you don't need to do anything.

test_data/test_config/fastq_screen.conf (resolved)
requirements-dev.txt (resolved)
README.md (outdated, resolved)
tests/integration_tests/test_validate_output.py (outdated, resolved)


@pytest.fixture(scope="session", autouse=True)
def result_dir(tmpdir_factory):
GitEdvard (Contributor):

When a function is named with a noun, I anticipate a rather simple fetch. But here, the main action of the test, namely starting a Nextflow pipeline, is somewhat hidden. I would like a separate and explicit call for this.
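
One way to read this suggestion, as a rough sketch with hypothetical names (the pipeline's real parameters may differ): move the pipeline launch into an explicitly named function and let the fixture only prepare the directory and delegate.

import subprocess

import pytest


def start_nextflow_pipeline(output_dir):
    # The explicit action: run the pipeline with the test profile.
    # The --output_dir parameter name is an assumption for this sketch.
    subprocess.run(
        ["nextflow", "run", "main.nf",
         "-profile", "dev,test,singularity",
         "--output_dir", str(output_dir)],
        check=True,
    )


@pytest.fixture(scope="session", autouse=True)
def result_dir(tmpdir_factory):
    # The fixture now only provides a directory; the pipeline start is
    # visible as a separate, explicit call.
    output_dir = tmpdir_factory.mktemp("results")
    start_nextflow_pipeline(output_dir)
    return output_dir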

matrulda (Collaborator Author):

Good point. I tried to address this in 58d7a4b

tests/integration_tests/test_validate_output.py (outdated, resolved)


def test_project_dirs_exist(project_reports_dir, projects):
    for project in projects:
GitEdvard (Contributor):

When reading tests like these, I would like an easily available picture of the folder tree generated by the pipeline. I find it a bit cumbersome that the folder paths are parameterized, and hence fragmented; to get a picture of the folder structure, I have to assemble them in a notebook on the side. The general advice I've gotten for automated tests is to favor readability and verboseness over the DRY principle. Could you consider writing them out as they are in each test? Like "for project in ['duck', 'wolf']", and project_reports_dir = r'path/to/reports'.
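
As a rough sketch of the more verbose style suggested here (the project names "duck" and "wolf" and the reports path are only placeholders, not the pipeline's actual layout):

import os


def test_project_dirs_exist(result_dir):
    # Paths and project names written out in the test itself, so the
    # expected folder tree is visible at a glance.
    project_reports_dir = os.path.join(str(result_dir), "reports", "projects")
    for project in ["duck", "wolf"]:
        assert os.path.isdir(os.path.join(project_reports_dir, project))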

matrulda (Collaborator Author):

Good point! I've tried to make it more readable in 58d7a4b. I think it better follows the Arrange, Act, Assert model now, even though the "Act" part comes from result_dir as input. I'd like to keep it this way to make use of the pytest fixture feature, but let me know if you still think it's too unclear.

matrulda (Collaborator Author) commented Feb 3, 2022

@GitEdvard Thanks for the comments! I've addressed them, let me know if you think it's ok.

GitEdvard (Contributor):

Great!

GitEdvard merged commit a0a20b1 into Molmed:dev on Feb 4, 2022