Test musicbrainz db methods against a real musicbrainz sample database #44

alastair · 2020-11-10T18:27:19Z

We found some errors in the musicbrainz_db module due to the mocks in the tests not accurately reflecting what is actually returned by a query: https://github.com/metabrainz/brainzutils-python/blame/a73309306697cd08bc4d0bad2e296cc61c993713/brainzutils/musicbrainz_db/utils.py#L80-L87

This is an experimental setup to include a copy of the musicbrainz sample database, and use it when running tests that use mbdata.

Currently, a multi-step process. To build,

$ docker-compose -f docker-compose.db.yml build
$ docker-compose -f docker-compose.db.yml run --rm test bash
# pytest -m database

some inline comments in the PR

cc @amCap1712

alastair · 2020-11-10T18:28:29Z

brainzutils/musicbrainz_db/tests/test_artist.py

-        mb_artist.mb_session = MagicMock()
-        self.mock_db = mb_artist.mb_session.return_value.__enter__.return_value
-        self.artist_query = self.mock_db.query.return_value.options.return_value.filter.return_value.all
+@pytest.mark.database


in pytest, a mark allows you to tag a particular test, and ask it to only run specific marks, or to exclude them. By marking db tests, we can run two separate sets of tests, the regular unit tests which run quickly, and the database ones, which might be slower

alastair · 2020-11-10T18:29:00Z

brainzutils/musicbrainz_db/tests/test_artist.py

-        self.mock_db = mb_artist.mb_session.return_value.__enter__.return_value
-        self.artist_query = self.mock_db.query.return_value.options.return_value.filter.return_value.all
+@pytest.mark.database
+class TestArtist:


pytest doesn't require tests to inherit from unittest.TestCase. Instead, it just has to follow a specific naming pattern (TestX...)

alastair · 2020-11-10T18:29:22Z

conftest.py

@@ -0,0 +1,8 @@
+import pytest


conftest.py is a magic pytest file that is automatically loaded. it's a bit weird, but this is how you do it

alastair · 2020-11-10T18:30:01Z

conftest.py

+
+
+@pytest.fixture(scope="session")
+def engine():


This is a pytest fixture. It allows you to inject specific data into a test. It's used by adding the name of the fixture as a parameter to a test. It's pretty magic, try not to think about it

alastair · 2020-11-10T18:31:10Z

brainzutils/musicbrainz_db/tests/test_artist.py


-    def test_get_by_id(self):
-        self.artist_query.return_value = [artist_linkin_park]
+    def test_get_by_id(self, engine):


this is how you specify that a test requires a fixture. By marking that this test needs the sqlalchemy engine, it'll connect to the database before running this test. Fixtures are nicer than setUp methods, because you can selectively apply them to only the methods that you need

alastair · 2020-11-10T18:31:22Z

brainzutils/musicbrainz_db/tests/test_artist.py


-    def test_get_by_id(self):
-        self.artist_query.return_value = [artist_linkin_park]
+    def test_get_by_id(self, engine):
        artist = mb_artist.get_artist_by_id("f59c5520-5f46-4d2c-b2c4-822eabf53419")


Luckily, Linkin Park was already in the test data

alastair · 2020-11-10T18:31:53Z

brainzutils/musicbrainz_db/tests/test_artist.py

        artist = mb_artist.get_artist_by_id("f59c5520-5f46-4d2c-b2c4-822eabf53419")
-        self.assertDictEqual(artist, {
+        assert artist == {


pytest uses assert, instead of assert methods on a parent class.

alastair · 2020-11-10T18:33:56Z

test/musicbrainz_db/scripts/createdb.sh

@@ -0,0 +1,94 @@
+#!/bin/bash


These files are copied from the musicbrainz-docker project. Eventually we should be able to merge these projects together to remove the code duplication, but for now this is the easiest way of getting a sample database dump loaded into the musicbrainz-test-database image

alastair · 2020-11-10T18:35:00Z

test/docker-compose.db.yml

+  test:
+    build:
+      context: ..
+      dockerfile: ./test/Dockerfile.py3


currently only testing on py3. Not sure if we should also include py2. It's probably not worth it since we don't use this db access in AB at the moment.

alastair · 2020-11-10T18:36:13Z

conftest.py

+
+@pytest.fixture(scope="session")
+def engine():
+    init_db_engine("postgresql://musicbrainz@musicbrainz_db/musicbrainz_db")


this shouldn't be hard-coded.
There are some pytest plugins that allow for transactional tests, e.g. https://github.com/jeancochrane/pytest-flask-sqlalchemy, however I'm not sure that it's necessary, as we're only using the database in read-only mode.

amCap1712 · 2021-01-09T18:07:51Z

I have updated the tests to use the test database at https://github.com/amCap1712/brainzutils-python/tree/mbdb-test. I cannot add commits to this PR but cherry-picking the latest 3 commits should do. I modified the .travis.yml to run tests only with python 3. With python 2, I get weird encoding errors due to the test data used. If AB is going to be updated to Python 3 soon, I don't think we need to modify the tests to make Python 2 happy.

alastair · 2021-01-12T15:16:09Z

Thanks, I applied your commits to this branch.
We might be able to add a "python 3-only" flag to these tests so that we can still run the other tests in py 2.
Additionally, we should look at moving these tests to jenkins anyway...

Import the musicbrainz sample dump for sqlalchemy integration tests

aac098d

alastair commented Nov 10, 2020

View reviewed changes

amCap1712 added 3 commits January 9, 2021 22:14

Use test database instead of mocks

aed55fc

Add editor tests

30a2b80

Modify travis.yml to run tests with database

8884298

amCap1712 added 2 commits February 10, 2021 18:11

Run python 2 CI without database tests

5e27f8a

Add utf8 encoding header required by python2

6210681

alastair merged commit 6210681 into master Feb 10, 2021

alastair deleted the mbdb-test branch February 10, 2021 14:07

alastair mentioned this pull request Feb 10, 2021

WIP: Simplify BU tests and configure to run in Jenkins #48

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Test musicbrainz db methods against a real musicbrainz sample database #44

Test musicbrainz db methods against a real musicbrainz sample database #44

alastair commented Nov 10, 2020

alastair Nov 10, 2020

alastair Nov 10, 2020

alastair Nov 10, 2020

alastair Nov 10, 2020

alastair Nov 10, 2020

alastair Nov 10, 2020

alastair Nov 10, 2020

alastair Nov 10, 2020

alastair Nov 10, 2020

alastair Nov 10, 2020

amCap1712 commented Jan 9, 2021

alastair commented Jan 12, 2021

Test musicbrainz db methods against a real musicbrainz sample database #44

Test musicbrainz db methods against a real musicbrainz sample database #44

Conversation

alastair commented Nov 10, 2020

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

amCap1712 commented Jan 9, 2021

alastair commented Jan 12, 2021