
Create suite of sanity tests across DBs and types for equivalency #39

@chadlwilson

Description

Context / Goal

Since the tool is designed to help you validate migration of data across different schema types and even (relational) database implementations, and primarily works based on hashed data representing a row in a dataset, we want to have a way to validate that it is actually working correctly for this purpose.

Since there are potential differences in the way drivers handle things such as character encodings, number types, and timestamp/date types, we want to ensure that the hashed data representing a given value in one database is considered hash-identical to that of another. If it is not, there should be a simple SQL-based way to make them identical, or we should probably change our implementation.

Expected Outcome

  • Modify/refactor the MultiDataSourceConnectivityIntegrationTest from Configure integration test tooling to allow testing across DB types #28 so that, instead of just testing R2DBC connectivity via Micronaut, it provides a suite of simple scenarios doing real integration testing, focused on type differences
    • It is likely that the hashing impl in HashedRow is not going to work correctly. An int with value 10 in one DB will likely not be considered equal to a long with value 10 in another DB, and similarly for other types. We will have to decide how values should be canonicalized, and how configurable that needs to be.
    • Should a string of "10" be considered equal to a bigint of 10?
  • Run simple, but real reconciliations that focus on ensuring that hashed values from one DB of a given type are equivalent to that of a different DB
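The canonicalization problem above can be sketched as follows. This is not the actual HashedRow implementation; the names (`canonicalize`, `hashOf`) and the type-tagging scheme are illustrative assumptions. The idea is that all integral and decimal types collapse to one decimal string before hashing, so an int 10 and a long 10 hash identically, while the string "10" is kept distinct (one possible answer to the open question above).

```kotlin
import java.math.BigDecimal
import java.security.MessageDigest

// Hypothetical sketch: canonicalize driver-specific values to a common
// string form before hashing. A "num:"/"str:" tag keeps numbers and
// strings distinct; dropping the tag would make "10" equal to bigint 10.
fun canonicalize(value: Any?): String = when (value) {
    null -> "null:"
    // All integral types collapse to a single decimal representation
    is Byte, is Short, is Int, is Long -> "num:" + (value as Number).toLong()
    // Floating point/decimal normalized via BigDecimal, trailing zeros stripped
    is Float, is Double -> "num:" + BigDecimal(value.toString()).stripTrailingZeros().toPlainString()
    is BigDecimal -> "num:" + value.stripTrailingZeros().toPlainString()
    else -> "str:$value"
}

fun hashOf(vararg columns: Any?): String {
    val digest = MessageDigest.getInstance("SHA-256")
    columns.forEach { digest.update(canonicalize(it).toByteArray(Charsets.UTF_8)) }
    return digest.digest().joinToString("") { "%02x".format(it) }
}

fun main() {
    // An Int 10 and a Long 10 now hash identically across drivers...
    check(hashOf(10) == hashOf(10L))
    // ...and a DOUBLE 10.0 canonicalizes to the same value as well...
    check(hashOf(10.0) == hashOf(10L))
    // ...but the String "10" still differs, pending a decision on cross-type equality
    check(hashOf("10") != hashOf(10L))
}
```

How configurable this needs to be (e.g. whether string/number equivalence should be opt-in per dataset) is exactly the decision flagged above.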

Out of Scope

Additional context / implementation notes

  • At time of writing we are using the Exposed framework in test code to generate schemas for testing with. This may not give us the level of control over data types in the databases that we require, and may need to be re-evaluated.
  • Possibly these tests could run in a matrix style with simple dataset queries on each side, like SELECT id AS MigrationKey, test_type_column FROM testdata, creating a single table with a single test-data column per test
    • Dimension 1: DB (mysql, postgres, mssql)
    • Dimension 2: DB Type under test (CHAR/VARCHAR, INT/INTEGER, BIGINT, NUMERIC/DECIMAL/REAL/FLOAT, DATETIME/DATE/TIME/TIMESTAMP etc)
  • Types
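The matrix idea above can be sketched as a cross-product of database pairs and type columns, each yielding the simple per-test query. The DB list comes from the dimensions above; the column names and the `queryFor` helper are illustrative assumptions, not an existing API.

```kotlin
// Hypothetical matrix of reconciliation scenarios:
// Dimension 1: source/target DB; Dimension 2: column type under test.
val dbs = listOf("mysql", "postgres", "mssql")
val typeColumns = listOf(
    "char_col", "varchar_col", "int_col",
    "bigint_col", "decimal_col", "datetime_col"
)

// Each scenario runs the same trivial query on both sides
fun queryFor(column: String) = "SELECT id AS MigrationKey, $column FROM testdata"

fun main() {
    val scenarios = dbs.flatMap { source ->
        dbs.flatMap { target ->
            typeColumns.map { col -> Triple(source, target, col) }
        }
    }
    // 3 source DBs x 3 target DBs x 6 columns = 54 scenarios
    check(scenarios.size == 54)
    scenarios.take(2).forEach { (src, tgt, col) ->
        println("$src -> $tgt : ${queryFor(col)}")
    }
}
```

Generating the scenarios programmatically like this would keep each test down to one table with one typed column, as described above.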

Metadata

Labels

size:M (medium items), task (General technical task)
