Skip to content

ctacke/DataMaker

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

DataMaker

DataMaker Logo

A .NET library for quickly generating test data entities from existing data sources.

.NET 8.0 License: MIT

Overview

DataMaker simplifies test data generation by mapping your existing database tables (or other data sources) to strongly-typed C# entities. Instead of writing complex test data setup code, define your mappings once and generate as many test instances as you need.

Key Features

  • Strongly-typed mappings using lambda expressions
  • Foreign key lookups - automatically resolve relationships between tables
  • Multiple selection strategies - sequential or random data selection
  • Fluent query API - use FirstAdd() to ensure specific items are included in results
  • Deterministic ID generation - generate reproducible Snowflake IDs, GUIDs, and more
  • Provider pattern - easily extend to support different data sources (SQL Server, CSV, etc.)
  • Fluent API - clean, readable configuration
  • Type-safe - compile-time validation of your mappings

Installation

NuGet Package

dotnet add package DataMaker

Or via Package Manager Console:

Install-Package DataMaker

Build from Source

git clone https://github.com/ctacke/DataMaker.git
cd DataMaker
dotnet build

Quick Start

Basic Usage

using DataMaker;

// 1. Create a Generator instance
var generator = new Generator();

// 2. Add a data provider (SQL Server, CSV, etc.)
generator.AddProvider(new SqlServerDataProvider("YourConnectionString"));

// 3. Configure mappings for your entity
generator.AddDataMap<User>("Users")  // Primary table name
    .WithColumn(u => u.Name, "Name")
    .WithColumn(u => u.Email, "Email");

// 4. Generate test data
var users = generator.Generate<User>(10);  // Generate 10 users

foreach (var user in users)
{
    Console.WriteLine($"{user.Name} - {user.Email}");
}

Entity Definition

public class User
{
    public string? Name { get; set; }
    public string? Email { get; set; }
    public string? City { get; set; }
}

Core Concepts

1. Data Providers

Data providers implement IDataProvider and retrieve data from a source. DataMaker currently includes:

  • SqlServerDataProvider - Retrieves data from SQL Server databases
  • SqliteDataProvider - Retrieves data from SQLite databases
// SQL Server
var sqlProvider = new SqlServerDataProvider(
    "Server=.;Database=TestDb;Integrated Security=True;"
);
generator.AddProvider(sqlProvider);

// SQLite - accepts file path or connection string
var sqliteProvider = new SqliteDataProvider("path/to/database.sqlite");
generator.AddProvider(sqliteProvider);

2. Primary Tables

Each entity maps to a primary table. When generating data, DataMaker selects rows from this table based on your chosen strategy.

generator.AddDataMap<Product>("Products")  // "Products" is the primary table

3. Selection Strategies

Control how rows are selected from the primary table:

// Sequential - rows 0, 1, 2, 3... (wraps around if count > rows)
var sequential = generator.Generate<User>(10, SelectionStrategy.Sequential);

// Random - randomly selected rows
var random = generator.Generate<User>(10, SelectionStrategy.Random);

4. Fluent Query with FirstAdd

The Generate<T>() method returns a GeneratedQuery<T> that supports fluent filtering with FirstAdd(). This ensures specific items matching a predicate are included first, then fills the remaining slots with other items.

// Get 10 customers, ensuring the one with ID=5 is included first
var customers = generator
    .Generate<Customer>(10)
    .FirstAdd(c => c.Id == 5)
    .ToList();

// Chain multiple FirstAdd calls - items matching ANY predicate are prioritized
var customers = generator
    .Generate<Customer>(10)
    .FirstAdd(c => c.Id == specificGuid1)
    .FirstAdd(c => c.Id == specificGuid2)
    .ToList();
// Result: Items with specificGuid1 and specificGuid2 first, then 8 more random items

Key behaviors:

  • Items matching any FirstAdd() predicate are yielded first
  • Remaining slots are filled with other items based on the selection strategy
  • Multiple FirstAdd() calls can be chained (items matching ANY predicate are prioritized)
  • Uses deferred execution - generation happens when enumerated (like LINQ)

Mapping Methods

WithColumn - Simple Column Mapping

Maps a property directly to a column in the primary table.

// Explicit column name (when property name differs from column name)
generator.AddDataMap<User>("Users")
    .WithColumn(u => u.Name, "FullName")
    .WithColumn(u => u.Email, "EmailAddress");

// Auto-mapping (when property name matches column name)
generator.AddDataMap<Product>("Products")
    .WithColumn(p => p.Name)           // Maps to "Name" column
    .WithColumn(p => p.Price)          // Maps to "Price" column
    .WithColumn(p => p.Cost, "StandardCost");  // Explicit when different

WithSequence - Sequential Value Generation

Generates sequential numeric values, perfect for IDs or counters.

// Generate sequential IDs starting at 1: 1, 2, 3, 4...
generator.AddDataMap<User>("Users")
    .WithSequence(u => u.Id)
    .WithColumn(u => u.Name);

// Generate IDs starting at a specific value
generator.AddDataMap<Order>("Orders")
    .WithSequence(o => o.OrderId, startValue: 1000)  // 1000, 1001, 1002...
    .WithColumn(o => o.Description);

WithValue - Generated Values

Generates values using custom functions. Perfect for GUIDs, timestamps, or computed values.

// Generate GUIDs
generator.AddDataMap<Product>("Products")
    .WithValue(p => p.Id, () => Guid.NewGuid())
    .WithColumn(p => p.Name);

// Generate timestamps
generator.AddDataMap<Event>("Events")
    .WithValue(e => e.CreatedAt, () => DateTime.UtcNow)
    .WithColumn(e => e.Description);

// Generate values with index access
generator.AddDataMap<User>("Users")
    .WithValue(u => u.Username, index => $"user{index + 1}")           // user1, user2, user3...
    .WithValue(u => u.Email, index => $"user{index + 1}@example.com")  // user1@example.com...
    .WithValue(u => u.Code, index => $"USR-{index + 1:D5}")            // USR-00001, USR-00002...
    .WithColumn(u => u.FirstName);

WithLookup - Foreign Key Relationships

Automatically resolves foreign key relationships to related tables.

generator.AddDataMap<User>("Users")
    .WithColumn(u => u.Name, "Name")
    .WithLookup(
        u => u.City,           // Target property
        "Addresses",           // Related table
        "AddressId",           // Foreign key in primary table (Users)
        "Id",                  // Primary key in related table (Addresses)
        "City"                 // Column to retrieve from related table
    );

How it works:

  1. Selects a row from the Users table
  2. Reads the AddressId value from that row
  3. Finds the matching row in Addresses where Id = AddressId
  4. Returns the City value from the matched address row

WithTableMap - Custom Mappings

For complex scenarios, use custom mapping functions with access to the row and provider.

generator.AddDataMap<User>("Users")
    .WithColumn(u => u.FirstName, "FirstName")
    .WithColumn(u => u.LastName, "LastName")
    .WithTableMap(
        u => u.FullName,
        (row, provider) => $"{row["FirstName"]} {row["LastName"]}"
    );

With Index Access - For custom sequential or computed values:

generator.AddDataMap<Order>("Orders")
    .WithTableMap(o => o.OrderNumber, (row, provider, index) => $"ORD-{index + 1000:D5}")  // ORD-01000, ORD-01001...
    .WithTableMap(o => o.CustomId, (row, provider, index) => index + 1)  // Sequential: 1, 2, 3...
    .WithColumn(o => o.Description, "Description");

Advanced Examples

Sequential ID Generation

Generate entities with auto-incrementing IDs without pulling them from the database:

public class Customer
{
    public int Id { get; set; }
    public string? Name { get; set; }
    public string? Email { get; set; }
}

generator.AddDataMap<Customer>("Customers")
    .WithSequence(c => c.Id)  // Auto-generate: 1, 2, 3, 4...
    .WithColumn(c => c.Name, "Name")
    .WithColumn(c => c.Email, "Email");

var customers = generator.Generate<Customer>(100);
// Creates customers with IDs from 1 to 100

Complex Entity with Multiple Lookups

public class Order
{
    public string? OrderNumber { get; set; }
    public string? CustomerName { get; set; }
    public string? ProductName { get; set; }
    public decimal? Price { get; set; }
}

generator.AddDataMap<Order>("Orders")
    .WithColumn(o => o.OrderNumber, "OrderNum")
    .WithLookup(o => o.CustomerName, "Customers", "CustomerId", "Id", "Name")
    .WithLookup(o => o.ProductName, "Products", "ProductId", "Id", "Name")
    .WithLookup(o => o.Price, "Products", "ProductId", "Id", "Price");

Combining Multiple Tables

generator.AddDataMap<Employee>("Employees")
    .WithColumn(e => e.EmployeeId, "Id")
    .WithColumn(e => e.FirstName, "FirstName")
    .WithColumn(e => e.LastName, "LastName")
    .WithLookup(e => e.DepartmentName, "Departments", "DeptId", "Id", "Name")
    .WithLookup(e => e.ManagerName, "Employees", "ManagerId", "Id", "FirstName")
    .WithLookup(e => e.City, "Addresses", "AddressId", "Id", "City")
    .WithLookup(e => e.State, "Addresses", "AddressId", "Id", "State");

var employees = generator.Generate<Employee>(50, SelectionStrategy.Random);

Custom Data Transformation

generator.AddDataMap<Product>("Products")
    .WithColumn(p => p.Name, "ProductName")
    .WithTableMap(
        p => p.DisplayName,
        (row, provider) =>
        {
            var name = row["ProductName"].ToString();
            var category = row["Category"].ToString();
            return $"{category} - {name}";
        }
    );

Custom Sequential Formats

Generate formatted IDs, order numbers, or codes using the index:

public class Invoice
{
    public int Id { get; set; }
    public string? InvoiceNumber { get; set; }
    public string? ReferenceCode { get; set; }
    public decimal? Amount { get; set; }
}

generator.AddDataMap<Invoice>("Invoices")
    .WithSequence(i => i.Id)  // Simple numeric: 1, 2, 3...
    .WithTableMap(
        i => i.InvoiceNumber,
        (row, provider, index) => $"INV-2024-{index + 1:D6}")  // INV-2024-000001, INV-2024-000002...
    .WithTableMap(
        i => i.ReferenceCode,
        (row, provider, index) =>
        {
            var year = DateTime.Now.Year;
            var month = DateTime.Now.Month;
            return $"{year}{month:D2}-{index + 1000}";  // 202401-1000, 202401-1001...
        })
    .WithColumn(i => i.Amount, "Amount");

var invoices = generator.Generate<Invoice>(50);

SQLite with Auto-Mapping and Generated Values

Combine SQLite provider with auto-mapping and value generation:

public class Order
{
    public Guid Id { get; set; }
    public int OrderNumber { get; set; }
    public string? CustomerName { get; set; }
    public string? ProductName { get; set; }
    public decimal? Price { get; set; }
    public DateTime CreatedAt { get; set; }
}

var generator = new Generator();
generator.AddProvider(new SqliteDataProvider("Northwind.sqlite"));

generator.AddDataMap<Order>("Orders")
    .WithValue(o => o.Id, () => Guid.NewGuid())                     // Generate GUID
    .WithValue(o => o.OrderNumber, index => 1000 + index)           // Sequential: 1000, 1001...
    .WithValue(o => o.CreatedAt, () => DateTime.UtcNow)             // Timestamp
    .WithColumn(o => o.CustomerName)                                // Auto-map to "CustomerName"
    .WithColumn(o => o.ProductName)                                 // Auto-map to "ProductName"
    .WithColumn(o => o.Price);                                      // Auto-map to "Price"

var orders = generator.Generate<Order>(50);

Custom Sequential Formats

Generate formatted IDs, order numbers, or codes using the index:

public class Invoice
{
    public int Id { get; set; }
    public string? InvoiceNumber { get; set; }
    public string? ReferenceCode { get; set; }
    public decimal? Amount { get; set; }
}

generator.AddDataMap<Invoice>("Invoices")
    .WithSequence(i => i.Id)  // Simple numeric: 1, 2, 3...
    .WithTableMap(
        i => i.InvoiceNumber,
        (row, provider, index) => $"INV-2024-{index + 1:D6}")  // INV-2024-000001, INV-2024-000002...
    .WithTableMap(
        i => i.ReferenceCode,
        (row, provider, index) =>
        {
            var year = DateTime.Now.Year;
            var month = DateTime.Now.Month;
            return $"{year}{month:D2}-{index + 1000}";  // 202401-1000, 202401-1001...
        })
    .WithColumn(i => i.Amount, "Amount");

var invoices = generator.Generate<Invoice>(50);

Deterministic ID Generation

The IdGenerator class generates deterministic IDs for testing purposes. When initialized with the same seed, it produces the same sequence of IDs every time - perfect for reproducible tests.

Basic Usage

// Create a generator with a seed for deterministic results
var idGen = new IdGenerator(seed: 12345);

// Generate Snowflake-like long IDs (64-bit)
var snowflakeId1 = idGen.NextLong();  // e.g., 7129382947102834688
var snowflakeId2 = idGen.NextLong();  // Different, but deterministic

// Generate deterministic GUIDs (RFC 4122 compliant)
var guid1 = idGen.NextGuid();  // e.g., a1b2c3d4-e5f6-7890-abcd-ef1234567890

// Generate deterministic int IDs
var intId = idGen.NextInt();  // Positive integers

// Generate string IDs with optional prefix
var stringId = idGen.NextString(length: 10, prefix: "USR-");  // e.g., "USR-A7B3K9M2X1"

Generating Batches

var idGen = new IdGenerator(seed: 42);

// Generate multiple IDs at once
var longIds = idGen.TakeLong(100).ToList();
var guids = idGen.TakeGuid(50).ToList();
var intIds = idGen.TakeInt(25).ToList();
var stringIds = idGen.TakeString(10, length: 8, prefix: "ORD-").ToList();

Reproducible Test Data

[Fact]
public void OrderProcessing_ShouldCalculateTotal()
{
    // Same seed = same IDs every test run
    var idGen = new IdGenerator(seed: 99999);

    var order = new Order
    {
        Id = idGen.NextLong(),           // Always the same ID
        CustomerId = idGen.NextGuid(),   // Always the same GUID
        OrderNumber = idGen.NextString(prefix: "ORD-")
    };

    // Test logic...
}

Resetting the Generator

var idGen = new IdGenerator(seed: 12345);
var id1 = idGen.NextLong();
var id2 = idGen.NextLong();

// Reset to regenerate the same sequence
idGen.Reset(seed: 12345);
var sameAsId1 = idGen.NextLong();  // Equal to id1
var sameAsId2 = idGen.NextLong();  // Equal to id2

Fluent Integration with DataMap

Use deterministic IDs directly in your data mappings with the WithDeterministic* methods:

generator.AddDataMap<Customer>("Customers")
    .WithDeterministicLong(c => c.Id, seed: 12345)           // Snowflake-like long IDs
    .WithDeterministicGuid(c => c.ExternalId, seed: 67890)   // Deterministic GUIDs
    .WithDeterministicString(c => c.Code, seed: 11111, length: 10, prefix: "CUST-")
    .WithColumn(c => c.Name, "Name");

// Every test run produces the same IDs
var customers = generator.Generate<Customer>(100).ToList();

Available fluent methods:

// Long IDs (Snowflake-like)
.WithDeterministicLong(c => c.Id, seed: 12345)

// Int IDs
.WithDeterministicInt(c => c.LegacyId, seed: 12345)

// GUIDs
.WithDeterministicGuid(c => c.ExternalId, seed: 12345)

// String IDs with optional length and prefix
.WithDeterministicString(c => c.Code, seed: 12345)
.WithDeterministicString(c => c.Code, seed: 12345, length: 12)
.WithDeterministicString(c => c.Code, seed: 12345, length: 10, prefix: "ORD-")

Supported ID Types

Method Type Description
NextLong() long Snowflake-like 64-bit IDs with timestamp, worker, and sequence components
NextInt() int Positive 32-bit integers
NextGuid() Guid RFC 4122 compliant version 4 GUIDs
NextString(length, prefix) string Alphanumeric strings with optional prefix

API Reference

Generator Class

Method Description
AddProvider(IDataProvider) Adds a data provider for retrieving data
AddDataMap<T>(string tableName) Creates a mapping configuration for type T with specified primary table
Generate<T>(int count, SelectionStrategy, allowRepeats) Returns a GeneratedQuery<T> for generating entities with optional filtering

GeneratedQuery Class

Method Description
FirstAdd(Func<T, bool> predicate) Ensures items matching the predicate are included first, then fills remaining slots with other items
ToList(), ToArray(), etc. Standard LINQ methods trigger deferred execution and return the generated items

IdGenerator Class

Method Description
IdGenerator(int seed) Creates a generator with the specified seed for deterministic ID generation
NextLong() Generates the next Snowflake-like 64-bit ID
NextInt() Generates the next positive 32-bit integer ID
NextGuid() Generates the next deterministic GUID
NextString(length, prefix) Generates the next alphanumeric string ID
TakeLong(count) Generates a sequence of long IDs
TakeInt(count) Generates a sequence of int IDs
TakeGuid(count) Generates a sequence of GUIDs
TakeString(count, length, prefix) Generates a sequence of string IDs
Reset(seed) Resets the generator to regenerate the same sequence

DataMap Class

Method Description
WithColumn(property) Maps property to column with same name in primary table
WithColumn(property, columnName) Maps property to specified column in primary table
WithSequence(property, startValue) Generates sequential numeric values (default starts at 1)
WithValue(property, func) Generates values using a function (GUIDs, timestamps, etc.)
WithValue(property, indexFunc) Generates values using a function with index access
WithLookup(property, table, fk, pk, column) Maps property using foreign key lookup
WithTableMap(property, func) Maps property using custom function (with optional index parameter)
WithDeterministicLong(property, seed) Maps property to deterministic Snowflake-like long IDs
WithDeterministicInt(property, seed) Maps property to deterministic int IDs
WithDeterministicGuid(property, seed) Maps property to deterministic GUIDs
WithDeterministicString(property, seed, length, prefix) Maps property to deterministic string IDs

SelectionStrategy Enum

Value Description
Sequential Selects rows in order (0, 1, 2...), wraps around if needed
Random Randomly selects rows from available data

Data Providers

SQL Server Provider

var provider = new SqlServerDataProvider(connectionString);
generator.AddProvider(provider);

Features:

  • Lazy loading and caching of table data
  • SQL injection protection via table name validation
  • Support for standard SQL Server table naming conventions

Security: Table names are validated using a whitelist pattern (alphanumeric, underscore, period, brackets only).

SQLite Provider

// Option 1: File path
var provider = new SqliteDataProvider("path/to/database.sqlite");
generator.AddProvider(provider);

// Option 2: Connection string
var provider = new SqliteDataProvider("Data Source=path/to/database.sqlite");
generator.AddProvider(provider);

Features:

  • Supports file paths or full connection strings
  • Automatic file validation
  • Lazy loading and caching of table data
  • SQL injection protection via table name validation
  • Perfect for lightweight databases and testing scenarios

Use Cases:

  • Unit testing with embedded test databases
  • Prototyping with sample data
  • Generating test data from Northwind or other sample SQLite databases

Creating Custom Providers

Implement IDataProvider to support other data sources:

public class CsvDataProvider : IDataProvider
{
    public IDataEntity this[string entityName]
    {
        get
        {
            // Load CSV file and return as IDataEntity
        }
    }
}

Best Practices

  1. Reuse Generator instances - Configure once, generate many times
  2. Use WithColumn for simple mappings - More performant than custom functions
  3. Cache related table data - Providers cache table data automatically
  4. Validate your mappings - Run tests to ensure foreign keys resolve correctly
  5. Choose appropriate selection strategy - Use Sequential for deterministic tests, Random for broader coverage

Performance Considerations

  • Table caching: Each table is loaded once and cached by the provider
  • Foreign key lookups: Performed via LINQ queries on cached data
  • Large datasets: Consider using views or filtered queries in your provider

Troubleshooting

"No data provider has been added"

Call AddProvider() before calling Generate().

"Primary table 'X' has no rows"

Ensure your database table contains data.

Foreign key lookup returns null

  • Verify the foreign key column name matches your database
  • Ensure related table has matching records
  • Check that key values match (types and values)

Type conversion errors

DataMaker attempts automatic type conversion. For complex types, use WithTableMap with custom conversion logic.

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

  1. Fork the repository
  2. Create your feature branch (git checkout -b feature/AmazingFeature)
  3. Commit your changes (git commit -m 'Add some AmazingFeature')
  4. Push to the branch (git push origin feature/AmazingFeature)
  5. Open a Pull Request

License

This project is licensed under the MIT License - see the LICENSE file for details.

Support

For issues, questions, or suggestions:

  • Open an issue on GitHub
  • Check existing issues for solutions
  • Review the examples in the test project

About

A library to quickly generate test data entities from your data sources

Resources

License

Stars

Watchers

Forks

Packages

 
 
 

Contributors

Languages