feat(synthetic-sf): handle GPU memory contention in parallel batch processing

## Summary

When `process_batch` in `src/sampleworks/eval/generate_synthetic_sf.py` is run with `n_jobs > 1` and a CUDA device, multiple `loky` worker processes each initialise their own CUDA context, competing for GPU memory. This can cause OOM errors or severe performance degradation.

## Background

Raised during review of PR #234 (comment: https://github.com/diff-use/sampleworks/pull/234#discussion_r3251582043).

Even with multiple workers, jobs likely converge on one or a small number of GPUs, so a more targeted fix may involve specifying the device more precisely per worker rather than simply forcing `n_jobs=1`.

## Suggested approaches
- Detect `device.type == "cuda"` and, depending on available memory, either warn and cap `n_jobs` or assign each worker an explicit GPU device index.
- Consider exposing a per-worker device assignment strategy.

## Requested by
@marcuscollins

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(synthetic-sf): handle GPU memory contention in parallel batch processing #241

Summary

Background

Suggested approaches

Requested by

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

feat(synthetic-sf): handle GPU memory contention in parallel batch processing #241

Description

Summary

Background

Suggested approaches

Requested by

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions