HDLBits Dataset for RLFT [test]

### Dataset Collection
- [x] Gather questions from **HDLBits** across categories:
  - [x] Basics
  - [x] Vectors
  - [x] Modules & Hierarchy
  - [x] Procedures
  - [ ] Combinational Logic (Gates, Multiplexers, Arithmetic Circuits, etc.)
  - [ ] Sequential Logic (Latches, Flip-flops, Counters, Shift Registers, etc.)
- [x] Expand coverage with **augmented questions** (variation, rephrasing, scaling difficulty)
- [ ] Prepare additional sources if needed (e.g., RTL-Repo, custom prompts)

### Dataset Structuring
- [x] Define consistent schema:  
  - Question / Problem statement  
  - Expected input/output behavior (testbench or truth table)  
  - Ground truth solution (reference Verilog code)  
- [x] Ensure compatibility with reward functions (compilation, synthesis, functional correctness, etc.)
- [ ] Keep some for evals (5%-10%)

### Documentation
- [x] Document dataset structure (fields, formatting rules)
- [x] Provide small example subset in repo for reference
- [x] Note augmentation methods used and rationale

---

## Notes
- Initial dataset size: ~20–50 HDLBits problems  
- Augmented to increase volume while retaining diversity  
- Benchmarking targets: **VerilogEval**, **VeriReason Benchmarks**, and **RTL-Repo** (for external validation)  

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

HDLBits Dataset for RLFT [test] #2

Dataset Collection

Dataset Structuring

Documentation

Notes

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

HDLBits Dataset for RLFT [test] #2

Description

Dataset Collection

Dataset Structuring

Documentation

Notes

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions