345M model support
- Support the 345M model (thanks to Neil Shepperd for the gradient checkpointing implementation!)
- Support model_name in the CLI so the 345M model can be selected
- Support run_name in the CLI
- Support .csv files as an input dataset to finetune (will parse the CSV as if it was done via encode_csv()).
- Fix one off issues (#21)
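
For readers wondering what parsing a CSV "as if it was done via encode_csv()" might amount to, here is a minimal sketch. It is not the library's code: the helper name `wrap_csv_rows`, the choice of the first column as the text, the header skipping, and the `<|startoftext|>` / `<|endoftext|>` delimiters are all assumptions made for illustration.

```python
import csv
import io

# Assumed document delimiters; the real defaults may differ.
START_TOKEN = "<|startoftext|>"
END_TOKEN = "<|endoftext|>"


def wrap_csv_rows(csv_text, header=True,
                  start_token=START_TOKEN, end_token=END_TOKEN):
    """Hypothetical helper: turn single-column CSV text into one
    training string, wrapping each row in start/end tokens."""
    reader = csv.reader(io.StringIO(csv_text))
    if header:
        next(reader, None)  # skip the header row if there is one
    # Use the first column of each non-empty row as the document text.
    return "".join(
        f"{start_token}{row[0]}{end_token}\n" for row in reader if row
    )


sample = 'text\nfirst document\n"second, with a comma"\n'
encoded = wrap_csv_rows(sample)
print(encoded)
```

Wrapping each row this way keeps the documents separable after they are concatenated into a single training text, which is the usual reason for passing a CSV rather than a raw text file.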