feat: Add Phase 1 languages (Bash, C, C++, C#, COBOL) to CodeLang #271
Merged
johnnygreco merged 3 commits into main, Jan 31, 2026
Add support for five high-priority programming languages to Data Designer's code generation capabilities:

- **Bash**: Universal DevOps and automation scripting
- **C, C++, C#**: Systems programming and enterprise development
- **COBOL**: Legacy mainframe systems and modernization

These languages address critical enterprise use cases including legacy code maintenance, systems programming, and infrastructure automation.

Changes:

- Add new CodeLang enum values for bash, c, cpp, csharp, cobol
- Update code_lang_to_syntax_lexer() with Pygments lexer mappings
- Update documentation to reflect new supported languages
- Update tests to account for 21 total supported languages (up from 16)
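As a rough illustration of the first two changes, here is a minimal sketch of how the new enum values and their lexer mappings could fit together. It is not the code from this PR: the enum below contains only the five Phase 1 members, the member names other than `BASH` and the `_LEXER_ALIASES` table are assumptions, and the mapping simply reuses the aliases `bash`, `c`, `cpp`, `csharp`, and `cobol`, which Pygments recognizes.

```python
from enum import Enum


class CodeLang(str, Enum):
    """Illustrative subset: only the five Phase 1 values named in this PR."""

    BASH = "bash"
    C = "c"
    CPP = "cpp"
    CSHARP = "csharp"
    COBOL = "cobol"


# Assumed mapping from enum member to Pygments lexer alias.
# The real table in Data Designer may be structured differently.
_LEXER_ALIASES = {
    CodeLang.BASH: "bash",
    CodeLang.C: "c",
    CodeLang.CPP: "cpp",
    CodeLang.CSHARP: "csharp",
    CodeLang.COBOL: "cobol",
}


def code_lang_to_syntax_lexer(code_lang: CodeLang) -> str:
    """Return the lexer alias to hand to a syntax highlighter."""
    return _LEXER_ALIASES[code_lang]


# Usage: a plain string is validated by constructing the enum from it.
lexer_alias = code_lang_to_syntax_lexer(CodeLang("csharp"))
```

Constructing `CodeLang` from an unsupported string raises `ValueError`, which is one simple way to reject unknown languages at configuration time.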
Greptile Overview

Greptile Summary

This PR successfully adds support for 5 new programming languages to DataDesigner's code generation capabilities: Bash, C, C++, C#, and COBOL.

Changes:
Implementation Quality:
Confidence Score: 5/5
Important Files Changed
Sequence Diagram

sequenceDiagram
participant User
participant ConfigBuilder
participant CodeLang
participant CodeResponseRecipe
participant LLMResponseParser
participant Pygments
User->>ConfigBuilder: Create LLMCodeColumnConfig(code_lang="bash")
ConfigBuilder->>CodeLang: Validate "bash" enum value
CodeLang-->>ConfigBuilder: Valid CodeLang.BASH
Note over ConfigBuilder: Configuration stored
User->>ConfigBuilder: Generate dataset
ConfigBuilder->>CodeResponseRecipe: Initialize with syntax="bash"
CodeResponseRecipe->>CodeLang: parse_lang("bash")
CodeLang-->>CodeResponseRecipe: "bash"
CodeResponseRecipe->>CodeResponseRecipe: Set task_instructions with "bash"
CodeResponseRecipe-->>ConfigBuilder: Ready for generation
Note over ConfigBuilder,LLMResponseParser: LLM generates code response
LLMResponseParser->>LLMResponseParser: Extract code from ```bash fence
LLMResponseParser-->>ConfigBuilder: Parsed bash code
opt Syntax Highlighting
ConfigBuilder->>CodeLang: code_lang_to_syntax_lexer(CodeLang.BASH)
CodeLang-->>ConfigBuilder: "bash"
ConfigBuilder->>Pygments: get_lexer_by_name("bash")
Pygments-->>ConfigBuilder: BashLexer instance
end
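
The diagram's "Extract code from bash fence" step can be sketched with the standard library alone. This is a hypothetical helper, not the LLMResponseParser implementation: the function name `extract_fenced_code`, its signature, and the regular expression are all assumptions made for illustration.

```python
import re
from typing import Optional

# Three backticks, built programmatically so this example nests cleanly in Markdown.
FENCE = "`" * 3


def extract_fenced_code(response: str, lang: str) -> Optional[str]:
    """Return the body of the first fenced block tagged `lang`, or None."""
    pattern = re.compile(
        # Opening fence plus language tag, then the body (lazy), then the closing fence.
        rf"{FENCE}{re.escape(lang)}[ \t]*\n(.*?)\n?{FENCE}",
        re.DOTALL,
    )
    match = pattern.search(response)
    return match.group(1) if match else None


# Usage with a made-up LLM response.
sample = f"Here is the script:\n{FENCE}bash\necho hi\n{FENCE}\nDone."
code = extract_fenced_code(sample, "bash")
```

Requiring a newline directly after the language tag keeps a request for `c` from matching a block tagged `cpp` or `csharp`.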
b4bba34 to 37a337f
johnnygreco approved these changes, Jan 30, 2026