[Misc] Add model config parsing support for more model types by beiguo218 · Pull Request #296 · sgl-project/ome

beiguo218 · 2025-10-17T20:02:08Z

What type of PR is this?

/kind feature

What this PR does / why we need it:

Added support for the following additional model types so that their model configuration files can be parsed properly:

gemma3, gemma3_text
phi3, phi3small, phimoe
qwen, qwen2_5_vl, qwen3, qwen3_moe, qwen3_vl_moe

Added the missing config field (AutoMap) to some model config structs.
Added unit tests and sample config files.
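For readers unfamiliar with the field, here is a minimal sketch of how an AutoMap field could be parsed, assuming it mirrors the `auto_map` object of a Hugging Face `config.json`. The Go field names (AutoConfig, AutoModel, AutoModelForCausalLM) are taken from the tests in this PR; the JSON tags, the `exampleConfig` wrapper, and `parseExampleConfig` are hypothetical and may differ from the actual structs.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// AutoMap mirrors the "auto_map" object of a Hugging Face config.json.
// Field names follow the tests in this PR; the JSON tags are an assumption.
type AutoMap struct {
	AutoConfig           string `json:"AutoConfig,omitempty"`
	AutoModel            string `json:"AutoModel,omitempty"`
	AutoModelForCausalLM string `json:"AutoModelForCausalLM,omitempty"`
}

// exampleConfig is a hypothetical stand-in for one of the model config structs.
type exampleConfig struct {
	ModelType string `json:"model_type"`
	// A pointer lets callers tell "auto_map absent" (nil) from "present but empty".
	AutoMap *AutoMap `json:"auto_map,omitempty"`
}

// parseExampleConfig decodes raw config.json bytes into exampleConfig.
func parseExampleConfig(data []byte) (*exampleConfig, error) {
	var cfg exampleConfig
	if err := json.Unmarshal(data, &cfg); err != nil {
		return nil, err
	}
	return &cfg, nil
}

func main() {
	raw := []byte(`{
		"model_type": "deepseek_v3",
		"auto_map": {
			"AutoConfig": "configuration_deepseek.DeepseekV3Config",
			"AutoModel": "modeling_deepseek.DeepseekV3Model",
			"AutoModelForCausalLM": "modeling_deepseek.DeepseekV3ForCausalLM"
		}
	}`)
	cfg, err := parseExampleConfig(raw)
	if err != nil {
		fmt.Println("parse error:", err)
		return
	}
	fmt.Println(cfg.AutoMap.AutoConfig)
}
```

Using a pointer for the field is what makes a nil check like the ones in the tests below meaningful.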

Does this PR introduce a user-facing change?

Users of the model agent, or of the ome agent's model metadata, now get properly parsed model configurations for more model types.

gemini-code-assist · 2025-10-17T20:02:40Z

Summary of Changes

Hello @beiguo218, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request significantly enhances the model configuration parsing capabilities of the system by integrating support for numerous contemporary Hugging Face models. The changes involve adding new configuration structures, updating existing ones to accommodate model-specific parameters like flexible token IDs and advanced RoPE scaling, and introducing the AutoMap field for improved custom model loading. This expansion ensures that model agents and metadata processing can accurately interpret and utilize a broader spectrum of model configurations, improving overall compatibility and functionality.

Highlights

Expanded Model Support: Added comprehensive parsing support for a wide range of new Hugging Face model types, including gemma3, gemma3_text, phi3, phi3small, phimoe, qwen, qwen2_5_vl, qwen3, qwen3_moe, and qwen3_vl_moe.
AutoMap Integration: Introduced the AutoMap field into several model configuration structs, enabling better support for custom model loading mechanisms often required by Hugging Face models.
Gemma Configuration Refinements: Updated the existing GemmaConfig to handle flexible eos_token_id types (int or array) and incorporated gemma3_text specific fields like RopeLocalBaseFreq and SlidingWindowPattern.
Qwen2-VL Enhancements: Modified Qwen2VisionConfig to properly parse fields specific to qwen2_5_vl models, such as IntermediateSize, OutHiddenSize, and WindowSize, along with the introduction of Qwen2MRopeScaling.
New Unit Tests and Sample Data: Included new unit tests and corresponding sample JSON configuration files for all newly supported model types, ensuring robust and accurate parsing.
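The "flexible eos_token_id (int or array)" handling mentioned in the highlights can be sketched with a custom JSON unmarshaler. This is only an illustration of the general technique: the `TokenIDs` type, the `exampleGemmaConfig` struct, and `parseEOS` are hypothetical names, the token IDs are sample values, and the PR may implement this differently.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// TokenIDs is a hypothetical type that accepts either a single integer
// or an array of integers in JSON and always stores a slice.
type TokenIDs []int

// UnmarshalJSON tries the scalar form first, then the array form.
func (t *TokenIDs) UnmarshalJSON(data []byte) error {
	// Treat JSON null as "not set" instead of decoding it as 0.
	if string(data) == "null" {
		return nil
	}
	var single int
	if err := json.Unmarshal(data, &single); err == nil {
		*t = TokenIDs{single}
		return nil
	}
	var many []int
	if err := json.Unmarshal(data, &many); err != nil {
		return fmt.Errorf("eos_token_id must be an int or an array of ints: %w", err)
	}
	*t = many
	return nil
}

// exampleGemmaConfig is a hypothetical, heavily trimmed config struct.
type exampleGemmaConfig struct {
	EosTokenID TokenIDs `json:"eos_token_id"`
}

// parseEOS decodes a config document and returns its eos_token_id values.
func parseEOS(data []byte) (TokenIDs, error) {
	var cfg exampleGemmaConfig
	if err := json.Unmarshal(data, &cfg); err != nil {
		return nil, err
	}
	return cfg.EosTokenID, nil
}

func main() {
	for _, raw := range []string{
		`{"eos_token_id": 1}`,
		`{"eos_token_id": [1, 106]}`,
	} {
		ids, err := parseEOS([]byte(raw))
		if err != nil {
			fmt.Println("parse error:", err)
			continue
		}
		fmt.Println(len(ids), ids)
	}
}
```

Normalizing both forms to a slice means callers never need a type switch; a config with a single ID simply yields a one-element slice.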

¹ Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution.

gemini-code-assist

Code Review

This pull request significantly expands model support by adding configuration parsing for several new model families, including Gemma, Phi-3, and Qwen variants. The changes are well-structured, with new files for each model and comprehensive unit tests. I've identified a few minor issues in the new tests, such as incorrect comments and a bug in an error message. Additionally, I've suggested some improvements to the test code structure to enhance readability and reduce code duplication, which is particularly beneficial given the number of new test files added.

gemini-code-assist · 2025-10-17T20:04:47Z

pkg/hfutil/modelconfig/qwen3_moe_test.go

+	// Check parameter count (should be approximately 7B)
+	paramCount := config.GetParameterCount()
+	expectedCount := int64(30_000_000_000) // 7B parameters
+	if paramCount != expectedCount {
+		t.Errorf("Expected parameter count to be %d, but got %d", expectedCount, paramCount)
+	}
+
+	// Check RoPE theta value (specific to Qwen3)
+	if qwen3MoeConfig.RopeTheta != 10000000.0 {
+		t.Errorf("Expected RoPE theta to be 5000000.0, but got %f", qwen3MoeConfig.RopeTheta)
+	}


There are a couple of issues in this test case:

The comment for the parameter count check is misleading. It says // 7B parameters but the expected value is 30B. The comment should be updated to reflect the correct expected value.

The error message for the RopeTheta check has an incorrect expected value (5000000.0 instead of 10000000.0). This could cause confusion when debugging test failures.

Suggested change

-	// Check parameter count (should be approximately 7B)
-	paramCount := config.GetParameterCount()
-	expectedCount := int64(30_000_000_000) // 7B parameters
-	if paramCount != expectedCount {
-		t.Errorf("Expected parameter count to be %d, but got %d", expectedCount, paramCount)
-	}
-
-	// Check RoPE theta value (specific to Qwen3)
-	if qwen3MoeConfig.RopeTheta != 10000000.0 {
-		t.Errorf("Expected RoPE theta to be 5000000.0, but got %f", qwen3MoeConfig.RopeTheta)
-	}
+	// Check parameter count (should be approximately 30B)
+	paramCount := config.GetParameterCount()
+	expectedCount := int64(30_000_000_000) // 30B parameters
+	if paramCount != expectedCount {
+		t.Errorf("Expected parameter count to be %d, but got %d", expectedCount, paramCount)
+	}
+
+	// Check RoPE theta value (specific to Qwen3)
+	if qwen3MoeConfig.RopeTheta != 10000000.0 {
+		t.Errorf("Expected RoPE theta to be 10000000.0, but got %f", qwen3MoeConfig.RopeTheta)
+	}
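One way to keep an error message from drifting away from the value actually compared, as happened with the RoPE theta message in this hunk, is to build the message from the same variable used in the comparison. The sketch below shows the idea with a hypothetical `checkFloat` helper; it is not part of the PR.

```go
package main

import "fmt"

// checkFloat is a hypothetical helper: it returns an empty string when
// got equals want, and otherwise a message built from the same want value
// that was used in the comparison, so the two cannot disagree.
func checkFloat(name string, got, want float64) string {
	if got == want {
		return ""
	}
	return fmt.Sprintf("Expected %s to be %f, but got %f", name, want, got)
}

func main() {
	// Matching value: no message is produced.
	fmt.Printf("%q\n", checkFloat("RoPE theta", 10000000.0, 10000000.0))
	// Mismatch: the message reports the expected value that was compared.
	fmt.Println(checkFloat("RoPE theta", 5000000.0, 10000000.0))
}
```

In a test this would be used as `if msg := checkFloat("RoPE theta", qwen3MoeConfig.RopeTheta, 10000000.0); msg != "" { t.Error(msg) }`, which mirrors how the parameter count check already formats `expectedCount` instead of hard-coding it in the message.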

gemini-code-assist · 2025-10-17T20:04:47Z

pkg/hfutil/modelconfig/deepseek_v3_test.go

+	if config.AutoMap == nil {
+		t.Error("Expected auto_map to be parsed, but it is nil")
+	} else {
+		expectedAutoConfig := "configuration_deepseek.DeepseekV3Config"
+		if config.AutoMap.AutoConfig != expectedAutoConfig {
+			t.Errorf("Expected AutoConfig to be '%s', but got '%s'", expectedAutoConfig, config.AutoMap.AutoConfig)
+		}
+
+		expectedAutoModel := "modeling_deepseek.DeepseekV3ForCausalLM"
+		if config.AutoMap.AutoModelForCausalLM != expectedAutoModel {
+			t.Errorf("Expected AutoModelForCausalLM to be '%s', but got '%s'", expectedAutoModel, config.AutoMap.AutoModelForCausalLM)
+		}
+
+		expectedAutoModelBase := "modeling_deepseek.DeepseekV3Model"
+		if config.AutoMap.AutoModel != expectedAutoModelBase {
+			t.Errorf("Expected AutoModel to be '%s', but got '%s'", expectedAutoModelBase, config.AutoMap.AutoModel)
+		}
+	}


The test for auto_map parsing can be made more concise and robust. Using t.Fatal when config.AutoMap is nil will correctly stop the test execution, as subsequent checks would panic. Also, structuring the checks for each field, for example using a table-driven subtest, improves readability and makes it easier to add more checks in the future.

if config.AutoMap == nil {
	t.Fatal("Expected auto_map to be parsed, but it is nil")
}

t.Run("AutoMap values", func(t *testing.T) {
	testCases := []struct {
		name     string
		got      string
		expected string
	}{
		{"AutoConfig", config.AutoMap.AutoConfig, "configuration_deepseek.DeepseekV3Config"},
		{"AutoModelForCausalLM", config.AutoMap.AutoModelForCausalLM, "modeling_deepseek.DeepseekV3ForCausalLM"},
		{"AutoModel", config.AutoMap.AutoModel, "modeling_deepseek.DeepseekV3Model"},
	}

	for _, tc := range testCases {
		t.Run(tc.name, func(t *testing.T) {
			if tc.got != tc.expected {
				t.Errorf("expected %q, got %q", tc.expected, tc.got)
			}
		})
	}
})

gemini-code-assist · 2025-10-17T20:04:47Z

pkg/hfutil/modelconfig/gemma3_test.go

+func TestLoadGemma3Config(t *testing.T) {
+	configPath := filepath.Join("testdata", "gemma3.json")
+
+	// Load the config
+	config, err := LoadGemma3Config(configPath)
+	if err != nil {
+		t.Fatalf("Failed to load Gemma3 config: %v", err)
+	}
+
+	// Check that it's the correct model type
+	if config.GetModelType() != "gemma3" {
+		t.Errorf("Expected model type 'gemma3' but got '%s'", config.GetModelType())
+	}
+}


There's a lot of repeated code for loading the test configuration in this file (and other new test files in this PR). To improve maintainability and reduce duplication, consider creating a helper function to load the Gemma3Config for tests. This pattern can be applied to the other new test files as well (e.g., phi3_test.go, phi3small_test.go, etc.).

func loadGemma3TestConfig(t *testing.T) *Gemma3Config {
	t.Helper()
	configPath := filepath.Join("testdata", "gemma3.json")
	config, err := LoadGemma3Config(configPath)
	if err != nil {
		t.Fatalf("Failed to load Gemma3 config: %v", err)
	}
	return config
}

func TestLoadGemma3Config(t *testing.T) {
	config := loadGemma3TestConfig(t)

	// Check that it's the correct model type
	if config.GetModelType() != "gemma3" {
		t.Errorf("Expected model type 'gemma3' but got '%s'", config.GetModelType())
	}
}

gemini-code-assist · 2025-10-17T20:04:48Z

pkg/hfutil/modelconfig/phi3_v_test.go

+	// Check auto_map parsing
+	if phi3Config.AutoMap == nil {
+		t.Error("Expected auto_map to be parsed, but it is nil")
+	} else {
+		expectedAutoConfig := "configuration_phi3_v.Phi3VConfig"
+		if phi3Config.AutoMap.AutoConfig != expectedAutoConfig {
+			t.Errorf("Expected AutoConfig to be '%s', but got '%s'", expectedAutoConfig, phi3Config.AutoMap.AutoConfig)
+		}
+
+		expectedAutoModel := "modeling_phi3_v.Phi3VForCausalLM"
+		if phi3Config.AutoMap.AutoModelForCausalLM != expectedAutoModel {
+			t.Errorf("Expected AutoModelForCausalLM to be '%s', but got '%s'", expectedAutoModel, phi3Config.AutoMap.AutoModelForCausalLM)
+		}
+	}


This test for auto_map can be improved for conciseness and robustness. Using t.Fatal when phi3Config.AutoMap is nil is better as it stops the test immediately. Also, structuring the checks for each field makes the test cleaner.

// Check auto_map parsing
if phi3Config.AutoMap == nil {
	t.Fatal("Expected auto_map to be parsed, but it is nil")
}
if got, want := phi3Config.AutoMap.AutoConfig, "configuration_phi3_v.Phi3VConfig"; got != want {
	t.Errorf("AutoConfig: got %q, want %q", got, want)
}
if got, want := phi3Config.AutoMap.AutoModelForCausalLM, "modeling_phi3_v.Phi3VForCausalLM"; got != want {
	t.Errorf("AutoModelForCausalLM: got %q, want %q", got, want)
}

gemini-code-assist · 2025-10-17T20:04:48Z

pkg/hfutil/modelconfig/qwen3_test.go

+	// Check parameter count (should be approximately 7B)
+	paramCount := config.GetParameterCount()
+	expectedCount := int64(4_000_000_000) // 7B parameters
+	if paramCount != expectedCount {
+		t.Errorf("Expected parameter count to be %d, but got %d", expectedCount, paramCount)
+	}


The comment for the parameter count check is misleading. It says // 7B parameters but the expected value is 4B. Please update the comment to match the expected value.

Suggested change

-	// Check parameter count (should be approximately 7B)
-	paramCount := config.GetParameterCount()
-	expectedCount := int64(4_000_000_000) // 7B parameters
-	if paramCount != expectedCount {
-		t.Errorf("Expected parameter count to be %d, but got %d", expectedCount, paramCount)
-	}
+	// Check parameter count (should be approximately 4B)
+	paramCount := config.GetParameterCount()
+	expectedCount := int64(4_000_000_000) // 4B parameters
+	if paramCount != expectedCount {
+		t.Errorf("Expected parameter count to be %d, but got %d", expectedCount, paramCount)
+	}

gemini-code-assist · 2025-10-17T20:04:48Z

pkg/hfutil/modelconfig/qwen3_vl_test.go

+	// Check parameter count (should be approximately 7B)
+	paramCount := config.GetParameterCount()
+	expectedCount := int64(235_000_000_000) // 7B parameters
+	if paramCount != expectedCount {
+		t.Errorf("Expected parameter count to be %d, but got %d", expectedCount, paramCount)
+	}


The comment for the parameter count check is misleading. It says // 7B parameters but the expected value is 235B. Please update the comment to match the expected value.

Suggested change

-	// Check parameter count (should be approximately 7B)
-	paramCount := config.GetParameterCount()
-	expectedCount := int64(235_000_000_000) // 7B parameters
-	if paramCount != expectedCount {
-		t.Errorf("Expected parameter count to be %d, but got %d", expectedCount, paramCount)
-	}
+	// Check parameter count (should be approximately 235B)
+	paramCount := config.GetParameterCount()
+	expectedCount := int64(235_000_000_000) // 235B parameters
+	if paramCount != expectedCount {
+		t.Errorf("Expected parameter count to be %d, but got %d", expectedCount, paramCount)
+	}

[Misc] Add model config parsing support for more model types

6569ba6

beiguo218 requested a review from slin1237 as a code owner October 17, 2025 20:02

github-actions bot added the feature label Oct 17, 2025

gemini-code-assist bot reviewed Oct 17, 2025

slin1237 approved these changes Oct 17, 2025

slin1237 merged commit d9492cf into main Oct 17, 2025
23 checks passed

zhyncs deleted the add-more-model-type-support branch November 3, 2025 05:09

slin1237 pushed a commit that referenced this pull request Dec 22, 2025

[Misc] Add model config parsing support for more model types (#296)

9a27959
