parameterlab · cemde · Nov 18, 2025
diff --git a/README.md b/README.md
@@ -12,7 +12,7 @@
 [![Python 3.10+](https://img.shields.io/badge/python-3.10%2B-blue.svg)](https://www.python.org/downloads/)
 [![PyPI version](https://badge.fury.io/py/maseval.svg)](https://badge.fury.io/py/maseval)
 [![Documentation](https://img.shields.io/badge/docs-latest-brightgreen.svg)](#)
-[![Tests](https://github.com/cemde/MASEval/actions/workflows/test.yml/badge.svg)](https://github.com/cemde/MASEval/actions/workflows/test.yml)
+[![Tests](https://github.com/parameterlab/MASEval/actions/workflows/test.yml/badge.svg)](https://github.com/parameterlab/MASEval/actions/workflows/test.yml)
 [![License](https://img.shields.io/badge/License-MIT-green.svg)](LICENSE)
 
 MASEval is an evaluation library that provides a unified interface for benchmarking (multi-)agent systems. It offers standardized abstractions for running any agent implementation—whether built with AutoGen, LangChain, custom frameworks, or direct API calls—against established benchmarks like GAIA and AgentBench, or your own custom evaluation tasks.
@@ -63,7 +63,7 @@ Examples are available in the documentiation. TODO add link!
 
 ## Contribute
 
-We welcome any contributions. Please read the [CONTRIBUTING.md](CONTRIBUTING.md) file to learn more!
+We welcome any contributions. Please read the [CONTRIBUTING.md](https://github.com/parameterlab/MASEval/tree/fix-porting-issue?tab=contributing-ov-file) file to learn more!
 
 ## Benchmarks
 

diff --git a/assets/logo-dark.svg b/assets/logo-dark.svg
diff --git a/assets/logo-light.svg b/assets/logo-light.svg
diff --git a/docs/assets/logo_short.svg → assets/logo-short.svg b/docs/assets/logo_short.svg → assets/logo-short.svg
diff --git a/assets/logo.svg b/assets/logo.svg
diff --git a/docs/assets/logo-dark.svg b/docs/assets/logo-dark.svg
@@ -0,0 +1 @@
+../../assets/logo-dark.svg
diff --git a/docs/assets/logo-light.svg b/docs/assets/logo-light.svg
@@ -0,0 +1 @@
+../../assets/logo-light.svg
diff --git a/docs/assets/logo-short.svg b/docs/assets/logo-short.svg
@@ -0,0 +1 @@
+../../assets/logo-short.svg
diff --git a/docs/assets/logo.svg b/docs/assets/logo.svg
@@ -0,0 +1 @@
+../../assets/logo.svg
diff --git a/docs/logo-dark.svg b/docs/logo-dark.svg
diff --git a/docs/logo-light.svg b/docs/logo-light.svg
diff --git a/maseval/core/benchmark.py b/maseval/core/benchmark.py
@@ -834,7 +834,7 @@ def run_agents(self, agents, task, environment):
         """
         pass
 
-    def run(self, tasks: Union[Task, TaskCollection, Iterable[Union[Task, dict]]]):
+    def run(self, tasks: Union[Task, TaskCollection, Iterable[Union[Task, dict]]]) -> List[Dict[str, Any]]:
         """Initialize and execute the complete benchmark loop across all tasks.
 
         Args:

diff --git a/mkdocs.yml b/mkdocs.yml
@@ -47,8 +47,10 @@ plugins:
             show_root_heading: true
             type_parameter_headings: true
             show_source: false
+
             # Hide private members (methods/attributes starting with _)
-            show_private: false
+            extra:
+              show_private: false
 
 nav:
   - Getting Started:
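
Review note on the `run` annotation in `maseval/core/benchmark.py`: the new return type `List[Dict[str, Any]]` needs `List`, `Dict`, and `Any` to be importable from `typing` in that module. The hunk does not show the import block, so it is not visible here whether they already are. The sketch below only illustrates the annotated shape. `Task` and `SketchBenchmark` are hypothetical stand-ins, `TaskCollection` is left out for brevity, and the report keys (`"task"`, `"status"`) are assumptions rather than MASEval's actual output.

```python
from typing import Any, Dict, Iterable, List, Union


class Task:
    """Hypothetical stand-in for maseval's Task, defined only so the sketch runs."""

    def __init__(self, name: str) -> None:
        self.name = name


class SketchBenchmark:
    """Hypothetical stand-in, not the real Benchmark class."""

    def run(self, tasks: Union[Task, Iterable[Union[Task, dict]]]) -> List[Dict[str, Any]]:
        # Wrap a single Task in a list so the loop below handles both cases.
        if isinstance(tasks, Task):
            tasks = [tasks]
        reports: List[Dict[str, Any]] = []
        for task in tasks:
            # Accept Task objects or plain dicts (assumed to carry a "name" key).
            name = task.name if isinstance(task, Task) else task.get("name")
            # Placeholder report; the real report structure is not shown in the diff.
            reports.append({"task": name, "status": "ok"})
        return reports
```

With the annotation in place, callers and type checkers can rely on `run` returning a list with one dictionary per task instead of an untyped value.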