Feature/implement the mentalchat16k dataset support for clinical evaluation #1218
Merged
chakravarthik27 merged 56 commits into release/2.7.0 from feature/implement-the-mentalchat16k-dataset-support-for-clinical-evaluation on Sep 9, 2025
Conversation
This pull request introduces a new mental health evaluation capability to the codebase, enabling the assessment of AI-generated mental health counseling responses against a set of clinical consultation metrics. It adds a new evaluation prompt and schema, a corresponding evaluation class, and integrates this functionality into the clinical test transformation pipeline. The changes also include a new SimplePrompt sample type to support these evaluations and ensure results are parsed and scored appropriately.

Mental Health Evaluation Integration

- Added MENTAL_HEALTH_EVAL_PROMPT and the MHCEvaluation schema in eval_prompts.py to define a structured prompt and scoring rubric for mental health counseling response evaluation.
- Added a RatingEval class in llm_eval.py, which uses the new prompt and schema to parse and score AI responses, including batch evaluation support.
- Added a mental_health test type with a dedicated MentalHealth class that loads data, transforms samples, and runs evaluations using the new prompt and scoring system. [1] [2]
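To make the rubric-and-parse flow concrete, here is a minimal sketch of how a structured evaluation schema and a rating parser could fit together. The metric names, the 1-5 score range, and the clamping behavior are all assumptions for illustration; the actual MHCEvaluation schema and RatingEval class in langtest may differ.

```python
from dataclasses import dataclass
from statistics import mean

# Hypothetical sketch only: the real MHCEvaluation schema in
# eval_prompts.py may use different metrics and field names.
@dataclass
class MHCEvaluation:
    """Scores (1-5) for a counseling response on clinical consultation metrics."""
    active_listening: int
    empathy: int
    safety: int
    clarity: int

    def overall(self) -> float:
        # Aggregate the rubric scores into a single rating.
        return mean([self.active_listening, self.empathy,
                     self.safety, self.clarity])


def parse_rating(raw: dict) -> MHCEvaluation:
    """Parse a judge model's JSON-like output into the schema,
    clamping each score to the assumed 1-5 rubric range."""
    clamp = lambda v: max(1, min(5, int(v)))
    return MHCEvaluation(**{k: clamp(v) for k, v in raw.items()})


result = parse_rating({"active_listening": 4, "empathy": 5,
                       "safety": 7, "clarity": 3})
print(result.overall())  # -> 4.25 (safety clamped to 5)
```

Clamping out-of-range scores keeps a single malformed judge response from skewing batch aggregates, which matters when many samples are evaluated at once.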
Sample Type Extension

- Added a SimplePrompt class to sample.py, designed for prompt-response pairs, with methods for evaluation, scoring, and pass/fail determination using the mental health metrics and the new evaluation pipeline.

Internal Imports and Wiring
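The SimplePrompt sample type described above could be sketched roughly as follows. The field names, the pass threshold, and the all-metrics-must-pass rule are illustrative assumptions, not langtest's actual interface.

```python
from dataclasses import dataclass, field

# Illustrative sketch: langtest's real SimplePrompt class in sample.py
# may expose a different interface; the 3.0 threshold is an assumption.
@dataclass
class SimplePrompt:
    """A prompt-response pair scored against mental-health metrics."""
    prompt: str
    expected_response: str = ""
    actual_response: str = ""
    scores: dict = field(default_factory=dict)
    pass_threshold: float = 3.0  # assumed cutoff on a 1-5 rubric

    def run(self, model) -> None:
        # `model` is any callable that maps a prompt to response text.
        self.actual_response = model(self.prompt)

    def is_pass(self) -> bool:
        # Pass only when every rubric metric meets the threshold;
        # an unscored sample never passes.
        return bool(self.scores) and all(
            s >= self.pass_threshold for s in self.scores.values())


sample = SimplePrompt(prompt="I feel overwhelmed lately.")
sample.run(lambda p: "It sounds like you're carrying a lot right now.")
sample.scores = {"empathy": 4.5, "safety": 5.0, "clarity": 3.5}
print(sample.is_pass())  # -> True
```

Keeping prompt, response, and scores on one sample object lets the test harness serialize pass/fail results uniformly alongside other sample types.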