Eliciting-Latent-Knowledge-in-Comprehensive-AI-Services-Models

A Conceptual Framework and Preliminary Proposals for AI Alignment and Safety in R&D

This is a research report I authored over the summer as part of the CHERI fellowship program, under the supervision of Patrick Levermore. The project explores the complexities of AI alignment, with a specific focus on reinterpreting the Eliciting Latent Knowledge problem through the lens of the Comprehensive AI Services (CAIS) model. Furthermore, I delve into the model's applicability in ensuring R&D design safety and certification.

Refer to the blog post for a summary of the paper.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
Eliciting Latent Knowledge in Comprehensive AI Services Models.pdf		Eliciting Latent Knowledge in Comprehensive AI Services Models.pdf
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Eliciting Latent Knowledge in Comprehensive AI Services Models.pdf

Eliciting Latent Knowledge in Comprehensive AI Services Models.pdf

LICENSE

LICENSE

README.md

README.md

Repository files navigation

Eliciting-Latent-Knowledge-in-Comprehensive-AI-Services-Models

A Conceptual Framework and Preliminary Proposals for AI Alignment and Safety in R&D

About

Releases

Packages

License

alecabodi/Eliciting-Latent-Knowledge-in-Comprehensive-AI-Services-Models

Folders and files

Latest commit

History

Repository files navigation

Eliciting-Latent-Knowledge-in-Comprehensive-AI-Services-Models

A Conceptual Framework and Preliminary Proposals for AI Alignment and Safety in R&D

About

Resources

License

Stars

Watchers

Forks