---
author: Christian Kaestner and Eunsuk Kang
title: "MLiP: Transparency and Accountability"
semester: Fall 2023
footer: "Machine Learning in Production/AI Engineering • Christian Kaestner & Eunsuk Kang, Carnegie Mellon University • Spring 2023"
license: Creative Commons Attribution 4.0 International (CC BY 4.0)
---

Machine Learning in Production

Transparency and Accountability


Administrativa

  • Will end this lecture at 3pm
  • Homework I4 (Explainability/Transparency) out tonight

More Explainability, Policy, and Politics

Overview of course content


Readings

Required reading:

Recommended reading:


Learning Goals

  • Explain key concepts of transparency and trust
  • Discuss whether and when transparency can be abused to game the system
  • Design a system to include human oversight
  • Understand common concepts and discussions of accountability/culpability
  • Critique regulation and self-regulation approaches in ethical machine learning

Explainability vs Transparency

Explainability & Interpretability: Tools to understand the model, mostly debugging, mostly developer-focused

Transparency: Users know that the algorithm exists and how it works

Transparency is focused on users -- Human-AI interaction, oversight, appeals, audits


Recall: Explaining for Human-AI Interaction, Trust

Reported commit

Goyal, Raman, Gabriel Ferreira, Christian Kästner, and James Herbsleb. "Identifying unusual commits on GitHub." Journal of Software: Evolution and Process 30, no. 1 (2018): e1893.


Recall: Explainability for Auditing

  • Understand safety implications
  • Ensure predictions use objective criteria and reasonable rules
  • Inspect fairness properties
  • Reason about biases and feedback loops
  • Validate "learned specifications/requirements" with stakeholders
IF age between 18–20 and sex is male THEN predict arrest
ELSE IF age between 21–23 and 2–3 prior offenses THEN predict arrest
ELSE IF more than three priors THEN predict arrest
ELSE predict no arrest

Transparency of the Model's Existence



Case Study: Facebook's Feed Curation

Facebook with and without filtering

Eslami, Motahhare, et al. I always assumed that I wasn't really that close to [her]: Reasoning about Invisible Algorithms in News Feeds. In Proc. CHI, 2015.


Case Study: Facebook's Feed Curation

  • 62% of interviewees were not aware of curation algorithm
  • Surprise and anger when learning about curation

"Participants were most upset when close friends and family were not shown in their feeds [...] participants often attributed missing stories to their friends’ decisions to exclude them rather than to Facebook News Feed algorithm."

  • Learning about algorithm did not change satisfaction level
  • More active engagement, more feeling of control

Eslami, Motahhare, et al. I always assumed that I wasn't really that close to [her]: Reasoning about Invisible Algorithms in News Feeds. In Proc. CHI, 2015.


Transparency of How the Model Works


Enabling Oversight and Appeals

Shap force plot
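
Such a force plot is produced per prediction. As a minimal sketch of how one might be generated with the `shap` library (toy data and model assumed here, not from the lecture):

```python
# Minimal sketch (toy data, assumed setup): produce an instance-level
# explanation that a reviewer could inspect when handling an appeal.
import shap
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import GradientBoostingClassifier

X, y = load_breast_cancer(return_X_y=True, as_frame=True)
model = GradientBoostingClassifier().fit(X, y)

explainer = shap.TreeExplainer(model)
shap_values = explainer.shap_values(X)   # per-feature contributions (log-odds)

# Force plot for one specific decision: which features pushed the score
# toward or away from the decision boundary (call shap.initjs() in notebooks).
shap.force_plot(explainer.expected_value, shap_values[0], X.iloc[0])
```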


Human-AI Interaction

Positive example

Negative example

Give the user details about why a prediction was made in a high stakes scenario. Here, the user is exercising after an injury and needs confidence in the app’s recommendation.

Source: People + AI Guidebook, Google


Positive example

Negative example

Tell the user when a lack of data might mean they’ll need to use their own judgment. Don’t be afraid to admit when a lack of data could affect the quality of the AI recommendations.

Source: People + AI Guidebook, Google


Setting: Cancer Imaging -- What explanations do radiologists want?

  • Past attempts often not successful at bringing tools into production. Radiologists do not trust them. Why?
  • Wizard of Oz study to elicit requirements

Shown predictions in prostate cancer study


Radiologists' questions

  • How does it perform compared to human experts?
  • "What is difficult for the AI to know? Where is it too sensitive? What criteria is it good at recognizing or not good at recognizing?"
  • What data (volume, types, diversity) was the model trained on?
  • "Does the AI have access to information that I don’t have? Does it have access to ancillary studies?" Is all used data shown in the UI?
  • What kind of things is the AI looking for? What is it capable of learning? ("Maybe light and dark? Maybe colors? Maybe shapes, lines?", "Does it take into consideration the relationship between gland and stroma? Nuclear relationship?")
  • "Does it have a bias a certain way?" (compared to colleagues)

Radiologists' questions

  • Capabilities and limitations: performance, strength, limitations; e.g. how does it handle well-known edge cases
  • Functionality: What data used for predictions, how much context, how data is used
  • Medical point-of-view: calibration, how liberal/conservative when grading cancer severity
  • Design objectives: Designed for few false positives or false negatives? Tuned to compensate for human error?
  • Other considerations: legal liability, impact on workflow, cost of use

Further details: Paper, Table 1


Radiologists Study Insights

AI literacy important for trust

Be transparent about data used

Describe training data and capabilities

Give mental model, examples, human-relatable test cases

Communicate the AI’s point-of-view and design goal

Cai, Carrie J., Samantha Winter, David Steiner, Lauren Wilcox, and Michael Terry. ""Hello AI": Uncovering the Onboarding Needs of Medical Practitioners for Human-AI Collaborative Decision-Making." Proceedings of the ACM on Human-computer Interaction 3, no. CSCW (2019): 1-24.


Designing Transparency

Be explicit about the goal

Tailor explanation to specific user needs and user's AI literacy

Partial explanations or justifications often sufficient

Test effectiveness of transparency mechanisms


The Dark Side of Transparency


Many explanations are wrong

Approximations of black-box models, often unstable

Explanations necessarily partial, social

Often multiple explanations possible (Rashomon effect)

Possible to use inherently interpretable models instead?

When explanation desired/required: What quality is needed/acceptable?
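
One way to make the instability point concrete in class: ask a post-hoc explainer twice about the same prediction. A small sketch using the `lime` package (toy data, assumed setup, not from the lecture):

```python
# Sketch (assumed setup): LIME fits a local surrogate from random samples,
# so repeated runs on the same prediction may rank features differently.
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import RandomForestClassifier
from lime.lime_tabular import LimeTabularExplainer

data = load_breast_cancer()
model = RandomForestClassifier(random_state=0).fit(data.data, data.target)

explainer = LimeTabularExplainer(
    data.data,
    feature_names=list(data.feature_names),
    class_names=list(data.target_names),
)

# Explain the same instance twice; the top features and weights can differ,
# illustrating that such explanations are (unstable) approximations.
for _ in range(2):
    exp = explainer.explain_instance(data.data[0], model.predict_proba, num_features=3)
    print(exp.as_list())
```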


Explanations foster Trust

Users are less likely to question the model when explanations provided

  • Even if explanations are unreliable
  • Even if explanations are nonsensical/incomprehensible

Danger of overtrust and intentional manipulation

Stumpf, Simone, Adrian Bussone, and Dympna O'Sullivan. "Explanations considered harmful? User interactions with machine learning systems." In Proceedings of the ACM SIGCHI Conference on Human Factors in Computing Systems (CHI). 2016.


Paper screenshot of experiment user interface

Springer, Aaron, Victoria Hollis, and Steve Whittaker. "Dice in the black box: User experiences with an inscrutable algorithm." In 2017 AAAI Spring Symposium Series. 2017.


3 Conditions of the experiment with different explanation designs

(a) Rationale, (b) Stating the prediction, (c) Numerical internal values

Observation: Both experts and non-experts overtrust numerical explanations, even when inscrutable.

Ehsan, Upol, Samir Passi, Q. Vera Liao, Larry Chan, I. Lee, Michael Muller, and Mark O. Riedl. "The who in explainable AI: how AI background shapes perceptions of AI explanations." arXiv preprint arXiv:2107.13509 (2021).


Illusion of Control

Users may feel influence and control, even with placebo controls

Study of sensemaking: how humans interpret machine filters and the controls they have over them

Vaccaro, Kristen, Dylan Huang, Motahhare Eslami, Christian Sandvig, Kevin Hamilton, and Karrie Karahalios. "The illusion of control: Placebo effects of control settings." In Proc CHI, 2018.


Regulatory Compliance

Companies give vague generic explanations to appease regulators

Checkbox compliance: Provide some mechanism without ensuring effectiveness

Example: The Fair Credit Reporting Act requires explanations for declined credit applications -- explanations are generic and ineffective for fighting discrimination; at most they ensure that the input data was correct

Selbst, Andrew D., and Solon Barocas. "The intuitive appeal of explainable machines." Fordham L. Rev. 87 (2018): 1085.


Gaming/Attacking the Model with Explanations?

Does providing an explanation allow customers to 'hack' the system?

  • Loan applications?
  • Apple FaceID?
  • Recidivism?
  • Auto grading?
  • Cancer diagnosis?
  • Spam detection?

Gaming the Model with Explanations?

Course assessment does not align with learning goals, leading to shallow learning


Constructive Alignment in Teaching

Course assessment does align with learning goals, leading to better learning

see also Claus Brabrand. Teaching Teaching & Understanding Understanding. Youtube 2009


Gaming the Model with Explanations?

  • A model prone to gaming uses weak proxy features
  • Protection requires making the model hard to observe (e.g., expensive to query for predictions)
  • Protecting models akin to "security by obscurity"
  • Good models rely on hard facts that relate causally to the outcome <- hard to game
IF age between 18–20 and sex is male THEN predict arrest
ELSE IF age between 21–23 and 2–3 prior offenses THEN predict arrest
ELSE IF more than three priors THEN predict arrest
ELSE predict no arrest

Human Oversight and Appeals


Human Oversight and Appeals

  • Unavoidable that ML models will make mistakes
  • Users knowing about the model may not be comforting
  • Inability to appeal a decision can be deeply frustrating

Capacity to keep humans in the loop?

ML used because human decisions are a bottleneck

ML used because human decisions are biased and inconsistent

Do we have the capacity to handle complaints/appeals?

Wouldn't reintroducing humans bring back biases and inconsistencies?


Designing Human Oversight

Consider the entire system and consequences of mistakes

Deliberately design mitigation strategies for handling mistakes

Consider keeping humans in the loop, balancing harms and costs

  • Provide pathways to appeal/complain? Respond to complaints?
  • Review mechanisms? Can humans override tool decisions? (see the sketch after this list)
  • Tracking telemetry, investigating common mistakes?
  • Audit model and decision process rather than appeal individual outcomes?
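
As a concrete illustration of such a review mechanism, a minimal sketch (all names hypothetical, not a prescribed design) that routes low-confidence predictions and appealed decisions to a human review queue instead of acting on the model output directly:

```python
# Hypothetical sketch: human-in-the-loop routing for an ML-backed decision.
# Low-confidence predictions and appealed cases go to a review queue where
# a human can override the model; cases are kept for later audits.
from dataclasses import dataclass

CONFIDENCE_THRESHOLD = 0.9  # assumed value; balance harms of mistakes vs. review cost

@dataclass
class Decision:
    outcome: str       # e.g., "admit" / "reject" / "pending human review"
    decided_by: str    # "model" or "human"
    confidence: float

def decide(features, model, review_queue, appealed=False):
    # model is assumed to follow the scikit-learn predict/predict_proba API
    confidence = float(max(model.predict_proba([features])[0]))
    outcome = str(model.predict([features])[0])
    if appealed or confidence < CONFIDENCE_THRESHOLD:
        # Defer to a human reviewer; log the case for auditing common mistakes.
        review_queue.append({"features": features, "model_outcome": outcome,
                             "confidence": confidence})
        return Decision("pending human review", "human", confidence)
    return Decision(outcome, "model", confidence)
```

The threshold and the choice of which cases reach a human are design decisions that follow from the harms and costs weighed above.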

Breakout: Transparency in Admissions

For an automated Master's admissions support system, consider what you would make transparent and to whom.

In groups, tagging group members, respond in #lecture:

  • What information (global, local) would you provide to applicants? What's the purpose?
  • What information (global, local) would you provide to the admissions committee? What's the purpose?

Accountability and Culpability

Who is held accountable if things go wrong?


On Terminology

  • accountability, responsibility, liability, and culpability all overlap in common use
  • often about assigning blame -- responsible for fixing or liable for paying for damages
  • liability, culpability have legal connotation
  • responsibility tends to describe ethical aspirations
  • accountability often defined as oversight relationship, where actor is accountable to some "forum" that can impose penalties
  • see also legal vs ethical earlier

Random letters


On Terminology

Academic definition of accountability:

A relationship between an actor and a forum, in which the actor has an obligation to explain and to justify his or her conduct, the forum can pose questions and pass judgement, and the actor may face consequences.

That is, accountability implies some oversight with the ability to penalize

Wieringa, Maranke. "What to account for when accounting for algorithms: a systematic literature review on algorithmic accountability." In Proceedings of the Conference on Fairness, Accountability, and Transparency, pp. 1-18. 2020.

Random letters


Who is responsible?

teen-suicide-rate


Who is responsible?

News headline: How US surveillance technology is propping up authoritarian regimes


Who is responsible?

Weapons robot


Who is responsible?

Faceswap github webpage


Faceswap's README "FaceSwap has ethical uses"

[...] as is so often the way with new technology emerging on the internet, it was immediately used to create inappropriate content.

[...] it was the first AI code that anyone could download, run and learn by experimentation without having a Ph.D. in math, computer theory, psychology, and more. Before "deepfakes" these techniques were like black magic, only practiced by those who could understand all of the inner workings as described in esoteric and endlessly complicated books and papers.

[...] the release of this code opened up a fantastic learning opportunity.

Are there some out there doing horrible things with similar software? Yes. And because of this, the developers have been following strict ethical standards. Many of us don't even use it to create videos, we just tinker with the code to see what it does. [...]

FaceSwap is not for creating inappropriate content. FaceSwap is not for changing faces without consent or with the intent of hiding its use. FaceSwap is not for any illicit, unethical, or questionable purposes. [...]


THE SOFTWARE IS PROVIDED “AS IS”, WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.

Note: Software engineers have (mostly) gotten away with declaring themselves not liable


Easy to Blame "The Algorithm" / "The Data" / "Software"

"Just a bug, things happen, nothing we could have done"

  • But system was designed by humans
  • But humans did not anticipate possible mistakes, did not design to mitigate mistakes
  • But humans made decisions about what quality was good enough
  • But humans designed/ignored the development process
  • But humans gave/sold poor quality software to other humans
  • But humans used the software without understanding it
  • ...

Stack Overflow survey on who is responsible

Results from the 2018 Stack Overflow Survey


What to do?

  • Responsible organizations embed risk analysis, quality control, and ethical considerations into their process
  • Establish and communicate policies defining responsibilities
  • Work from aspirations toward culture change: baseline awareness + experts
  • Document tradeoffs and decisions (e.g., datasheets, model cards)
  • Continuous learning
  • Consider controlling/restricting how software may be used, whether it should be built at all
  • And... follow the law
  • Get started with existing guidelines, e.g., in AI Ethics Guidelines

(Self-)Regulation and Policy


Self regulation of tech companies on facial recognition


Responsible AI website from Microsoft


Policy Discussion and Framing

Self-regulation vs government regulation? Assuring safety vs fostering innovation?




"Wishful Worries"

We are distracted with worries about fairness and safety of hypothetical systems

Most systems fail because they don't work in the first place: they don't actually solve a real problem, or they attempt impossible tasks

Wouldn't help even if they solved the given problem (e.g., predictive policing?)

Raji, Inioluwa Deborah, I. Elizabeth Kumar, Aaron Horowitz, and Andrew Selbst. "The fallacy of AI functionality." In 2022 ACM Conference on Fairness, Accountability, and Transparency, pp. 959-972. 2022.


Forbes Article: This Is The Year Of AI Regulations


“Accelerating America’s Leadership in Artificial Intelligence”

“the policy of the United States Government [is] to sustain and enhance the scientific, technological, and economic leadership position of the United States in AI.” -- White House Executive Order Feb. 2019

Tone: "When in doubt, the government should not regulate AI."

Note:

    1. Setting AI Governance Standards: "foster public trust in AI systems by establishing guidance for AI development. [...] help Federal regulatory agencies develop and maintain approaches for the safe and trustworthy creation and adoption of new AI technologies. [...] NIST to lead the development of appropriate technical standards for reliable, robust, trustworthy, secure, portable, and interoperable AI systems."

EU AI Act

Broad regulatory framework, passed March 13, 2024

Risk-based framework:

  • Unacceptable risk: Banned applications for social scoring, cognitive manipulation, real-time biometric surveillance, ...
  • High risk: Registration, (ongoing) risk assessment, data governance, monitoring, documentation, transparency, human oversight, and assurances for robustness, security, and accuracy for applications for hiring, health, law enforcement, ...
  • Limited risk: Limited transparency (model's existence) obligations, e.g., image generation
  • Minimal risk: Not regulated, e.g., video games, spam filters

2023 WH Executive Order on the Safe, Secure, and Trustworthy Development and Use of Artificial Intelligence

Instructs agencies to study risks and develop standards

Broad scope, touches on quality assurance standards (incl. red teaming) and marking AI-generated content

Domain-specific guidelines for chemical, biological, radiological, nuclear, and cybersecurity risks to be explored

Committees, reports, guidance, research instead of enforceable rules


Call for Transparent and Audited Models

"no black box should be deployed when there exists an interpretable model with the same level of performance"

For high-stakes decisions

  • ... with government involvement (recidivism, policing, city planning, ...)
  • ... in medicine
  • ... with discrimination concerns (hiring, loans, housing, ...)
  • ... that influence society and discourse? (algorithmic content amplifications, targeted advertisement, ...)

Regulate possible conflict: Intellectual property vs public welfare

Rudin, Cynthia. "Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead." Nature Machine Intelligence 1.5 (2019): 206-215. (Preprint)
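
As a rough illustration of the argument (toy data and the standard scikit-learn API, not from Rudin's paper): an inherently interpretable model whose complete decision logic can be printed and audited, rather than approximated after the fact.

```python
# Sketch (toy example): train a small, inherently interpretable model and
# export its complete rule set for stakeholder review and audit.
from sklearn.datasets import load_breast_cancer
from sklearn.tree import DecisionTreeClassifier, export_text

X, y = load_breast_cancer(return_X_y=True, as_frame=True)
model = DecisionTreeClassifier(max_depth=3).fit(X, y)  # depth cap keeps rules readable

# Unlike a post-hoc approximation of a black box, these rules ARE the model.
print(export_text(model, feature_names=list(X.columns)))
```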


Criticism: Ethics Washing, Ethics Bashing, Regulatory Capture


Summary

  • Transparency goes beyond explaining predictions
  • Plan for mistakes and human oversight
  • Accountability and culpability are hard to capture, little regulation
  • Be a responsible engineer, adopt a culture of responsibility
  • Regulations may be coming

Further Readings