---
author: Christian Kaestner and Eunsuk Kang
title: "MLiP: Transparency and Accountability"
semester: Fall 2023
footer: "Machine Learning in Production/AI Engineering • Christian Kaestner & Eunsuk Kang, Carnegie Mellon University • Spring 2023"
license: Creative Commons Attribution 4.0 International (CC BY 4.0)
---

Machine Learning in Production

Transparency and Accountability


Administrativa

  • Will end this lecture at 3pm
  • Homework I4 (Explainability/Transparency) out tonight

More Explainability, Policy, and Politics

Overview of course content


Readings

Required reading:

Recommended reading:


Learning Goals

  • Explain key concepts of transparency and trust
  • Discuss whether and when transparency can be abused to game the system
  • Design a system to include human oversight
  • Understand common concepts and discussions of accountability/culpability
  • Critique regulation and self-regulation approaches in ethical machine learning

Explainability vs Transparency

Explainability & Interpretability: Tools to understand the model, mostly debugging, mostly developer-focused

Transparency: Users know that the algorithm exists and how it works

Transparency is focused on users -- Human-AI interaction, oversight, appeals, audits


Recall: Explaining for Human-AI Interaction, Trust

Reported commit

Goyal, Raman, Gabriel Ferreira, Christian Kästner, and James Herbsleb. "Identifying unusual commits on GitHub." Journal of Software: Evolution and Process 30, no. 1 (2018): e1893.


Recall: Explainability for Auditing

  • Understand safety implications
  • Ensure predictions use objective criteria and reasonable rules
  • Inspect fairness properties
  • Reason about biases and feedback loops
  • Validate "learned specifications/requirements" with stakeholders
IF age between 18–20 and sex is male THEN predict arrest
ELSE IF age between 21–23 and 2–3 prior offenses THEN predict arrest
ELSE IF more than three priors THEN predict arrest
ELSE predict no arrest

Transparency of the Model's Existence



Case Study: Facebook's Feed Curation

Facebook with and without filtering

Eslami, Motahhare, et al. I always assumed that I wasn't really that close to [her]: Reasoning about Invisible Algorithms in News Feeds. In Proc. CHI, 2015.


Case Study: Facebook's Feed Curation

  • 62% of interviewees were not aware of curation algorithm
  • Surprise and anger when learning about curation

"Participants were most upset when close friends and family were not shown in their feeds [...] participants often attributed missing stories to their friends’ decisions to exclude them rather than to Facebook News Feed algorithm."

  • Learning about algorithm did not change satisfaction level
  • More active engagement, more feeling of control

Eslami, Motahhare, et al. I always assumed that I wasn't really that close to [her]: Reasoning about Invisible Algorithms in News Feeds. In Proc. CHI, 2015.


Transparency of How the Model Works


Enabling Oversight and Appeals

Shap force plot
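
Such a force plot is produced per prediction. As a minimal sketch of how one might be generated with the `shap` library (toy data and model assumed here, not from the lecture):

```python
# Minimal sketch (toy data, assumed setup): produce an instance-level
# explanation that a reviewer could inspect when handling an appeal.
import shap
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import GradientBoostingClassifier

X, y = load_breast_cancer(return_X_y=True, as_frame=True)
model = GradientBoostingClassifier().fit(X, y)

explainer = shap.TreeExplainer(model)
shap_values = explainer.shap_values(X)   # per-feature contributions (log-odds)

# Force plot for one specific decision: which features pushed the score
# toward or away from the decision boundary (call shap.initjs() in notebooks).
shap.force_plot(explainer.expected_value, shap_values[0], X.iloc[0])
```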


Human-AI Interaction

Positive example

Negative example

Give the user details about why a prediction was made in a high stakes scenario. Here, the user is exercising after an injury and needs confidence in the app’s recommendation.

Source: People + AI Guidebook, Google


Positive example

Negative example

Tell the user when a lack of data might mean they’ll need to use their own judgment. Don’t be afraid to admit when a lack of data could affect the quality of the AI recommendations.

Source: People + AI Guidebook, Google


Setting: Cancer Imaging -- What explanations do radiologists want?

  • Past attempts often not successful at bringing tools into production. Radiologists do not trust them. Why?
  • Wizard of Oz study to elicit requirements

Shown predictions in prostate cancer study


Radiologists' questions

  • How does it perform compared to human experts?
  • "What is difficult for the AI to know? Where is it too sensitive? What criteria is it good at recognizing or not good at recognizing?"
  • What data (volume, types, diversity) was the model trained on?
  • "Does the AI have access to information that I don’t have? Does it have access to ancillary studies?" Is all used data shown in the UI?
  • What kind of things is the AI looking for? What is it capable of learning? ("Maybe light and dark? Maybe colors? Maybe shapes, lines?", "Does it take into consideration the relationship between gland and stroma? Nuclear relationship?")
  • "Does it have a bias a certain way?" (compared to colleagues)

Radiologists' questions

  • Capabilities and limitations: performance, strength, limitations; e.g. how does it handle well-known edge cases
  • Functionality: What data used for predictions, how much context, how data is used
  • Medical point-of-view: calibration, how liberal/conservative when grading cancer severity
  • Design objectives: Designed for few false positives or false negatives? Tuned to compensate for human error?
  • Other considerations: legal liability, impact on workflow, cost of use

Further details: Paper, Table 1


Radiologists Study Insights

AI literacy important for trust

Be transparent about data used

Describe training data and capabilities

Give mental model, examples, human-relatable test cases

Communicate the AI’s point-of-view and design goal

Cai, Carrie J., Samantha Winter, David Steiner, Lauren Wilcox, and Michael Terry. ""Hello AI": Uncovering the Onboarding Needs of Medical Practitioners for Human-AI Collaborative Decision-Making." Proceedings of the ACM on Human-computer Interaction 3, no. CSCW (2019): 1-24.


Designing Transparency

Be explicit about the goal

Tailor explanation to specific user needs and user's AI literacy

Partial explanations or justifications often sufficient

Test effectiveness of transparency mechanisms


The Dark Side of Transparency


Many explanations are wrong

Approximations of black-box models, often unstable

Explanations necessarily partial, social

Often multiple explanations possible (Rashomon effect)

Possible to use inherently interpretable models instead?

When explanation desired/required: What quality is needed/acceptable?
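
One way to make the instability point concrete in class: ask a post-hoc explainer twice about the same prediction. A small sketch using the `lime` package (toy data, assumed setup, not from the lecture):

```python
# Sketch (assumed setup): LIME fits a local surrogate from random samples,
# so repeated runs on the same prediction may rank features differently.
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import RandomForestClassifier
from lime.lime_tabular import LimeTabularExplainer

data = load_breast_cancer()
model = RandomForestClassifier(random_state=0).fit(data.data, data.target)

explainer = LimeTabularExplainer(
    data.data,
    feature_names=list(data.feature_names),
    class_names=list(data.target_names),
)

# Explain the same instance twice; the top features and weights can differ,
# illustrating that such explanations are (unstable) approximations.
for _ in range(2):
    exp = explainer.explain_instance(data.data[0], model.predict_proba, num_features=3)
    print(exp.as_list())
```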


Explanations foster Trust

Users are less likely to question the model when explanations provided

  • Even if explanations are unreliable
  • Even if explanations are nonsensical/incomprehensible

Danger of overtrust and intentional manipulation

Stumpf, Simone, Adrian Bussone, and Dympna O'Sullivan. "Explanations considered harmful? User interactions with machine learning systems." In Proceedings of the ACM SIGCHI Conference on Human Factors in Computing Systems (CHI). 2016.


Paper screenshot of experiment user interface

Springer, Aaron, Victoria Hollis, and Steve Whittaker. "Dice in the black box: User experiences with an inscrutable algorithm." In 2017 AAAI Spring Symposium Series. 2017.


3 Conditions of the experiment with different explanation designs

(a) Rationale, (b) Stating the prediction, (c) Numerical internal values

Observation: Both experts and non-experts overtrust numerical explanations, even when inscrutable.

Ehsan, Upol, Samir Passi, Q. Vera Liao, Larry Chan, I. Lee, Michael Muller, and Mark O. Riedl. "The who in explainable AI: how AI background shapes perceptions of AI explanations." arXiv preprint arXiv:2107.13509 (2021).


Illusion of Control

Users may feel influence and control, even with placebo controls

Study of sensemaking: how humans interpret machine filters and the controls they have over them

Vaccaro, Kristen, Dylan Huang, Motahhare Eslami, Christian Sandvig, Kevin Hamilton, and Karrie Karahalios. "The illusion of control: Placebo effects of control settings." In Proc CHI, 2018.


Regulatory Compliance

Companies give vague generic explanations to appease regulators

Checkbox compliance: Provide some mechanism without ensuring effectiveness

Example: The Fair Credit Reporting Act requires explanations for declined credit applications -- explanations are generic and ineffective for fighting discrimination; at most they ensure that the input data was correct

Selbst, Andrew D., and Solon Barocas. "The intuitive appeal of explainable machines." Fordham L. Rev. 87 (2018): 1085.


Gaming/Attacking the Model with Explanations?

Does providing an explanation allow customers to 'hack' the system?

  • Loan applications?
  • Apple FaceID?
  • Recidivism?
  • Auto grading?
  • Cancer diagnosis?
  • Spam detection?

Gaming the Model with Explanations?

Course assessment does not align with learning goals, leading to shallow learning


Constructive Alignment in Teaching

Course assessment does align with learning goals, leading to better learning

see also Claus Brabrand. Teaching Teaching & Understanding Understanding. Youtube 2009


Gaming the Model with Explanations?

  • A model prone to gaming uses weak proxy features
  • Protection requires making the model hard to observe (e.g., expensive to query for predictions)
  • Protecting models akin to "security by obscurity"
  • Good models rely on hard facts that relate causally to the outcome <- hard to game
IF age between 18–20 and sex is male THEN predict arrest
ELSE IF age between 21–23 and 2–3 prior offenses THEN predict arrest
ELSE IF more than three priors THEN predict arrest
ELSE predict no arrest

Human Oversight and Appeals


Human Oversight and Appeals

  • Unavoidable that ML models will make mistakes
  • Users knowing about the model may not be comforting
  • Inability to appeal a decision can be deeply frustrating

Capacity to keep humans in the loop?

ML used because human decisions are a bottleneck

ML used because human decisions are biased and inconsistent

Do we have the capacity to handle complaints/appeals?

Wouldn't reintroducing humans bring back biases and inconsistencies?


Designing Human Oversight

Consider the entire system and consequences of mistakes

Deliberately design mitigation strategies for handling mistakes

Consider keeping humans in the loop, balancing harms and costs

  • Provide pathways to appeal/complain? Respond to complaints?
  • Review mechanisms? Can humans override tool decisions? (see the sketch after this list)
  • Tracking telemetry, investigating common mistakes?
  • Audit model and decision process rather than appeal individual outcomes?
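
As a concrete illustration of such a review mechanism, a minimal sketch (all names hypothetical, not a prescribed design) that routes low-confidence predictions and appealed decisions to a human review queue instead of acting on the model output directly:

```python
# Hypothetical sketch: human-in-the-loop routing for an ML-backed decision.
# Low-confidence predictions and appealed cases go to a review queue where
# a human can override the model; cases are kept for later audits.
from dataclasses import dataclass

CONFIDENCE_THRESHOLD = 0.9  # assumed value; balance harms of mistakes vs. review cost

@dataclass
class Decision:
    outcome: str       # e.g., "admit" / "reject" / "pending human review"
    decided_by: str    # "model" or "human"
    confidence: float

def decide(features, model, review_queue, appealed=False):
    # model is assumed to follow the scikit-learn predict/predict_proba API
    confidence = float(max(model.predict_proba([features])[0]))
    outcome = str(model.predict([features])[0])
    if appealed or confidence < CONFIDENCE_THRESHOLD:
        # Defer to a human reviewer; log the case for auditing common mistakes.
        review_queue.append({"features": features, "model_outcome": outcome,
                             "confidence": confidence})
        return Decision("pending human review", "human", confidence)
    return Decision(outcome, "model", confidence)
```

The threshold and the choice of which cases reach a human are design decisions that follow from the harms and costs weighed above.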

Breakout: Transparency in Admissions

For an automated Master's admissions support system, consider what you would make transparent and to whom.

In groups, tagging group members, respond in #lecture:

  • What information (global, local) would you provide to applicants? What's the purpose?
  • What information (global, local) would you provide to the admissions committee? What's the purpose?

Accountability and Culpability

Who is held accountable if things go wrong?


On Terminology

  • accountability, responsibility, liability, and culpability all overlap in common use
  • often about assigning blame -- responsible for fixing or liable for paying for damages
  • liability, culpability have legal connotation
  • responsibility tends to describe ethical aspirations
  • accountability often defined as oversight relationship, where actor is accountable to some "forum" that can impose penalties
  • see also legal vs ethical earlier

Random letters


On Terminology

Academic definition of accountability:

A relationship between an actor and a forum, in which the actor has an obligation to explain and to justify his or her conduct, the forum can pose questions and pass judgement, and the actor may face consequences.

That is, accountability implies some oversight with the ability to penalize

Wieringa, Maranke. "What to account for when accounting for algorithms: a systematic literature review on algorithmic accountability." In Proceedings of the Conference on Fairness, Accountability, and Transparency, pp. 1-18. 2020.

Random letters


Who is responsible?

teen-suicide-rate


Who is responsible?

News headline: How US surveillance technology is propping up authoritarian regimes


Who is responsible?

Weapons robot


Who is responsible?

Faceswap github webpage


Faceswap's README "FaceSwap has ethical uses"

[...] as is so often the way with new technology emerging on the internet, it was immediately used to create inappropriate content.

[...] it was the first AI code that anyone could download, run and learn by experimentation without having a Ph.D. in math, computer theory, psychology, and more. Before "deepfakes" these techniques were like black magic, only practiced by those who could understand all of the inner workings as described in esoteric and endlessly complicated books and papers.

[...] the release of this code opened up a fantastic learning opportunity.

Are there some out there doing horrible things with similar software? Yes. And because of this, the developers have been following strict ethical standards. Many of us don't even use it to create videos, we just tinker with the code to see what it does. [...]

FaceSwap is not for creating inappropriate content. FaceSwap is not for changing faces without consent or with the intent of hiding its use. FaceSwap is not for any illicit, unethical, or questionable purposes. [...]


THE SOFTWARE IS PROVIDED “AS IS”, WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.

Note: Software engineers have (mostly) gotten away with declaring themselves not liable


Easy to Blame "The Algorithm" / "The Data" / "Software"

"Just a bug, things happen, nothing we could have done"

  • But system was designed by humans
  • But humans did not anticipate possible mistakes, did not design to mitigate mistakes
  • But humans made decisions about what quality was good enough
  • But humans designed/ignored the development process
  • But humans gave/sold poor quality software to other humans
  • But humans used the software without understanding it
  • ...

Stack Overflow survey on who is responsible

Results from the 2018 Stack Overflow Survey


What to do?

  • Responsible organizations embed risk analysis, quality control, and ethical considerations into their process
  • Establish and communicate policies defining responsibilities
  • Work from aspirations toward culture change: baseline awareness + experts
  • Document tradeoffs and decisions (e.g., datasheets, model cards)
  • Continuous learning
  • Consider controlling/restricting how software may be used, whether it should be built at all
  • And... follow the law
  • Get started with existing guidelines, e.g., in AI Ethics Guidelines

(Self-)Regulation and Policy


Self regulation of tech companies on facial recognition


Responsible AI website from Microsoft


Policy Discussion and Framing

Self-regulation vs government regulation? Assuring safety vs fostering innovation?




"Wishful Worries"

We are distracted with worries about fairness and safety of hypothetical systems

Most systems fail because they don't work in the first place: they don't actually solve a real problem, or they attempt impossible tasks

Wouldn't help even if they solved the given problem (e.g., predictive policing?)

Raji, Inioluwa Deborah, I. Elizabeth Kumar, Aaron Horowitz, and Andrew Selbst. "The fallacy of AI functionality." In 2022 ACM Conference on Fairness, Accountability, and Transparency, pp. 959-972. 2022.


Forbes Article: This Is The Year Of AI Regulations


“Accelerating America’s Leadership in Artificial Intelligence”

“the policy of the United States Government [is] to sustain and enhance the scientific, technological, and economic leadership position of the United States in AI.” -- White House Executive Order Feb. 2019

Tone: "When in doubt, the government should not regulate AI."

Note:

    1. Setting AI Governance Standards: "foster public trust in AI systems by establishing guidance for AI development. [...] help Federal regulatory agencies develop and maintain approaches for the safe and trustworthy creation and adoption of new AI technologies. [...] NIST to lead the development of appropriate technical standards for reliable, robust, trustworthy, secure, portable, and interoperable AI systems."

EU AI Act

Broad regulatory framework, passed March 13, 2024

Risk-based framework:

  • Unacceptable risk: Banned applications for social scoring, cognitive manipulation, real-time biometric surveillance, ...
  • High risk: Registration, (ongoing) risk assessment, data governance, monitoring, documentation, transparency, human oversight, and assurances for robustness, security, and accuracy for applications for hiring, health, law enforcement, ...
  • Limited risk: Limited transparency (model's existence) obligations, e.g., image generation
  • Minimal risk: Not regulated, e.g., video games, spam filters

2023 WH Executive Order on the Safe, Secure, and Trustworthy Development and Use of Artificial Intelligence

Instructs agencies to study risks and develop standards

Broad scope, touches on quality assurance standards (incl. red teaming) and marking AI-generated content

Domain-specific guidelines for chemical, biological, radiological, nuclear, and cybersecurity risks to be explored

Committees, reports, guidance, research instead of enforceable rules


Call for Transparent and Audited Models

"no black box should be deployed when there exists an interpretable model with the same level of performance"

For high-stakes decisions

  • ... with government involvement (recidivism, policing, city planning, ...)
  • ... in medicine
  • ... with discrimination concerns (hiring, loans, housing, ...)
  • ... that influence society and discourse? (algorithmic content amplifications, targeted advertisement, ...)

Regulate possible conflict: Intellectual property vs public welfare

Rudin, Cynthia. "Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead." Nature Machine Intelligence 1.5 (2019): 206-215. (Preprint)
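
As a rough illustration of the argument (toy data and the standard scikit-learn API, not from Rudin's paper): an inherently interpretable model whose complete decision logic can be printed and audited, rather than approximated after the fact.

```python
# Sketch (toy example): train a small, inherently interpretable model and
# export its complete rule set for stakeholder review and audit.
from sklearn.datasets import load_breast_cancer
from sklearn.tree import DecisionTreeClassifier, export_text

X, y = load_breast_cancer(return_X_y=True, as_frame=True)
model = DecisionTreeClassifier(max_depth=3).fit(X, y)  # depth cap keeps rules readable

# Unlike a post-hoc approximation of a black box, these rules ARE the model.
print(export_text(model, feature_names=list(X.columns)))
```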


Criticism: Ethics Washing, Ethics Bashing, Regulatory Capture


Summary

  • Transparency goes beyond explaining predictions
  • Plan for mistakes and human oversight
  • Accountability and culpability are hard to capture, little regulation
  • Be a responsible engineer, adopt a culture of responsibility
  • Regulations may be coming

Further Readings