author	title	semester	footer	license
Christian Kaestner & Eunsuk Kang	17-645: Gathering Requirements	Fall 2022	17-645 Machine Learning in Production • Christian Kaestner, Carnegie Mellon University • Fall 2022	Creative Commons Attribution 4.0 International (CC BY 4.0)

Machine Learning in Production

Gathering Requirements

Exploring Requirements...

Learning Goals

Understand the role of requirements in ML-based systems and their failures
Understand the distinction between the world and the machine
Understand the importance of environmental assumptions in establishing system requirements

Readings

Required reading: 🗎 Jackson, Michael. "The world and the machine." In Proceedings of the International Conference on Software Engineering. IEEE, 1995.

Going deeper: 🕮 Van Lamsweerde, Axel. Requirements engineering: From system goals to UML models to software. John Wiley & Sons, 2009.

Failures in ML-Based Systems

Facial Recognition in ATM

Q. What went wrong? What is the root cause of the failure?

Automated Hiring

Q. What went wrong? What is the root cause of the failure?

Autopilot in Vehicles

Q. What went wrong? What is the root cause of the failure?

IBM Watson

Washington Post, 06/2015

IBM Watson

"We got concerns from them that the recommendations that it was giving were just not relevant...it would suggest a particular kind of treatment that wasn’t available in the locality in which it was making the recommendation, or the recommendation did not at all square with the treatment protocols that were in use at the local institution..."

Slate, 01/2022

Risks in ML-based Systems

What went wrong? What were the root causes of failures in these systems? Was the quality of an ML model to blame?

Reminder: ML in a System

Machine learning is a component within a system

Need to also understand other parts and environment

Software Requirements

Describe what the system should do, in terms of the services that it provides and their qualities (safety, reliability, performance...)
Gathered through discussions with stakeholders (customers, domain experts, marketing team, industry regulators...)

Importance of Requirements

"The hardest single part of building a software system is deciding precisely what to build...No other part of the work so cripples the resulting system if done wrong." -- Fred Brooks, Mythical Man Month (1975)

Importance of Requirements

An investigation of software-related failures by the National Research Council in the US (2007)

Bugs in code account for only 3% of fatal software accidents

Most failures due to poor understanding of requirements or usability issues

Urge to start coding...

Developers tend to focus on writing code...

Often ignore requirements...

Too much effort, busywork only, distracts from coding...

Facing costly problems later... (built the wrong system)

Requirements & Design: Think before coding

Untangling Requirements

For completeness: Beh. vs quality req.

Behavioral requirements (functional requirements)

What the system shall do
How inputs and outputs relate
... typically clear 'correctness' specifications

Quality requirements (non-functional requirements)

How the system should operate and be built
Development budget and deadlines
Code quality, maintainability, extensibility requirements
Latency, scalability, throughput requirements
Safety, security, fairness req.
Usability requirements
... all require measurement

Machine vs World

No software lives in vacuum; every system is deployed as part of the world
A requirement describes a desired state of the world (i.e., environment)
Machine (software) is designed to sense and manipulate the environment into this desired state using input & output devices

Machine vs World: Fall Detection

What are elements of the environment?
What are the goals/requirements of the software in the real world?

(Smartwatch-based fall detection and emergency response)

Machine vs World: Lane Keeping Assist

What are the goals/requirements of the software in the real world?

Note: Requirement: The vehicle must be prevented from veering off the lane.

Shared Phenomena

Shared phenomena: Interface between the environment & software
- Input: Lidar, camera, pressure sensors, GPS
- Output: Signals generated & sent to the engine or brake control
Software can influence the environment only through the shared interface
- Unshared parts of the environment are beyond software’s control
- We can only assume how these parts will behave

Requirement vs Specification

System Requirement (REQ): What the system must ensure, in terms of desired effects on the environment
Software Specification (SPEC): What software must implement, expressed over the shared phenomena
Assumptions (ASM): What’s assumed about the behavior/properties of the environment; bridges the gap between REQ and SPEC

Formally: ASM ∧ SPEC ⊨ REQ

Shared Phenomena

Requirements (REQ) are expressed only in terms of world phenomena

Assumptions (ASM) expressed in terms of world & shared phenomena

Specifications (SPEC) are expressed in terms of shared phenomena

Software cannot directly satisfy a requirement on its own; it relies on assumptions about the environment!

Lane Assist Specification

Requirement (REQ): The vehicle must be prevented from veering off the lane.
Specification (SPEC): ??

Breakout: Lane Assist Assumptions

REQ: The vehicle must be prevented from veering off the lane.

SPEC: Lane detector accurately identifies lane markings in the input image; the controller generates correct steering commands

Discuss with your neighbor to come up with 2-3 assumptions

Example Assumptions for Lane Assist

Sensors are providing accurate information about the lane

Driver responds when given a warning

Steering wheel is functional

...

What could go wrong?

Recall: ASM ∧ SPEC ⊨ REQ

Wrong, inconsistent or infeasible requirements (REQ)
Missing or incorrect environmental assumptions (ASM)
Wrong or violated specification (SPEC)
Inconsistency in assumptions & spec (ASM ∧ SPEC = False)

Example each for lane assist?

Lufthansa 2904 Runway Crash

Reverse thrust: Decelerates plane after landing

REQ: Reverse thrust is enabled if and only if plane is on the ground

SPEC: Reverse thrust is enabled if and only if wheel is turning

Concretely, reverse thrust is enabled if (a) 6.3 tons of weight are sensed on each landing gear or (b) sensors indicate the wheels are turning faster than 72 knots

ASM: Wheel is turning if and only if plane on the ground

ASM: High amounts of weight are on both landing gears only if the plane is on the ground

Lufthansa 2904 Runway Crash

CC BY-SA 3.0 Anynobody

Lufthansa 2904 Runway Crash

REQ: Reverse thrust is enabled if and only if plane is on the ground

SPEC: Reverse thrust is enabled if and only if wheel is turning

ASM: Wheel is turning if and only if plane on the ground

On that day, runway was wet due to rain!

Wheel fails to turn, even though the plane is on the ground (assumption violated)
Pilot attempts to enable RT; overridden by the software
Plane goes off the runway and crashes!
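
The argument above can be checked mechanically. The following is a minimal sketch, assuming a deliberately simplified world of three booleans (`on_ground`, `wheels_turning`, `reverse_thrust`) and the simplified REQ, SPEC, and ASM stated on this slide; all function names are hypothetical. It enumerates every world state and lists those in which the premises hold but REQ does not.

```python
from itertools import product

# Each predicate takes one world state: (on_ground, wheels_turning, reverse_thrust).

def req(on_ground, wheels_turning, reverse_thrust):
    # REQ: reverse thrust is enabled iff the plane is on the ground
    return reverse_thrust == on_ground

def spec(on_ground, wheels_turning, reverse_thrust):
    # SPEC: reverse thrust is enabled iff the wheels are turning
    return reverse_thrust == wheels_turning

def asm(on_ground, wheels_turning, reverse_thrust):
    # ASM: the wheels are turning iff the plane is on the ground
    return wheels_turning == on_ground

def counterexamples(premises, conclusion):
    """Return all states where every premise holds but the conclusion fails."""
    return [
        state
        for state in product([False, True], repeat=3)
        if all(p(*state) for p in premises) and not conclusion(*state)
    ]

# ASM ∧ SPEC ⊨ REQ holds exactly when there is no counterexample.
with_asm = counterexamples([asm, spec], req)
# Dropping ASM models a world in which the assumption may be violated.
without_asm = counterexamples([spec], req)

print("with ASM:", with_asm)
print("without ASM:", without_asm)
```

With ASM included there is no counterexample. Without it, the state `(True, False, False)` shows up: the plane is on the ground, the wheels are not turning, and reverse thrust stays disabled, which corresponds to the wet-runway situation. The software satisfied its SPEC in that state; the requirement failed because the assumption did.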

Assumption Violations in ML-based Systems (1)

Assumptions about correctness of model predictions?

Assumptions of human behavior? Interaction with the system?

Assumptions about training data?

Assumptions about stability of data? about reliability of sensors? reliability of human input?

Assumption Violations in ML-based Systems (1)

Unrealistic or missing assumptions

e.g., poorly understood effect of weather conditions on sensor accuracy, missing pedestrian behavior

Concept drift

Environment evolves over time; underlying distribution changes
e.g., users' preferences for products
(More on this in the data quality lecture)
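
As a rough illustration of watching for a changing distribution, the sketch below compares a recent window of observations against a reference window. It is a crude heuristic under stated assumptions, not a statistical test: the ratings data, the function names, and the threshold of 2.0 are all hypothetical choices made for this example.

```python
from statistics import mean, stdev

def mean_shift_score(reference, recent):
    """Distance between the two means, in units of the reference standard deviation."""
    spread = stdev(reference)
    if spread == 0:
        # Degenerate reference window: any change in the mean counts as a shift.
        return 0.0 if mean(recent) == mean(reference) else float("inf")
    return abs(mean(recent) - mean(reference)) / spread

def has_drifted(reference, recent, threshold=2.0):
    # The threshold is an arbitrary illustration value, not a recommendation.
    return mean_shift_score(reference, recent) > threshold

# Hypothetical data: average weekly rating users give to one product category.
reference_ratings = [4.1, 4.0, 4.2, 3.9, 4.1, 4.0]
stable_ratings = [4.0, 4.1, 4.0]
shifted_ratings = [3.1, 3.0, 3.2]

print(has_drifted(reference_ratings, stable_ratings))
print(has_drifted(reference_ratings, shifted_ratings))
```

A flagged shift does not say why preferences changed; it only signals that an assumption about the stability of the data deserves a second look.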

Assumption Violations in ML-based Systems (2)

Adversaries

A malicious actor deliberately tries to violate assumptions
e.g., adversarial attacks on stop signs
(More in the security lecture)

Feedback loops

System repeatedly acts on and changes the environment over time; earlier assumptions may cease to hold
e.g., predictive policing

Recall: Lane Assist

REQ: The vehicle must be prevented from veering off the lane.
ASM: Sensors are providing accurate information about the lane; driver responds when given a warning; steering wheel is functional

What could go wrong in lane assist?

REQ: The vehicle must be prevented from veering off the lane.
ASM: Sensors are providing accurate information about the lane; driver responds when given a warning; steering wheel is functional

Missing or incorrect environmental assumptions (ASM)?

Concept drift? Adversaries?

Wrong or violated specification (SPEC)?

Process for Establishing Requirements

Identify environmental entities and machine components
State a desired requirement (REQ) over the environment
Identify the interface between the environment & machine
Identify the environmental assumptions (ASM)
Develop specifications (SPEC) that are sufficient to establish REQ
Check whether ASM ∧ SPEC ⊨ REQ
If not, go back to the beginning & repeat

Breakout Session: Fall detection

As a group, post answer to #lecture and tag group members:

Requirement: ...
Assumptions: ...
Specification: ...
What can go wrong: ...

What went wrong? (REQ, ASM, SPEC)?

"We got concerns from them that the recommendations that it was giving were just not relevant...it would suggest a particular kind of treatment that wasn’t available in the locality in which it was making the recommendation, or the recommendation did not at all square with the treatment protocols that were in use at the local institution..."

Slate, 01/2022

Takeaway

Software/ML models alone cannot fulfill system requirements
- They are just one part of the system, and have limited control over the environment
Environmental assumptions are just as critical in achieving requirements
- If you ignore/misunderstand these, your system may fail or do poorly (no matter how good your model is)
- Identify and document these assumptions as early as possible!
- Some of the assumptions may be violated over time as the environment changes -- Monitor these assumptions and adjust your specification accordingly
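
One way to make documented assumptions monitorable is to write each one down as an explicit check over runtime data. The sketch below does this for the lane assist assumptions; it is only an illustration, and the telemetry field names (`lane_confidence`, `seconds_since_warning_ack`, `steering_fault`) and the cutoff values are hypothetical.

```python
from dataclasses import dataclass
from typing import Callable

@dataclass(frozen=True)
class Assumption:
    name: str
    holds: Callable[[dict], bool]  # check over one telemetry snapshot

# Hypothetical encodings of the lane assist assumptions (ASM).
LANE_ASSIST_ASSUMPTIONS = [
    Assumption("sensors provide accurate lane information",
               lambda t: t["lane_confidence"] >= 0.8),
    Assumption("driver responds when given a warning",
               lambda t: t["seconds_since_warning_ack"] <= 5.0),
    Assumption("steering wheel is functional",
               lambda t: not t["steering_fault"]),
]

def violated_assumptions(telemetry, assumptions=LANE_ASSIST_ASSUMPTIONS):
    """Return the names of all assumptions that do not hold for this snapshot."""
    return [a.name for a in assumptions if not a.holds(telemetry)]

# Two hypothetical snapshots: a clear day, and heavy rain that degrades the camera.
clear_day = {"lane_confidence": 0.95, "seconds_since_warning_ack": 1.2,
             "steering_fault": False}
heavy_rain = {"lane_confidence": 0.30, "seconds_since_warning_ack": 1.2,
              "steering_fault": False}

print(violated_assumptions(clear_day))
print(violated_assumptions(heavy_rain))
```

Keeping the checks next to the written assumptions means a violation can be logged or escalated with the name of the assumption that failed, rather than surfacing later as an unexplained failure of the requirement.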

Gathering and Negotiating Requirements

Understanding requirements is hard

Why?

See 🗎 Jackson, Michael. "The world and the machine." In Proceedings of the International Conference on Software Engineering. IEEE, 1995.

Understanding requirements is hard

Customers don't know what they want until they see it
Customers change their mind ("no, not like that")
Descriptions are vague
It is easy to ignore important requirements (privacy, fairness)
Focusing too narrowly on the needs of a few users
Engineers think they already know the requirements
Engineers are overly influenced by technical capability
Engineers prefer elegant abstractions

Examples?

See also 🗎 Jackson, Michael. "The world and the machine." In Proceedings of the International Conference on Software Engineering. IEEE, 1995.

Start with Stakeholders...

Stakeholders: all persons and entities who have an interest in a project or who may be affected by the project

Not only direct customers and users, also affected people, owners, developers, operators, regulators, ...

All may have needs, preferences, or concerns...

Start creating a list of all possible stakeholders

Stakeholders in lane keeping software? In fall detection software? In college admissions software?

Requirements elicitation techniques

Requirements elicitation techniques (1)

Background study: understand organization, read documents, try to use old system
Interview different stakeholders
- Ask open ended questions about problems, needs, possible solutions, preferences, concerns...
- Support with visuals, prototypes, ask about tradeoffs
- Use checklists to consider qualities (usability, privacy, latency, ...)

What would you ask in lane keeping software? In fall detection software? In college admissions software?

ML Prototyping: Wizard of Oz

Note: In a Wizard of Oz experiment, a human fills in for the ML model that is to be developed. For example, a human might write the replies of a chatbot.

Requirements elicitation techniques (2)

Surveys, group sessions, workshops: Engage with multiple stakeholders, explore conflicts
Ethnographic studies: embed with users, passively observe or actively become part
Requirements taxonomies and checklists: Reusing domain knowledge
Personas: Shift perspective to explore needs of stakeholders not interviewed

Personas in GenderMag

See examples and details http://gendermag.org/foundations.php

Requirements elicitation example

For accessibility feature: What would you do?

Negotiating Requirements

Many requirements are conflicting/contradictory

Different stakeholders want different things, have different priorities, preferences, and concerns

Formal requirements and design methods such as card sorting, affinity diagramming, importance-difficulty matrices

Generally: sort through requirements, identify alternatives and conflicts, resolve with priorities and decisions -> single option, compromise, or configuration
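
The sorting step can be sketched with one of the methods named above, an importance-difficulty matrix. The following is a minimal illustration, assuming stakeholders have already scored each candidate requirement on a 1 to 5 scale; the fall-detection candidates, their scores, and the cutoff of 3 are hypothetical.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Candidate:
    text: str
    importance: int  # 1 (low) to 5 (high), as judged by stakeholders
    difficulty: int  # 1 (easy) to 5 (hard), as judged by the team

def quadrant(candidate, cutoff=3):
    """Name the matrix quadrant a candidate requirement falls into."""
    imp = "high importance" if candidate.importance >= cutoff else "low importance"
    dif = "high difficulty" if candidate.difficulty >= cutoff else "low difficulty"
    return f"{imp}, {dif}"

def build_matrix(candidates, cutoff=3):
    """Group candidate requirements by quadrant, keeping their input order."""
    matrix = {}
    for c in candidates:
        matrix.setdefault(quadrant(c, cutoff), []).append(c.text)
    return matrix

# Hypothetical candidates for the fall detection example.
candidates = [
    Candidate("Alert an emergency contact after a detected fall", 5, 2),
    Candidate("Never raise a false alarm", 4, 5),
    Candidate("Let users pick a custom alert sound", 1, 1),
]

for name, items in build_matrix(candidates).items():
    print(name, "->", items)
```

The matrix does not resolve conflicts by itself; it makes the tradeoffs visible so that the people responsible for the decision can prioritize, compromise, or offer a configuration option.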

Stakeholder Conflict Examples

User wishes vs developer preferences: free updates vs low complexity

Customer wishes vs affected third parties: privacy preferences vs disclosure

Product owner priorities vs regulators: maximizing revenue vs privacy protections

Conflicts in lane keeping software? In fall detection software? In college admissions software?

Who makes the decisions?

Requirements documentation

Write down requirements

what the software shall do, what it shall not do, what qualities it shall have,
document decisions and rationale for conflict resolution

Requirements as input to design and quality assurance

Formal requirements documents are often seen as bureaucratic; lightweight options in notes, wikis, and issues are common

Systems with higher risk -> consider more formal documentation

Requirements evaluation (validation!)

Requirements evaluation

Manual inspection (like code review)

Show requirements to stakeholders, ask for misunderstandings, gaps

Show prototype to stakeholders

Checklists to cover important qualities

Critically inspect assumptions for completeness and realism

Look for unrealistic ML-related assumptions (no false positives, unbiased representative data)

How much requirements eng. and when?

Requirements important in risky systems

Requirements as basis of a contract (outsourcing, assigning blame)

Rarely ever fully complete upfront and stable; anticipate change

Stakeholders see problems in prototypes, change their minds
Especially ML requires lots of exploration to establish feasibility

Low-risk problems often use lightweight, agile approaches

(We'll return to this later)

How much requirements eng. and when?

Summary

Requirements state the needs of the stakeholders and are expressed over the phenomena in the world

Software/ML models have limited influence over the world

Environmental assumptions play just as important a role in establishing requirements

Identify stakeholders, interview them, resolve conflicts
