Replies: 1 comment
-
|
At the recent EuroSTAR conference, I initiated a round-table discussion on this topic and this led to some interesting findings.
On the topic of AI systems, we discussed their use in anomaly detection. Two examples were specifically discussed: a software in cars that detect if automobile drivers are distracted by detecting signs such as their eyes wandering off the road, or the vehicle drifting off its lane. The software then alerts the driver to take a break. A second example is the use of AI software in train platforms on Swedish subway to detect and prevent suicides by detecting anomalous behaviour and alerting personnel at the stations to intervene. A third example is the successful use of AI in financial fraud detection. In all these cases, a further set of characteristics surfaced:
Does reading this trigger any ideas of similar examples? Or some other interesting observations here? Credits: Sanne Visser, Tara Walton, @sharovatov, @maryiat (and a few others on whose names I am blanking out) participated in this discussion. |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
In Testing, we rightly assign a special place to human judgement. Human judgement should play a central role in qualifying software for human use.
Yet, certain quality assurance tasks are much better suited for machines, like spell-check or linting.
In Testing, an oracle is the means by which we identify if there is a problem. The intuition by which a human user detects that formatting on a site is broken, or the algorithm used by a spell-check software to detect misspelt words and draw a red squiggly line under them are oracles.
The aim of this enquiry is to step back and understand under what conditions machines outperform humans as oracles. By exploring this question deeper, my hope is that we can extrapolate our findings to GenAI to determine the kinds of quality assurance tasks we can delegate to GenAI based systems, and where we will continue to retain our edge as humans.
So here is my concrete ask:
PS: This enquiry was motivated by reflecting on the work of behaviour scientists like Daniel Kahneman, who have explored the fallibility of human judgement, the conditions under algorithms produce superior results as compared to human judgement.
Beta Was this translation helpful? Give feedback.
All reactions