Contextual interview protocol
Pages 11
Clone this wiki locally
Initial DeepPhe Contextual Inquiry Protocol
During the interview, we will ask the informant to conduct relevant tasks, explaining what is being done, why it must be done, and how it is done. As the informant continues through the task, the interviewer(s) will ask for clarification, pose questions about contextual factors, and generally attempt to build an understanding of the work. Therefore, the interview will be unstructured and much of the discussion will be determined by the events that occur as the informant completes their tasks.
Below is some broad structure for the interview. However, no ordering of these questions is implied - rather, they can be asked when convenient during the course of the discussion.
Contents
Recording
All sessions should be audio-recorded with the participant’s permission, either via web conference software or portable digital audio recorder. Written notes will be taken as well. If interviews are conducted via web-conference, screen captures will be recorded as well. We will use GoToMeeting for web conferencing. Artifacts such as data files would also be useful when available.
Steps will be taken to avoid protected health information (PHI). We will try to remove PHI data entirely from consideration. We will only observe what participants do if it does not involve any PII. If any displays or artifacts involving PHI are inadvertently introduced into the discussion, any screen capture or recording modes that might capture the PHI will be disabled immediately.
Interview Introduction
The introductory component of the discussion is designed to provide basic context and to start conversation:
Introductory script:
Hello. My name is XXXX. As you know, you’re here today to participate in a contextual inquiry study to understand the requirements for the visual analytics tools we will be developing. These tools will help researchers explore and interpret phenotype data that will be extracted with our software. We would like to understand users’ information and workflow needs to maximize the usefulness of the software.
During the session, you will be questioned and observed. We will ask some general questions about your work, and more detailed questions about specific tasks. Data will be collected in the form of notes and artifacts. With your permission, the session will be audio-recorded and captured via screen capture software. The session will last approximately one hour.
In this session, we would like to understand your broad research and clinical goals, and then explore in detail how you currently identify, store, and manipulate specific phenotype data and link this to genotype data in your work. We’d like to understand how this work is done, any challenges you face, and what you would hope for in a new tool to make work more efficient and easier to accomplish. If possible, we’d like you to demonstrate the tools that you currently use on some meaningful data. We’d like you to identify a representative dataset and task that you are currently working on or have worked on previously to demonstrate how you currently do things.
Do you have any questions?
Before we begin, do we have your permission to record this session? (IF YES, BEGIN RECORDING)
To start off, we’d like to get a sense of your background and research.
Respondent background:
- What is your training? What degrees have you obtained, and in what areas?
- What is your current position? How long have you been at this position?
- How much of your time do you spend on cancer data analytics?
- If applicable: How much of your time is spent on clinical work?
Problem description:
- What questions are you currently trying to answer related to cancer analytics?
- What sort of data are you dealing with?
- Is this a research or clinical effort?
- Who is this effort supported by?
Data
- Which tools do you currently work with for your data?
- What are the major data challenges that you struggle with?
Major non-data challenges:
- What are the major challenges that you struggle with that are not data related?
##Task identification
Now we’d like you to identify a specific dataset and tasks that fit within the scope of the work. This should be representative of your work and illustrative of difficulties you encounter. Please choose something that does not involve sensitive data.
-
Have you selected a task or tasks? Please explain the task.
-
Will demonstrating this task involve any protected health information? (IF YES, WE WILL NOT OBSERVE BUT JUST HAVE THE PARTICIPANT EXPLAIN HOW THEY DO THEIR WORK)
-
Note: If participants identify multiple important tasks, they should be prioritized so that the session will start with the highest priority and continue from there as time allows.
Observation
-
If task does not contain PHI: Now I’d like you to work through the task. I’ll be observing you and asking questions as you work to gain an understanding of exactly what you’re doing and why you’re doing it.
-
If task does contain PHI: Now I’d like you to explain in detail how you do the task. I’ll be asking questions to gain an understanding of exactly what you do and why you do it.
-
Questions to be asked at any point during the interview, including a few that are focused on specific stakeholder groups. (Some answers may become apparent based on respondent comments)
- Which statistical methods (if any) do you use?
- Which file formats are used? Which file formats are preferred?
- What challenges or problems do you experience with storing data?
- How do you record findings or insights?
- How do you form hypotheses?
- Given data that might support a hypothesis, how is other data handled if it is not consistent with the hypothesis?
- Are there any attempts made to explicitly rule out alternative hypotheses?
- Which data tools/repositories do you use?
- How is EHR clinical text managed alongside genomic information and other supporting data?
- What difficulties do you run into with manipulating phenotype data?
- When, why, and how do you make use of the research literature to inform understanding?
- How do you build accurate models of phenotypes, given complexity of clinical text, particularly with respect to temporal ordering?
- How do you validate these models?
- How do you represent uncertainty in your models?
- How do you determine that patients or treatments are “similar” to each other? How confident are you in this determination?
- Which standardized vocabulary/vocabularies do you use?
- How do you communicate results to peers? What output formats are needed in your work?
- What are the challenges you experience with communicating results?
- How do current tools succeed? How do they fail? What are current tools missing?
- For Information Brokers: How do you translate between needs of researchers and worlds of NLP and/or basic clinical records?
- For NLP developers: How do you evaluate success?
- It seems that you typically XXXXX while working. And that you struggle with XXXXX. Is that accurate?
-
Note: During the session, the interviewer might be tempted to make suggestions regarding potential designs or improvements to tools and/or workflow. Such suggestions can be useful, but should be presented carefully, so as to avoid any implicit promises.
-
Repeat this process for additional tasks as time allows.
Debrief
The last part of the interview will be used to review general impressions, ask about observations, and to confirm understanding of what was seen during the observations.
- Ask about general patterns that were observed.
- Ask for clarification of any inconsistent patterns that may have been observed.
- Attempt to draw the respondent out - try to get them to reflect on their work.
- Give the respondent an opportunity to raise other concerns and to ask questions about the work.
Sample Script
- Are there any challenges that you have previously encountered that weren’t observed today?
- What would you like to see in a new tool?
- Are there any other comments you would like to make?
- Thank you very much for your time. We really appreciate your input.