From 406af08daecc7f172fb7acfe50c34befc24c4e3f Mon Sep 17 00:00:00 2001
From: Anthony Gitter <gitter@biostat.wisc.edu>
Date: Mon, 1 May 2017 23:39:05 -0500
Subject: [PATCH 1/5] Initial draft of conclusions

---
 references/tags.tsv        |  1 +
 sections/07_conclusions.md | 63 +++++++++++++++++++++++++++++++++-----
 2 files changed, 56 insertions(+), 8 deletions(-)

diff --git a/references/tags.tsv b/references/tags.tsv
index 15f1e48c..ba40bf96 100644
--- a/references/tags.tsv
+++ b/references/tags.tsv
@@ -148,6 +148,7 @@ Silver2016_alphago	doi:10.1038/nature16961
 Sonderby	arxiv:1503.01919
 Soueidan	doi:10.1515/metgen-2016-0001
 Spark	doi:10.1145/2934664
+Speech_recognition	url:http://www.businessinsider.com/ibm-edges-closer-to-human-speech-recognition-2017-3
 Stein2010_cloud	doi:10.1186/gb-2010-11-5-207
 Stenstrom2005_latent	doi:10.2337/diabetes.54.suppl_2.S68
 Stormo2000_dna	doi:10.1093/bioinformatics/16.1.16
diff --git a/sections/07_conclusions.md b/sections/07_conclusions.md
index fe208249..78fe242d 100644
--- a/sections/07_conclusions.md
+++ b/sections/07_conclusions.md
@@ -1,15 +1,60 @@
 ## Conclusions
 
-Final thoughts and future outlook here. The Discussion will give an overview
-and the Conclusion will provide a short, punchy take home message.
+Deep learning-based methods now represent the state of the art in a diverse
+array of tasks in patient and disease categorization, fundamental biological
+study, genomics, and treatment development.  We return to our central question:
+given this rapid progress, has deep learning transformed the study of human
+disease?  Though the answer is highly dependent on the specific domain and
+problem being addressed, we conclude that deep learning has not *yet* realized
+its transformative potential or induced a strategic inflection point.  Despite
+its dominance over competing machine learning approaches in many of the areas
+reviewed here and quantitative improvements in predictive performance, deep
+learning has not yet qualitatively "solved" those problems that were previously
+"unsolved".
 
-Points to mention based on discussion thus far that may make the bar for
-conclusions:
+As an analogy, consider recent progress in conversational speech recognition.
+Since 2009 there have been drastic performance improvements, with error rates
+dropping from more than 20% to less than 6% [@tag:Speech_recognition] and
+finally approaching or exceeding human performance in the past year
+[@arxiv:1610.05256 @arxiv:1703.02136] `TODO: working on a second source for this
+error trajectory from a talk by Eric Horvitz`. The phenomenal  improvements on
+benchmark datasets are undeniable, but the successes of the early 2010s did not
+fundamentally transform the domain. `TODO: this is debatable, maybe need a
+different example or to clarify what is meant by "conversational" speech`
+Widespread adoption of these technologies will requires not only improvements
+over baseline methods but truly "solving" the problem, in this case exceeding
+human-level performance, as well as convincing users to embrace the technology
+[@tag:Speech_recognition].  We see parallels to the healthcare domain, where
+achieving the full potential of deep learning will require outstanding
+predictive performance as well as adoption by biologists and clinicians.
 
-* Limitations of data & workarounds (availability impacts on amount, etc)
-* Transferability of features
-* Strong enthusiasm for unsupervised approaches.
-* Right to an explanation (possibly, depends if raised in multiple areas)
+Some of the areas we have discussed are closer to surpassing this lofty bar than
+others, generally those that are more similar to the non-biomedical tasks that
+are now monopolized by deep learning.  In medical imaging, diabetic retinopathy
+[@doi:10.1001/jama.2016.17216], diabetic macular edema
+[@doi:10.1001/jama.2016.17216], and skin lesion [@doi:10.1038/nature21056]
+classifiers are highly accurate and comparable to dermatologist performance in
+the latter case. `TODO: more imaging examples or other examples that might be at
+or close to "transformative"?`  In other domains, perfect accuracy will not be
+required because deep learning will be used primarily to prioritize experiments.
+For example, in chemical screening for drug discovery, a deep learning system
+that successfully identifies dozens or hundreds of target-specific, active
+small molecules from a massive search space would have immense practical value
+even if its overall precision is modest. Conversely, the most challenging tasks
+may be those in which predictions are used directly for downstream modeling or
+decision-making, especially in the clinic. `TODO: elaborate more on this idea
+or split in a new paragraph?`
+
+Even if deep learning in biology and healthcare is not yet transformative today,
+we are extremely optimistic about its future.  We have highlighted numerous
+challenges beyond improving training and predictive accuracy, such as preserving
+patient privacy and interpreting models.  Ongoing research has begun to address
+these problems and shown they are not insurmountable.  Deep learning offers the
+flexibility to model data in its most natural form, spurring creative modeling
+approaches that would be infeasible with other machine learning techniques.
+Unsupervised methods are currently less-developed than their supervised
+counterparts, making them an attractive target for future research in this
+domain.  `TODO: still working on a strong closing line`
 
 ### Author contributions
 
@@ -29,3 +74,5 @@ revisions; approved the final manuscript draft; and agreed to be accountable in
 all aspects of the work. Individuals who did not contribute in one or more of
 these ways, but who did participate, are acknowledged at the end of the
 manuscript.
+
+`TODO: update after finalizing discussion in #369`

From 395533133a4b5304a6cdbf4396e182bf041c7e97 Mon Sep 17 00:00:00 2001
From: Anthony Gitter <gitter@biostat.wisc.edu>
Date: Wed, 3 May 2017 10:11:53 -0500
Subject: [PATCH 2/5] Respond to feedback

---
 sections/07_conclusions.md | 96 +++++++++++++++++++++-----------------
 1 file changed, 53 insertions(+), 43 deletions(-)

diff --git a/sections/07_conclusions.md b/sections/07_conclusions.md
index 78fe242d..2af8d4ad 100644
--- a/sections/07_conclusions.md
+++ b/sections/07_conclusions.md
@@ -1,60 +1,70 @@
 ## Conclusions
 
-Deep learning-based methods now represent the state of the art in a diverse
-array of tasks in patient and disease categorization, fundamental biological
-study, genomics, and treatment development.  We return to our central question:
-given this rapid progress, has deep learning transformed the study of human
-disease?  Though the answer is highly dependent on the specific domain and
-problem being addressed, we conclude that deep learning has not *yet* realized
-its transformative potential or induced a strategic inflection point.  Despite
-its dominance over competing machine learning approaches in many of the areas
-reviewed here and quantitative improvements in predictive performance, deep
-learning has not yet qualitatively "solved" those problems that were previously
-"unsolved".
+Deep learning-based methods now matches or surpasses the previous state of the
+art in a diverse array of tasks in patient and disease categorization,
+fundamental biological study, genomics, and treatment development.  We return to
+our central question: given this rapid progress, has deep learning transformed
+the study of human disease?  Though the answer is highly dependent on the
+specific domain and problem being addressed, we conclude that deep learning has
+not *yet* realized its transformative potential or induced a strategic
+inflection point.  Despite its dominance over competing machine learning
+approaches in many of the areas reviewed here and quantitative improvements in
+predictive performance, deep learning has not yet definitively "solved" those
+problems.
 
 As an analogy, consider recent progress in conversational speech recognition.
 Since 2009 there have been drastic performance improvements, with error rates
 dropping from more than 20% to less than 6% [@tag:Speech_recognition] and
 finally approaching or exceeding human performance in the past year
-[@arxiv:1610.05256 @arxiv:1703.02136] `TODO: working on a second source for this
-error trajectory from a talk by Eric Horvitz`. The phenomenal  improvements on
-benchmark datasets are undeniable, but the successes of the early 2010s did not
-fundamentally transform the domain. `TODO: this is debatable, maybe need a
-different example or to clarify what is meant by "conversational" speech`
-Widespread adoption of these technologies will requires not only improvements
-over baseline methods but truly "solving" the problem, in this case exceeding
-human-level performance, as well as convincing users to embrace the technology
-[@tag:Speech_recognition].  We see parallels to the healthcare domain, where
-achieving the full potential of deep learning will require outstanding
-predictive performance as well as adoption by biologists and clinicians.
+[@arxiv:1610.05256 @arxiv:1703.02136] `TODO: need better source for this error
+trajectory`. The phenomenal improvements on benchmark datasets are undeniable,
+but halving the error rates on these benchmarks did not fundamentally transform
+the domain.  Widespread adoption of conversational speech technologies will
+require not only improvements over baseline methods but truly solving the
+problem, in this case exceeding human-level performance, as well as convincing
+users to embrace the technology [@tag:Speech_recognition].  We see parallels to
+the healthcare domain, where achieving the full potential of deep learning will
+require outstanding predictive performance as well as acceptance and adoption by
+biologists and clinicians.
 
 Some of the areas we have discussed are closer to surpassing this lofty bar than
 others, generally those that are more similar to the non-biomedical tasks that
 are now monopolized by deep learning.  In medical imaging, diabetic retinopathy
 [@doi:10.1001/jama.2016.17216], diabetic macular edema
-[@doi:10.1001/jama.2016.17216], and skin lesion [@doi:10.1038/nature21056]
-classifiers are highly accurate and comparable to dermatologist performance in
-the latter case. `TODO: more imaging examples or other examples that might be at
-or close to "transformative"?`  In other domains, perfect accuracy will not be
-required because deep learning will be used primarily to prioritize experiments.
-For example, in chemical screening for drug discovery, a deep learning system
-that successfully identifies dozens or hundreds of target-specific, active
-small molecules from a massive search space would have immense practical value
-even if its overall precision is modest. Conversely, the most challenging tasks
-may be those in which predictions are used directly for downstream modeling or
-decision-making, especially in the clinic. `TODO: elaborate more on this idea
-or split in a new paragraph?`
+[@doi:10.1001/jama.2016.17216], tuberculosis [@doi:10.1148/radiol.2017162326],
+and skin lesion [@doi:10.1038/nature21056] classifiers are highly accurate and
+comparable to clinician performance in the latter case. `TODO: more imaging
+examples or other examples that might be at or close to "transformative"?`
+
+In other domains, perfect accuracy will not be required because deep learning
+will be used primarily to prioritize experiments and assist discovery. For
+example, in chemical screening for drug discovery, a deep learning system that
+successfully identifies dozens or hundreds of target-specific, active small
+molecules from a massive search space would have immense practical value even if
+its overall precision is modest.  In medical imaging, deep learning can point an
+expert to the most challenging cases that require manual review
+[@doi:10.1148/radiol.2017162326].
+
+Conversely, the most challenging tasks may be those in which predictions are
+used directly for downstream modeling or decision-making, especially in the
+clinic. In addition, the stochasticity and complexity of biological systems
+implies that for some problems, for instance, predicting gene regulation in
+disease, perfect accuracy will be unattainable.  `TODO: expand this paragraph?`
 
 Even if deep learning in biology and healthcare is not yet transformative today,
-we are extremely optimistic about its future.  We have highlighted numerous
-challenges beyond improving training and predictive accuracy, such as preserving
-patient privacy and interpreting models.  Ongoing research has begun to address
-these problems and shown they are not insurmountable.  Deep learning offers the
-flexibility to model data in its most natural form, spurring creative modeling
-approaches that would be infeasible with other machine learning techniques.
-Unsupervised methods are currently less-developed than their supervised
-counterparts, making them an attractive target for future research in this
-domain.  `TODO: still working on a strong closing line`
+we are extremely optimistic about its future.  Given how rapidly deep learning
+is evolving, its full potential in biomedicine has not been explored.  We have
+highlighted numerous challenges beyond improving training and predictive
+accuracy, such as preserving patient privacy and interpreting models.  Ongoing
+research has begun to address these problems and shown they are not
+insurmountable.  Deep learning offers the flexibility to model data in its most
+natural form `TODO: rephrase`, spurring creative modeling approaches that would
+be infeasible with other machine learning techniques. Unsupervised methods are
+currently less-developed than their supervised counterparts, but they may have
+the most potential. When deep learning algorithms can summarize very large
+collections of input data into interpretable models that spur scientists to ask
+questions that they didn't know to ask, it will be clear that deep learning has
+transformed biology and medicine.
 
 ### Author contributions
 

From 85a567b5494cec2976f4a284d9b923efe34ec861 Mon Sep 17 00:00:00 2001
From: Anthony Gitter <gitter@biostat.wisc.edu>
Date: Wed, 3 May 2017 14:50:05 -0500
Subject: [PATCH 3/5] Rephrasing

---
 sections/07_conclusions.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/sections/07_conclusions.md b/sections/07_conclusions.md
index 2af8d4ad..e04fc151 100644
--- a/sections/07_conclusions.md
+++ b/sections/07_conclusions.md
@@ -2,7 +2,7 @@
 
 Deep learning-based methods now matches or surpasses the previous state of the
 art in a diverse array of tasks in patient and disease categorization,
-fundamental biological study, genomics, and treatment development.  We return to
+fundamental biological study, genomics, and treatment development.  Returning to
 our central question: given this rapid progress, has deep learning transformed
 the study of human disease?  Though the answer is highly dependent on the
 specific domain and problem being addressed, we conclude that deep learning has

From e979cf3e2295ee4ea1397d5397ab7186f9a03f3c Mon Sep 17 00:00:00 2001
From: Anthony Gitter <gitter@biostat.wisc.edu>
Date: Thu, 4 May 2017 09:13:34 -0500
Subject: [PATCH 4/5] Address TODOs and grammar

---
 sections/07_conclusions.md | 44 +++++++++++++++++++++-----------------
 1 file changed, 24 insertions(+), 20 deletions(-)

diff --git a/sections/07_conclusions.md b/sections/07_conclusions.md
index e04fc151..681924ec 100644
--- a/sections/07_conclusions.md
+++ b/sections/07_conclusions.md
@@ -1,16 +1,15 @@
 ## Conclusions
 
-Deep learning-based methods now matches or surpasses the previous state of the
-art in a diverse array of tasks in patient and disease categorization,
-fundamental biological study, genomics, and treatment development.  Returning to
-our central question: given this rapid progress, has deep learning transformed
-the study of human disease?  Though the answer is highly dependent on the
-specific domain and problem being addressed, we conclude that deep learning has
-not *yet* realized its transformative potential or induced a strategic
-inflection point.  Despite its dominance over competing machine learning
-approaches in many of the areas reviewed here and quantitative improvements in
-predictive performance, deep learning has not yet definitively "solved" those
-problems.
+Deep learning-based methods now match or surpass the previous state of the art
+in a diverse array of tasks in patient and disease categorization, fundamental
+biological study, genomics, and treatment development.  Returning to our central
+question: given this rapid progress, has deep learning transformed the study of
+human disease?  Though the answer is highly dependent on the specific domain and
+problem being addressed, we conclude that deep learning has not *yet* realized
+its transformative potential or induced a strategic inflection point.  Despite
+its dominance over competing machine learning approaches in many of the areas
+reviewed here and quantitative improvements in predictive performance, deep
+learning has not yet definitively "solved" those problems.
 
 As an analogy, consider recent progress in conversational speech recognition.
 Since 2009 there have been drastic performance improvements, with error rates
@@ -18,7 +17,7 @@ dropping from more than 20% to less than 6% [@tag:Speech_recognition] and
 finally approaching or exceeding human performance in the past year
 [@arxiv:1610.05256 @arxiv:1703.02136] `TODO: need better source for this error
 trajectory`. The phenomenal improvements on benchmark datasets are undeniable,
-but halving the error rates on these benchmarks did not fundamentally transform
+but halving the error rate on these benchmarks did not fundamentally transform
 the domain.  Widespread adoption of conversational speech technologies will
 require not only improvements over baseline methods but truly solving the
 problem, in this case exceeding human-level performance, as well as convincing
@@ -33,8 +32,7 @@ are now monopolized by deep learning.  In medical imaging, diabetic retinopathy
 [@doi:10.1001/jama.2016.17216], diabetic macular edema
 [@doi:10.1001/jama.2016.17216], tuberculosis [@doi:10.1148/radiol.2017162326],
 and skin lesion [@doi:10.1038/nature21056] classifiers are highly accurate and
-comparable to clinician performance in the latter case. `TODO: more imaging
-examples or other examples that might be at or close to "transformative"?`
+comparable to clinician performance in the latter case.
 
 In other domains, perfect accuracy will not be required because deep learning
 will be used primarily to prioritize experiments and assist discovery. For
@@ -43,13 +41,16 @@ successfully identifies dozens or hundreds of target-specific, active small
 molecules from a massive search space would have immense practical value even if
 its overall precision is modest.  In medical imaging, deep learning can point an
 expert to the most challenging cases that require manual review
-[@doi:10.1148/radiol.2017162326].
+[@doi:10.1148/radiol.2017162326], though the risk of false negatives must be
+addressed.
 
 Conversely, the most challenging tasks may be those in which predictions are
 used directly for downstream modeling or decision-making, especially in the
-clinic. In addition, the stochasticity and complexity of biological systems
-implies that for some problems, for instance, predicting gene regulation in
-disease, perfect accuracy will be unattainable.  `TODO: expand this paragraph?`
+clinic.  For instance, errors in a predicted protein contact map could be
+amplified if that contact map is used directly for 3D structure prediction.  In
+addition, the stochasticity and complexity of biological systems implies that
+for some problems, for instance, predicting gene regulation in disease, perfect
+accuracy will be unattainable.
 
 Even if deep learning in biology and healthcare is not yet transformative today,
 we are extremely optimistic about its future.  Given how rapidly deep learning
@@ -58,8 +59,11 @@ highlighted numerous challenges beyond improving training and predictive
 accuracy, such as preserving patient privacy and interpreting models.  Ongoing
 research has begun to address these problems and shown they are not
 insurmountable.  Deep learning offers the flexibility to model data in its most
-natural form `TODO: rephrase`, spurring creative modeling approaches that would
-be infeasible with other machine learning techniques. Unsupervised methods are
+natural form, for example, longer DNA sequences instead of k-mers for
+transcription factor binding prediction and molecular graphs instead of
+pre-computed bit vectors for drug discovery. These flexible input feature
+representations have spurred creative modeling approaches that would be
+infeasible with other machine learning techniques. Unsupervised methods are
 currently less-developed than their supervised counterparts, but they may have
 the most potential. When deep learning algorithms can summarize very large
 collections of input data into interpretable models that spur scientists to ask

From 934fa5a277723d99f6aa637bac36ba66c74a1e09 Mon Sep 17 00:00:00 2001
From: Anthony Gitter <gitter@biostat.wisc.edu>
Date: Thu, 4 May 2017 09:15:10 -0500
Subject: [PATCH 5/5] Minor rewording

---
 sections/07_conclusions.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/sections/07_conclusions.md b/sections/07_conclusions.md
index 681924ec..45e6639e 100644
--- a/sections/07_conclusions.md
+++ b/sections/07_conclusions.md
@@ -46,7 +46,7 @@ addressed.
 
 Conversely, the most challenging tasks may be those in which predictions are
 used directly for downstream modeling or decision-making, especially in the
-clinic.  For instance, errors in a predicted protein contact map could be
+clinic.  As an example, errors in a predicted protein contact map could be
 amplified if that contact map is used directly for 3D structure prediction.  In
 addition, the stochasticity and complexity of biological systems implies that
 for some problems, for instance, predicting gene regulation in disease, perfect