Skip to content

Latest commit

 

History

History
70 lines (47 loc) · 6.33 KB

response.md

File metadata and controls

70 lines (47 loc) · 6.33 KB

Dear Prof. Ouellette,

Thank you for providing us the opportunity to revise our manuscript. We are grateful for the reviewers' comments and suggestions on how to improve our manuscript and have modified the text accordingly.

Reviewer 1

The authors have specified the importance of using Deep Learning in Biological data and on what are the aspects they have to focus when the researchers are using the Deep Learning. Even though they have mentioned ten different areas they have to focus the authors did not mentioned enough suitable examples and some common practices for some areas. Further it is important to mention some of the difficulties and limitations they have to face when they are using these methods.

Specially need to identify what sort of the data need to be there and how to react when there is not enough data to carry out Deep Learning technique. In the paper some area not contain precise information with relevant references to support the statements. Hence these areas need to improve it further.

We appreciate the constructive feedback and added additional references and clarifications where we thought it was needed. For instance, we provided six more references and an example for data augmentation and weak supervised learning via the following paragraph:

Methods that try to increase the amount of training data, such as data augmentation (in which existing data is slightly manipulated in an attempt to yield "new" samples) [@doi:10.3390/info11020125] and weak supervision (in which simple labeling heuristics are combined to produce noisy, probabilistic labels) [@arxiv:1605.07723v3] are valuable for small- to medium-scale dataset. For example, when classifying microalgae using models trained on 21,000 images, data augmentation improved the prediction accuracy by 17% [@doi:10.1109/ICMLA.2017.0-183]. In a text detection context based on a small dataset of 229 fully annotated scene text images, weakly supervised learning improved the precision by 11% [@doi:10.1109/ICCV.2017.166]. However, methods cannot overcome substantial data shortages in many practical scenarios, and recent research investigating machine learning methods in neuroimaging studies of depression suggests that high prediction accuracies obtained from small datasets may be caused by misestimation due to insufficient test dataset sizes [@doi:10.1038/s41386-021-01020-7].

Similarly we provided a further example to begin the subsequent paragraph:

In the fields of computer vision and natural language processing, deep neural networks are routinely trained on sample sizes ranging from hundreds of thousands to millions of training examples [@doi:10.1109/ICCV.2017.97; @doi:10.18653/v1/2020.acl-main.747].

Additionally, we have added additional references in our section on when to use deep learning and an additional guide to best practices in the introduction.

Reviewer 2

The article titled “Ten Quick Tips for Deep Learning in Biology” provides a review of potential challenges to employing deep learning in biomedical research. This well written article is of importance at a time when deep learning is broadly viewed as a magic solution to many prediction and modelling problems. It cautions the inexperienced and reminds the experienced researcher that while DL has led to remarkable advances in certain applications, it is far from guaranteed to succeed in a broad range of biomedical applications for a gamut of reasons, can lead the researcher astray, be time consuming to develop, and result in unexpected ethical risks. While the article is well written, easy to follow and of interest to the readership, a few points for improvement are worth considering prior to publication:

Major comments

Thank you for this suggestion. We have added this sentence to point 10:

"Other methods, such a distributed learning, in which small subsets of the data are processed independently in silos (possibly by different agents), are also promising but requrie careful investigation before applying to protected information [@doi:10.1016/j.compbiomed.2021.104716]."

  • I think it might be worth to further emphasis how results should be evaluated in the presence of class imbalance and risk/benefits of incorrect/correct predictions in the context of the specific application.

On a similar note, it is mentioned that AUCROC may not be reliable, but consider recommending alternatives such as precision-recall curves.

We appreciate the suggestion to include an alternative and have added the follwing sentence: "One alternative approach is to use the precision-recall curve rather than the receiver operating characteristic since the former is more resistant to imbalanced data [@doi:10.1145/1143844.1143874]."

Minor comments:

  • Lines 153: those though

We have corrected this typo.

  • Line 183 is missing a period

We have added the missing period.

  • Paragraph starting at line 271: This paragraph is cumbersome and contains jargon that is not introduced (CUDA/CuDNN). Is it implied that implementation (hardware and software) variations between platforms can hinder reproducibility? Or perhaps the implementation is reproducible, so long that random seeding is not used?

The paragraph now reads:

The use of non-deterministic algorithms, often by default, is a specific reproducibility pitfall that is often missed in applying deep learning. For example, GPU acceleration libraries like CUDA/CuDNN, which facilitate the parallelized computing powering state-of-the-art deep learning, often use default algorithms that produce different outcomes from iteration to iteration even with the same hardware and software. Achieving reproducibility in this context requires explicitly specifying the use of deterministic algorithms, which are typically available within deep learning libraries [@url:https://docs.nvidia.com/deeplearning/sdk/cudnn-developer-guide/index.html#reproducibility]. This step is distinct from and in addition to the setting of random seeds that typically achieve reproducibility by controlling pseudorandom deterministic procedures such as shuffling and initialization.

We hope these changes improve the quality of the manuscript and satisfy the concerns of the reviewers.

Sincerely,
Benjamin Lee, on behalf of all authors