Objectives of the first Workshop (Jan '21)

The theme of the current workshop is ``Building Convergence''.

Specific objectives of the workshop include:

  • Bring computational scientists and pathologists together
  • Computational scientists introduced to pathology data sets and state-of-the-art tools and workflows
  • Pathologists introduced to cutting edge trends in data science, AI, and data management
  • Breakout groups to identify gaps in state-of-the-art computational pathology workflows, datasets
  • Discuss plans for outreach in both academia and industry
  • Identify research thrusts

Focus areas of the workshop include:

  • Digital Pathology and Imaging
  • Machine Learning and Artificial Intelligence
  • Human-Machine Teaming
  • Big Data Management
  • High-Performance Computing
  • Outreach and Industry Connections


Program

All times in Eastern Standard Time (EST)

Day 1 (Tuesday - Jan 26)

12:00 - 12:15 PM

Welcome and Opening, Anil Parwani and DK Panda, The Ohio State University, USA

12:15 - 1:00 PM

Keynote Talk #1

Speaker: Ulysses Balis, University of Michigan, USA

Title: Supporting the NIDDK Kidney Precision Medicine Project (KPMP): Standing up the U-M Pathology AI / Data Visualization Center and Core Lab - Five years in retrospect [slides] [Video]

Abstract: The NIDDK Kidney Precision Medicine Project (U2C mechanism) is a multi-institutional consortial effort to generate further biological understanding of acute kidney injury and chronic kidney disease, through a novel discovery model that includes actual study participants (who serve concurrent roles as being both study subjects and patients at the same time), with the goal of elucidating new renal cells types, regulatory & disease mechanisms, and therapeutic targets, in the process of developing a multi-axial atlas of combined omics, single cell, and histology data. The Data Visualization Center (DVC) at the University of Michigan plays a central role in the creation of the needed tools for this effort, with one thrust being the development of a Whole Slide Image computational discovery suite, known as VIPR Studio. This presentation will provide an overview of this tool suite, the UM Pathology Imaging Core Lab and how it these efforts are shaping the KPMP effort.

1:00 - 1:30 PM

Invited Talk #1

Speaker: Michael Becich, University of Pittsburgh, USA

Title: How Computational Pathology is Improving Predictive Analytics? [slides] [Video]

Abstract: With the FDA approval of whole slide imaging (WSI) and digital pathology workflows as part of Pathology practice there is a significant gap in infrastructure for supporting translational research in cancer centers. Pathology Informatics is key to supporting deep interrogation of WSI and allowing for multi-dimensional studies combining multiplexed immunohistochemical marker studies of the tumor microenvironment with the rich genomic data provided by single cell genomic studies. Modern tissue banking protocols coupled with diagnostic workflows in surgical pathology of patient’s tumors samples are key to supporting both computational pathology and predictive analytics. We will discuss an approach that incorporates multiple sources of pathology data and clinically actionable knowledge to improve Pathology decision making via predictive analytics fueled by advanced machine learning and causal discovery.

1:30 - 2:00 PM

Invited Talk #2

Speaker: Anant Madabhushi, Case Western Reserve University, USA

Title: Digital Pathomics - An Alternative to Deep Learning to Prognosticating Disease Outcome [slides] [Video]

Abstract: With the advent of digital pathology, there is an opportunity to develop computerized image analysis methods to not just detect and diagnose disease from histopathology tissue sections, but to also attempt to predict risk of recurrence, predict disease aggressiveness and long term survival. At the Center for Computational Imaging and Personalized Diagnostics, our team has been developing a suite of image processing and computer vision tools, specifically designed to predict disease progression and response to therapy via the extraction and analysis of image-based “histological biomarkers” derived from digitized tissue biopsy specimens. These tools would serve as an attractive alternative to molecular based assays, which attempt to perform the same predictions. The fundamental hypotheses underlying our work are that: 1) the genomic expressions detected by molecular assays manifest as unique phenotypic alterations (i.e. histological biomarkers) visible in the tissue; 2) these histological biomarkers contain information necessary to predict disease progression and response to therapy; and 3) sophisticated computer vision algorithms are integral to the successful identification and extraction of these biomarkers. We have developed and applied these prognostic tools in the context of several different disease domains including ER+ breast cancer, prostate cancer, Her2+ breast cancer, ovarian cancer, and more recently medulloblastomas. For the purposes of this talk I will focus on our work in breast, prostate, rectal, oropharyngeal, and lung cancer.

2:00 - 2:30 PM

Invited Talk #3

Speaker: Jason Hipp and Khan Baykaner, AstraZeneca

Title: How Digital Pathology will Transform Drug Development and Cancer Diagnosis?

Abstract: Digital pathology, with the advent of modern deep learning techniques, promises to revolutionise drug development and cancer diagnosis. Traditional pathology requires highly trained individuals to make judgements based on time-constrained analyses of extremely rich and complex data. AI and automation are ideally suited to such tasks; whereas it is expensive and difficult to scale-up the population of highly trained pathologists, it is cheap and easy to scale-up digital computation. Modern deep learning techniques have shown amazing advances across domains such as natural image processing, NLP, and speech recognition. In this talk Drs Jason Hipp and Khan Baykaner will explain how these techniques can be applied to pathology and drug development, and the way in which this unlocks previously unavailable analyses.

2:30 - 3:00 PM

Invited Talk #4

Speaker: Jeffrey Goldstein, Northwestern, USA

Title: Machine learning for placental pathology: Why, How, and What Now? [slides] [Video]

Abstract: The placenta is the first organ to form in human development and acts as the lungs, gut, and kidney in fetal life. Most adverse pregnancy outcomes are associated with abnormalities in the placenta - abnormalities that may have life-long consequences for the mother or infant. Yet, <20% of placentas delivered in the U.S. receive a pathologic examination. Among pathologists, there is a wide range of capability and high inter-observer variability. Machine learning and digital pathology offer the possibility of making placenta examination universally available and increasing consistency. As a demonstration of the potential utility of machine learning in placental pathology, we trained a model to estimate gestational age from the microscopic appearance of placental villi. Gestational age is the single most important variable in neonatal survival and abnormalities in the appearance of villi - either appearing from a later (accelerated maturation) or earlier (delayed maturation) gestational age - are associated with the pregnancy complications of pre-eclampsia and diabetes, among others. We scanned slides from 154 patients representing a range of gestational ages from 24 week, the threshold of viability, to 42 weeks, after the expected time of delivery (37-40 weeks). To simulate gestalt formation by pathologists, we trained a model using 16 high-power fields at a time, scattered across a single slide. The model contains an attention subnetwork that weights each high-power field before generating an average. Using the attention and aggregation, we showed a high accuracy. We tested the utility of this model in estimating gestational age from un-annotated whole slide images, with high accuracy. This work shows the utility of machine learning in placental pathology. One use case could be providing an estimated gestational that pathologists can use to improve interrater reliability in their diagnoses of appropriate / delayed / accelerated maturation. This use case highlights key challenges, including inter-institutional slide variability, slide artifacts, differences in scanners or cameras, pathologist disagreements about regions of interest, and interpretability. Robust solutions to these problems will increase the utility of AI and the digital pathology field.

3:00 - 3:45 PM

Breakouts and Discussion

Breakout #1 Digital Pathology and Imaging, Breakout leaders: Jennifer Picarsic (Cincinnati Children) and Jeffrey Goldstein (Northwestern)

Breakout #2 Machine Learning and Artificial Intelligence, Breakout leaders: Michael Becich (Pittsburgh) and Anant Madabhushi (CWRU)

3:45 - 4:05 PM

Reporting from two breakout sessions (10 mins each)

4:05 - 4:15 PM

Closing for Day 1 and Plans for Day2 and Day 3, Anil Parwani and DK Panda, The Ohio State University, USA

Day 2 (Wednesday - Jan 27)

12:00 - 12:15 PM

Updates from Day 1 and Day 2 Overview, Anil Parwani and DK Panda, The Ohio State University, USA

12:15 - 1:00 PM

Keynote Talk #2

Speaker: Raghu Machiraju, The Ohio State University, USA

Title: Highlighting Challenges for Machine Learning in the Pathology Clinic through Specific Use Cases [slides] [Video]

Abstract: In this talk, I will provide case studies drawn from both well-characterized and not-so-well-studied cancers of the breast, the thyroid, the prostate and sarcomas in general. Each case study will address the reasons as to why deploying ML is hard given the intrinsic complexity and a well-acknowledged paucity of training data. In the same vein, through appropriate demonstrations, I will illustrate the difficulties in gathering training data from whole slide images. Further, these case studies will be examined for their inability to make robust and interpretable predictions.

1:00 - 1:30 PM

Invited Talk #5

Speaker: Eric Fossler-Lussier, The Ohio State University, USA

Title: Understanding how to Understand Teammates [slides] [Video]

Abstract: Language is perhaps our most fundamental communication medium, and creating natural interactions within human-machine teaming can only enhance the collaborative efforts of all partners in the team. However, language is often imprecise, relying on inference and context to resolve ambiguities, which can be difficult in the face of a sparse data landscape. This talk has two parts: the first part will be a high-level discussion of the challenges that come with situated, teamed language understanding and generation in human-machine teaming systems. The second part discusses some of our work in mitigating issues of uncertainty, error, and sparseness in understanding for spoken dialogue systems, drawing on examples from a Virtual Patient system used to train medical students.

1:30 - 2:00 PM

Invited Talk #6

Speaker: Srinivasan Parthasarathy, The Ohio State University, USA

Title: Stochastic Flow Clustering: Consolidation, Renewed Bearing and Applications to Image Segmentation i [slides] [Video]

Abstract: Since its introduction in the late nineties, the idea of Markov Clustering, a graph clustering approach based on the principle of simulating stochastic flows (random walks) has seen wide use. In the first part of this talk I will review this basic idea and then describe several principled enhancements to this approach that in turn improve the quality (via regularization, and the accommodation of overlapped clustering) and speed (via sparsification, and a multi-level mechanism) of such stochastic flow algorithms so that they can be deployed on big data problems. I will subsequently discuss ongoing efforts on leveraging these ideas for human-in-the-loop image analysis.

2:00 - 2:30 PM

Invited Talk #7

Speaker: Jose Otero, The Ohio State University, USA

Title: Providing molecular insight for resource-strained patients with machine learning-based workflows [slides] [Video]

Abstract: The advent of molecular testing has improved medical decision making by increasing the precision in which neuropathological diagnoses predict patient outcomes. However, many molecular tests, even those recommended by international clinical guidelines, are often not reimbursed by health insurance providers. To save costs, tests which increase the pretest probability of molecular testing will prevent waste, lower costs, and accelerate the delivery of information needed for treatment plan generation. As a test case, we evaluated the diagnostic workflow for oligodendroglioma, a well-studied WHO grade II neoplasm with a prognosis relatively favorable astrocytoma. The distinction in between these entities is based on a chromosomal deletion in chromosomes 1p and 19q. This test can be performed by fluorescent in situ hybridization (FISH), a reimbursable test with a false positive rate of ~ 5%, or by chromosomal microarray, a test considered a gold standard but for which reimbursement is not possible. Our clinical decision-making challenge was to determine when we should order the non-reimbursable test in the setting of a positive FISH result. We applied Bayesian probability, information theory, and dimensionality reduction techniques to identify and compare the utility of distinct assays in improving medical decision-making.

2:30 - 3:00 PM

Invited Talk #8

Speaker: Hamid Tizhoosh, University of Waterloo, Canada

Title: BERT - Transformers, NLP, and Pathology Reports [slides] [Video]

Abstract: Pathology whole slide images, and microscopic snapshots in some instances, are always accompanied with diagnostic report. These reports contain details about the visual inspection of the specimen by the pathologist. Such text documents may be structured or unstructured. Pathologists write these reports based on special knowledge and by understanding the intricacies of tissue to establish a diagnosis, sometimes in conjunction with information from other modalities. Natural language processing (NLP) and deep learning provide us with new tools to exploit the expert knowledge embedded in the pathology reports in order to connect it with the visual information from whole-slide images. In this talk, we review some NLP approaches to shed light on their potentials. BERT (Bidirectional Encoded Representations of Transformers) is a recent development in this field that is based on the concept of transformer, a concept that builds on attention and avoids recurrent structures to learn language embeddings. A case study will be discussed to show the challenges and potential benefits of using NLP schemes like BERT in pathology.

3:00 - 3:45 PM

Breakouts and Discussion

Breakout #3 Human-Machine Teaming, Breakout leaders: Eric Fossler-Lussier (OSU) and Jose Otero (OSU)

Breakout #4 Big Data Management, Breakout leaders: Srinivasan Parthasarathy (OSU) and Hamid Tizhoosh (Waterloo)

3:45 - 4:05 PM

Reporting from two breakout sessions (10 mins each)

4:05 - 4:15 PM

Closing for Day 2 and Plans for Day 3, Anil Parwani and DK Panda, The Ohio State University, USA

Day 3 (Thursday - Jan 28)

12:00 - 12:15 PM

Updates from Day 2 and Day 3 Overview, Anil Parwani and DK Panda, The Ohio State University, USA

12:15 - 1:00 PM

Keynote Talk #3

Speaker: Lee Cooper, Northwestern, USA

Title: Charting a Future Course for Computational Pathology [slides] [Video]

Abstract: There have been significant advances in both machine learning (ML) and digital pathology over the past decade, and the performance of ML in pathology applications suggests clinical utility in the not-so-distant future. In this talk I will discuss recent history of ML for pathology imaging, describe limitations of current approaches based on deep learning, and discuss challenges that need to be overcome to increase clinical translation. These challenges include scaling data labeling and annotation, creating realistic validation datasets that can reveal weaknesses in ML models, and the creation of shared data resources.

1:00 - 1:30 PM

Invited Talk #9

Speaker: Chakra Chennubhotla, University of Pittsburgh, USA

Title: Explainable AI (xAI) for Anatomic Pathology [slides] [Video]

Abstract: Pathologists are adopting whole slide images (WSIs) for diagnosis, thanks to recent FDA approval of WSI systems as class II medical devices. In response to new market forces and recent technology advances outside of pathology, a new field of computational pathology has emerged that applies artificial intelligence (AI) and machine learning algorithms to WSIs. Computational pathology has great potential for augmenting pathologists’ accuracy and efficiency, but there are important concerns regarding trust of AI due to the opaque, black-box nature of most AI algorithms. In addition, there is a lack of consensus on how pathologists should incorporate computational pathology systems into their workflow. To address these concerns, building computational pathology systems with explainable AI (xAI) mechanisms is a powerful and transparent alternative to black-box AI models. xAI can reveal underlying causes for its decisions; this is intended to promote safety and reliability of AI for critical tasks such as pathology diagnosis. This talk outlines xAI enabled applications in anatomic pathology workflow that improves efficiency and accuracy of the practice. In addition, we describe HistoMapr-Breast, an initial xAI enabled software application for breast core biopsies. HistoMapr-Breast automatically previews breast core WSIs and recognizes the regions of interest to rapidly present the key diagnostic areas in an interactive and explainable manner. We anticipate xAI will ultimately serve pathologists as an interactive computational guide for computer-assisted primary diagnosis.

1:30 - 2:00 PM

Invited Talk #10

Speaker: Mike Montalto, PathAI, USA

Title: AI-Based Pathology in Clinical Stage Biopharmaceutical Drug Development [slides] [Video]

Abstract: Pathology-based biomarkers play an important role along the continuum of early and late phase biopharmaceutical drug development including indication selection, pharmacodynamics, patient selection and stratification. In the era of precision medicine, traditional microscope-based pathology may fall short of the needed precision, reproducibility and accuracy that are needed to enhance the technical and regulatory success of new drug applications. This talk will explore the application of machine learning and artificial intelligence in pathology to the clinical drug development paradigm. Real-life examples will be reviewed in oncology and non-oncology therapeutic areas where AI-base pathology provides superior performance over traditional microscope methods. Regulatory considerations of using AI-based pathology in drug development will also be discussed.

2:00 - 2:30 PM

Invited Talk #11

Speaker: Mohammad Salama, Mayo Clinic, USA

Title: AI Promise to the Practice of Hematopathology [slides] [Video]

Abstract: Hematopathologists are increasingly using digital imaging tools for a wide spectrum of practice settings. However, digital imaging associated with artificial intelligence applications for effective learning and diagnosis rendering are not yet routinely incorporated in practice. We will share our experience in utilizing digital tools and will demonstrate methods and applications of digital imaging along with augmented human intelligence to effectively improve the workflow in the practice of hematopathology. We will cover the essential elements as well as the pitfalls, advantages, challenges and opportunities in utilization of digital tools in practice.

2:30 - 3:00 PM

Invited Talk #12

Speaker: Hari Subramoni, The Ohio State University, USA

Title: High-Performance Deep Learning with Large Pathology WSI Images [slides] [Video]

Abstract: This talk will focus on high-performance computing (HPC) technologies, techniques, and solutions to accelerate various operations associated with computational pathology. In the distributed Machine Learning (ML) training regime, data-parallelism has become an established paradigm to train ML models that fit inside GPU memory on large-scale HPC systems. However, model-parallelism is required to train very large whole slide pathology images and associated models that will not fit in the memory available on modern GPU platforms. We deal with emerging requirements brought forward by such very large models and whole-slide-images (WSIs) being trained using high-resolution images common in digital pathology. To address these, we propose, design, and implement GEMS; a GPU-Enabled Memory-Aware Model-Parallelism System. We present several design schemes that offer excellent speedups over state-of-the-art systems like Mesh-TensorFlow and FlexFlow. Our designs enable the training of large ML models on large patches of histopathology images and decrease the overall training time. The lower training time allows the deep learning researcher to train bigger and better models, thereby increasing existing designs' accuracy. We show the potential of our proposed designs by training a custom ResNet-110-v2 on image tiles of size 1,024 x 1,024 and reducing the training time from 1 day to 28 minutes for the real-world histopathology WSI of 100,000 x 100,000 pixels.

3:00 - 3:45 PM

Breakouts and Discussion

Breakout #5 High-Performance Computing, Breakout leaders: Mike Montalto (PathAI) and Hari Subramoni (OSU)

Breakout #6 Outreach and Industry Connections, Breakout leaders: Mohammad Salama (Mayo Clinic) and Chakra Chennubhotla (Pittsburgh)

3:45 - 4:05 PM

Reporting from two breakout sessions (10 mins each)

4:05 - 4:15 PM

Closing for Day 3 and Plans for Workshop #2, Anil Parwani and DK Panda, The Ohio State University, USA