Back to the top

DATA SCIENCE HUB

This dedicated effort aims to maximize the discovery potential — and long-term value — of data generated across Break Through Cancer’s TeamLab projects. By spurring creation of robust tools for research data gathering and analysis, the Hub also aims to accelerate discovery at our collaborating institutions and in the global cancer research community.

This dedicated effort aims to maximize the discovery potential — and long-term value — of data generated across Break Through Cancer’s TeamLab projects. By spurring creation of robust tools for research data gathering and analysis, the Hub also aims to accelerate discovery at our collaborating institutions and in the global cancer research community.

PROJECT HIGHLIGHTS

  • Ensure that all our scientific and clinical projects yield scientifically robust and technically reproducible findings — and that the resulting data are broadly accessible over the long term, thereby breaking through traditional research data silos.
  • Expand computational discovery by developing and applying wholly new analytic methods and by integrating data sets across our projects.
  • Create and adapt algorithms and methods necessary to make best use of data generated with new and emerging research technologies.
  • Execute integrated, pan-cancer analysis that enable disease-specific findings to be explored in the context of other disease types.
  • Provide a unique collaborative framework for training and mentoring future leaders in computational biology and for co-developing new technical approaches across laboratories and institutions.
  • Guided by world-class experts in cancer data science with expertise in creating large scale data infrastructures, developing computational algorithms and methods, and deploying widely used software tools and databases on a global scale.

MEET THE TEAM

The Data Science Hub creates synergies and new opportunities for Break Through Cancer projects and collaborating researchers at five leading clinical and research centers.
We invite you to learn about the institutions and individual investigators driving this important research and development project.

AllDana-Farber Cancer InstituteMemorial Sloan Kettering Cancer CenterMIT’s Koch Institute for Integrative Cancer ResearchThe Sidney Kimmel Comprehensive Cancer Center at Johns HopkinsThe University of Texas MD Anderson Cancer Center

MEET THE TEAM

The Data Science Hub creates synergies and new opportunities for Break Through Cancer projects and collaborating researchers at five leading clinical and research centers.
We invite you to learn about the institutions and individual investigators driving this important research and development project.

View Team
AllDana-Farber Cancer InstituteMemorial Sloan Kettering Cancer CenterMIT’s Koch Institute for Integrative Cancer ResearchThe Sidney Kimmel Comprehensive Cancer Center at Johns HopkinsThe University of Texas MD Anderson Cancer Center

PROJECT SUMMARY

Break Through Cancer’s projects will generate large and varied datasets, reflecting work at multiple institutions, across different scales, using a variety of technologies and tissue types. The kinds of data range, for example, from clinical parameters and single cell multi-omics to high resolution spatial profiling and medical imaging. Achieving robust, reproducible research findings across this complex data stream, and ensuring long-term access to high volumes of multifaceted data it will yield, requires substantial technical infrastructure, expert data analysis, and strong data governance.

Break Through Cancer also has a unique opportunity to expand computational discovery by developing and applying wholly new analytic methods for the data our projects produce, and by integrating data sets across the diseases we are studying. For example, meta analyses of our projects could focus on cancer evolution and cellular plasticity, spatial determinants of the tumor microenvironment, and cancer-immune-stromal cell interactions. But pursuing these novel computational approaches to discovery will depend on a rigorous application of data science principles and systems from the projects’ inception.

The Data Science Hub will conceptualize, maintain, and—where necessary— create the technical resources necessary to pursue this ambitious vision for leveraging data created in Break Through Cancer’s projects. The Hub will also provide a unique collaborative framework for training and mentoring future leaders in computational biology, for offering opportunities for co-development of new technical approaches across laboratories and institutions, and for establishing enduring professional networks.

The Hub is pursuing six primary aims:

  • Implementing bioinformatics best practices, tools, and pipelines—in order to enhance Break Through Cancer projects’ efficiency and interoperability, address their considerable analytical needs, and enable end-to-end reproducibility of findings. This process will include harmonizing metadata collection, data generation and processing, and quality control, as well as applying standards for data quality assessment.
  • Creating a robust, cloud-based Data Science and Data Governance Platform that will support standardized pipelines, ensure secure data collaborations, and offer effective data governance across collaborating institutions — while protecting patient privacy and satisfying security and regulatory constraints.
  • Advancing algorithms and methods necessary to make best and fullest use of data generated with new and emerging technologies such as multiplexed spatial imaging and spatial transcriptomic and proteomic analysis. Given that novel statistical and machine learning models for these data types are still in development, the Hub will pursue a targeted effort to develop new computational methods — prioritizing tools for three interconnected areas: cancer evolution and cellular plasticity; spatial determinants of microenvironments; and tumor cell immune cell interactions.
  • Enabling robust statistical evaluation of existing and emerging data analysis tools — allowing for continuous monitoring of performance and early detection of data anomalies, code issues, or problematic assumptions. Toward this aim, the Hub will deliver production grade software for running new tools, benchmarking datasets for all data types, and visualization software tools to evaluate performance and generate interpretable results.
  • Executing integrated, pan-cancer analysis across all Break Through Cancer-funded projects. By harnessing, for example, the multi-omics datasets generated by all TeamLabs, these integrated analyses can maximize insights from individual projects and enable disease-specific findings to be explored in the context of other disease types. Ultimately, the integrated data analyses the Hub create holds the potential to drive a better understanding of therapy resistance, tumor-immune interaction, the mechanisms underlying cancer immune evasion, and tumor evolution.
  • Creating a Break Through Cancer Data Science Network that will forge close and ongoing connections among scientists at participating institutions, and will support ongoing development of the broader cancer data-science community. Not only will the Network engage computational scientists at multiple institutions in generating cutting-edge algorithms, it will also create a data portal accessible to cancer researchers globally, host events that advance community building and methods development, and offer unique learning opportunities for cancer data science trainees.

MAKE A DIFFERENCE

Break Through Cancer was created in February 2021 with an extraordinary matching gift of $250,000,000. Every gift to the Foundation supports groundbreaking cancer research and helps us to meet our matching commitment.

For questions about giving please email Lisa Schwarz, Chief Philanthropy Officer at LMS@BreakThroughCancer.org

Break Through Cancer
101 Rogers Street
Suite 3A
Cambridge, MA 02142
Info@breakthroughcancer.org
1-800-757-9881