Skip to main content

Program Snapshot 

The Big Data to Knowledge (BD2K) program has transitioned from Common Fund support. Common Fund programs are strategic investments that achieve a set of high-impact goals within a 5-10 year timeframe. At the conclusion of each program, deliverables transition to other sources of support or use within the scientific community.

Please note that since the BD2K program is no longer supported by the Common Fund, the program website is being maintained as an archive and will not be updated on a regular basis.

The Big Data to Knowledge (BD2K) program supports the research and development of innovative and transformative approaches and tools to maximize and accelerate the utility of big data and data science in biomedical research. As biomedical tools and technologies rapidly improve, researchers are producing and analyzing an ever-expanding amount of complex data called “big data.” Extracting useful knowledge from big data is a major limiting factor to understanding health and disease. BD2K is facilitating data-driven discovery by improving our ability to harvest the wealth of information contained in biomedical big data.

The BD2K program has addressed some major big data and data science challenges including lack of appropriate tools, poor data accessibility, and insufficient training in biomedical data science. It facilitated broad use of biomedical big data, developed and disseminated analysis methods and software, enhanced training relevant for large-scale data analysis, and supported efforts toward making data sets “FAIR” Findable, Accessible, Interoperable, and Reusable (link is external). The tools, methods, and educational resources developed by BD2K are made available to the research community, and are informing the NIH Strategic Plan for Data Science, which provides a roadmap for modernizing the NIH-funded biomedical data science ecosystem.

BD2K Achievements

Learn more about the Big Data to Knowledge program's achievements. 

BD2K Achievements Infographic

BD2K Training Coordination Center

Word cloud demonstrating topics from BD2K training courses

Program Reports:

BD2K Behavioral and Social Sciences (BSS) and Big Data Workshop Executive Summary
On March 9-10, 2018 the NIH Common Fund sponsored the BD2K BSS Big Data Workshop, bringing together BSS researchers with computational big data and informatics researchers to encourage cross-disciplinary discussion and collaboration.

Examining existing data science and environmental health science (EHS) resources to address EHS-specific training goals in Data Science
On August 14-15, 2018 The National Institutes of Environmental Health Sciences (NIEHS) hosted a Data Science Training Workshop with a session relating BD2K accomplishments to environmental health science (EHS) training goals. The goal of this session was to examine existing training resources relevant to the intersection of EHS and data science, including existing data science training resources.

BD2K Training Report
Learn about BD2K's investments in biomedical data science training from fiscal years 2014-16 in this report.


This page last reviewed on March 22, 2024