Skip to content

Releases: CBIIT/INS-ETL

v1.2.0

04 May 16:42
dd1d493
Compare
Choose a tag to compare

Summary

The “Core Project Remodel” of the Index of NCI Studies (INS) is available now at studycatalog.cancer.gov. The
primary focus of this release is to reorganize the data and display of the INS to center around (core) projects
rather than grants, which addresses one of the major improvements identified in the initial release.
As before, we are only including outputs from projects funded at least in part by extramural grants associated
with the Moonshot and Childhood Cancer Data Initiative (CCDI). The task of identifying and defining
programs remains a challenge to be addressed in future releases of the INS.

KEY FEATURES

  1. Redesigned data architecture to organize grants (formerly known as projects within the INS) into groups of grants. Those groups are now called projects and become the focal point of the INS
    Redesigned the Project Details Page to focus on one project with all associated grants and
    outputs displayed. Project descriptions are pulled from the most recently awarded grant,
    except for the Award Amount Total fields, which display summed values from all associated
    grants.
    Added Project Details Page links to all tabs of the Explore Page to quickly find all information for
    a project from any grant or output of that project
    Updated Explore Page filters, Programs Page, Program Details Page, and Global Search to
    reflect the new roles of projects and grants
    Updated documentation pages (About, Glossary, Release Notes)

  2. Added animation and updated the Stats Bar Graph image on the Home Page

  3. Improved data gathering date filters to reduce false positive associations

  4. Added 3 new manually curated CCDI P30 supplement datasets and projects

Data Details

This release includes the following data totals:
• 2 Programs
• 350 Projects
• 1,211 Grants
• 4,748 Publications
• 1,388 Datasets
• 349 Clinical Trials
• 117 Patents

New in this release of INS are several datasets shared due to CCDI P30 grant supplements to the following
organizations:
• Memorial Sloan-Kettering Cancer Center (MSKCC): phs003111.v1.p1 | 3P30CA008748-54S3
Note: this is an additional dataset shared due to a CCDI P30 grant supplement already
associated with another dataset in the INS (phs002620.v1.p1)
• University of Kansas Medical Center: phs002529.v1.p1 | 3P30CA168524-09S3
• University of Southern California: phs002518.v1.p1 | 3P30CA014089-45S6

Note that due to the indirect nature of grant supplements, these 3 projects and their outputs were gathered
and added to the INS through manual curation rather than the standard automated process.
This release includes a new batch of data gathered using the automated processes. It includes improvements
in the preprocessing pipeline to filter out any outputs published before the earliest associated grant was
awarded. This helps to limit false positive associations.

Projects, grants, publications, datasets, and clinical trials included in this release were gathered on April 4,
2023. Due to instability in the external source from which we gather Patents (USPTO API) in April, patents
included in this release were gathered on March 2, 2023.

v1.1.0

24 Apr 13:29
11d1f6a
Compare
Choose a tag to compare

KEY FEATURES

  • Added 4 manually curated CCDI P30 supplement datasets
  • Added buttons for users to download tables as CSV files for easier ad hoc analyses
  • Added INS Glossary and INS Release Notes PDFs to the main navigation menu bar
  • Added “Publication Date” column to Publications tables
  • Removed “NIH Percentile” column from Publications tables
  • Redesigned architecture of Project Detail and Program Detail pages to improve performance and table navigation
  • Many small UI improvements and bug fixes

Data Details

  • This release includes the following data totals:
    2 Programs
    1,041 Projects
    4,098 Publications
    1,119 Datasets
    207 Clinical Trials
    117 Patents

  • New in this release of INS are several datasets shared due to CCDI P30 grant supplement projects to the following organizations:

    1. Memorial Sloan-Kettering Cancer Center (MSKCC): 3P30CA008748-54S3
    2. Oregon Health & Science University (OHSU): 3P30CA069533-22S5
    3. Baylor College of Medicine: 3P30CA125123-14S6
    4. University of California, San Francisco (UCSF): 3P30CA082103-21S9

Note that due to the indirect nature of grant supplements, these 4 projects and their outputs were gathered
and added to the INS through manual curation rather than the standard automated process.

The data in this release uses the same automated data gathering output as previous releases of INS
(09/26/2022 for Projects, Publications, and Patents; 05/26/2022 for Datasets and Clinical Trials) but improves
the curation to remove duplicates and other minor display issues. As mentioned above, it also includes 4
manually curated CCDI projects and their datasets.

v1.0.1

22 Nov 18:20
6749fe4
Compare
Choose a tag to compare

Bug fix

  • Fix the Neo4j query on Project detail pages so that it won't cause an infinite loading screen or timeout error on the project detail page.

Disclaimer

  • Add urgent ODS disclaimer to the Home page and About page.

v1.0.0

14 Oct 18:07
e84ae2f
Compare
Choose a tag to compare

MVP release of INS project.

This pilot release was based on Bento framework and adjusted to include customized home page and about page. The pilot phase of the site (https://studycatalog.cancer.gov/#/) is temporarily limited to data generated through extramural grants from two NCI programs.