tanaixin1/Datathon-2023

 
 

Datathon 2023

Department of Statistics and Actuarial Science @ U Iowa

Roadmap:

Problem Statement

Prize Categories and Judging Criteria

Presentation Day

Problem Statement

Strategic Screening: Developing a Predictive Model for Chronic Hepatitis B Infection Using Reliable Patient Data

Chronic Hepatitis B virus (HBV) infection can lead to cirrhosis, liver cancer, and liver failure. While treatable when detected early, many patients with chronic HBV infection remain asymptomatic and undiagnosed until severe liver damage occurs. By the time of diagnosis, the opportunity for effective treatment may have passed.

The implementation of HBV vaccination for U.S. children in 1991 has been highly effective in preventing the infection. However, despite the availability of these vaccines, there has been a stagnation in the reduction of HBV infections, and hepatitis-related deaths have risen. This may be partially attributable to immigration from regions with low vaccination rates or to instances where parents opt out of vaccinating their children.

One objective of the U.S. National Viral Hepatitis Action Plan is to reduce mortality and improve health outcomes for individuals with viral hepatitis. The Action Plan states that expediting the diagnosis, care, treatment, and potential cure of those with chronic viral hepatitis is crucial for reducing associated mortality and enhancing patient health.

Blood tests can determine a patient's history of HBV immunity and infection. The presence of antibodies to the hepatitis B core antigen (anti-HBc) signifies a past HBV infection. Current chronic infection is indicated by the hepatitis B surface antigen (HBsAg), while the hepatitis B surface antibody (anti-HBs) demonstrates immunity acquired through vaccination.
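As a starting point for labeling outcomes, the marker descriptions above can be sketched as a small lookup. This is a simplified illustration and not clinical guidance: the `Serology` class, the `interpret` function, and the order in which markers are checked are assumptions made for this example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Serology:
    """Hypothetical container for the three blood test markers."""
    hbsag_positive: bool      # hepatitis B surface antigen (HBsAg)
    anti_hbc_positive: bool   # antibody to the core antigen (anti-HBc)
    anti_hbs_positive: bool   # surface antibody (anti-HBs)


def interpret(s: Serology) -> str:
    """Map markers to a coarse status label.

    Assumption: HBsAg is checked first, then anti-HBc, then anti-HBs.
    """
    if s.hbsag_positive:
        return "current chronic infection"
    if s.anti_hbc_positive:
        return "past infection"
    if s.anti_hbs_positive:
        return "immunity through vaccination"
    return "no marker detected"


print(interpret(Serology(False, False, True)))
```

For a model of chronic infection, the binary target would then be whether the label equals "current chronic infection".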

To achieve the Action Plan's objective, an extensive effort to collect and screen blood samples for chronic HBV infection could be undertaken, but this method would be costly. A more efficient strategy would be to pinpoint individuals at elevated risk for chronic HBV infection using readily available demographic and medical data, allowing health care providers to direct these high-risk individuals towards blood testing.

Use data from NHANES (the National Health and Nutrition Examination Survey) to develop this predictive model for chronic HBV infection risk. In a clinical context, such a model would primarily serve patients who are either unvaccinated or unsure of their vaccination status. Additionally, the model might exclude patients who have been previously diagnosed with an HBV infection by a health care professional. Can your estimates accurately reflect these specific populations?
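One way to restrict the analysis to this population is sketched below. The field names (`vaccinated`, `told_has_hbv`, `hbsag_positive`, `weight`) are hypothetical placeholders that would need to be mapped to the actual NHANES variables in the codebook, and the four rows are made-up toy data. The sketch assumes each respondent carries a survey weight and covers only a weighted point estimate, not its variance.

```python
from dataclasses import dataclass
from typing import Iterable, Optional


@dataclass(frozen=True)
class Respondent:
    vaccinated: Optional[bool]  # None means unsure of vaccination status
    told_has_hbv: bool          # previously diagnosed by a professional
    hbsag_positive: bool        # outcome: current chronic infection
    weight: float               # survey weight (assumed available)


def in_target_population(r: Respondent) -> bool:
    """Keep unvaccinated or unsure respondents with no prior diagnosis."""
    return r.vaccinated is not True and not r.told_has_hbv


def weighted_prevalence(rows: Iterable[Respondent]) -> float:
    """Weighted share of HBsAg-positive respondents in the target group."""
    target = [r for r in rows if in_target_population(r)]
    total = sum(r.weight for r in target)
    if total == 0:
        raise ValueError("no respondents in the target population")
    positive = sum(r.weight for r in target if r.hbsag_positive)
    return positive / total


# Toy data for illustration only; not real NHANES records.
rows = [
    Respondent(False, False, True, 1000.0),
    Respondent(None, False, False, 3000.0),
    Respondent(True, False, False, 5000.0),   # excluded: vaccinated
    Respondent(False, True, True, 2000.0),    # excluded: prior diagnosis
]
print(weighted_prevalence(rows))
```

Applying the same filter before fitting and before evaluating a model keeps the reported performance tied to the population the model is meant to serve.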

When developing a predictive model for chronic HBV infection, it is critical to consider the reliability and practicality of the variables used. While certain behaviors such as illegal drug use and some sexually transmitted infections are associated with a higher risk of HBV infection, patients may not always disclose this information to health care providers, leading to underreporting and data inaccuracies.

Similarly, socioeconomic factors like income or net worth could predict HBV infection, yet patients may not know or wish to share precise financial details. Given these challenges, it is essential to identify which variables are both reliable and feasible for health care providers to collect during a typical patient encounter like a doctor’s office visit.

The model must be designed with sensitivity to these factors and incorporate data that can be reliably obtained during a medical visit. The variables chosen should be those that can be systematically and ethically collected, ensuring that the predictive model can be applied broadly and efficiently in a clinical setting.
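A minimal sketch of screening candidate predictors against these two requirements follows. The `Candidate` class and the flag values are illustrative assumptions: illegal drug use and income come from the discussion above, while age is a hypothetical example of a variable that is easy to collect. Each team would need to justify its own flags.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Candidate:
    name: str
    reliably_reported: bool      # patients are likely to disclose it accurately
    collectable_in_visit: bool   # feasible to record in a typical encounter


def usable_predictors(candidates: List[Candidate]) -> List[str]:
    """Return the names of candidates that pass both checks."""
    return [
        c.name
        for c in candidates
        if c.reliably_reported and c.collectable_in_visit
    ]


# Illustrative judgments only; not a recommended variable list.
candidates = [
    Candidate("age", True, True),
    Candidate("illegal_drug_use", False, True),
    Candidate("income", False, True),
]
print(usable_predictors(candidates))
```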

Prize Categories and Judging Criteria

Prizes

Winning Team: $75 per member

Best Data Collection Award: $35 per member

Best Data Visualization Award: $35 per member


Raffle prize: $25 × 2

The raffle is open to everyone who submits a valid project that is judged.

Expectations and Judging

The winning team is the one that demonstrates the best combination of data-driven insights and effective communication for the challenge. Additional prizes will be awarded for Best Data Collection and Best Data Visualization. In general, the judges will use the criteria in the scoring rubric below to determine which teams win prizes.

Teams must deliver:

  • A presentation explaining their solution.
  • A basic report that supports their presentation (provide a link to the team GitHub page).
  • Coding scripts (provide a link to the team GitHub page).

Scoring Rubric

Theme: Does the team develop a solution that addresses the challenge?

Data Collection (*): Do the data collected align well with the challenge? Was the relevance of the data effectively communicated in the presentation?

Methods: Are the data analysis and technology behind the idea well-chosen and skillfully implemented?

Presentation: How well was the project presented? Does it make the idea more appealing?

Visualization (*): How attractive, innovative, and user-friendly is the data visualization?

(*): These criteria are also used to award the separate "Best Data Collection" and "Best Data Visualization" prizes.

Presentation Day

8:30 - 9:00am Breakfast in Schaeffer Hall, outside 140 SH.

9:00 - 10:00am Team presentations (5-minute presentation followed by a 3-minute Q&A per team)

10:00 - 10:20am Break

10:20 - 10:40am Award ceremony

Links to Team GitHub pages (Teams, please email Aixin your GitHub repository URL by Nov 11 so it can be posted below.)

Team 1. Predictive Pioneers

Team 2. Data Divers

Team 3. The Wranglers

Team 4. StatHawks

Team 5. Modeling Masters

Team 6. HealthGuard

Team 7. The Misfits

Order of Presentation (8 min per team, including Q&A)

9:02 StatHawks

9:10 The Wranglers

9:18 Data Divers

9:26 The Misfits

9:34 HealthGuard

9:42 Predictive Pioneers

9:50 Modeling Masters
