Skip to main content
WiDS News | December 1, 2020

Announcing the 4th Annual WiDS Datathon Challenge: Identifying Diabetes Condition to Provide Better Care for ICU Patients

We are gearing up to launch the 4th Annual Women in Data Science (WiDS) Datathon in January, in advance of our first 24-hour virtual 2021 WiDS Worldwide Conference on March 8th. This year’s datathon, organized by the WiDS Worldwide team, the West Big Data Innovation Hub, and the WiDS Datathon Committee, will focus on models to determine whether a patient admitted to an Intensive Care Unit (ICU) has been diagnosed with a particular type of diabetes.

Background on the challenge
Getting a rapid understanding of the context of a patient’s overall health has been particularly important during the COVID-19 pandemic as healthcare workers around the world struggle with hospitals overloaded by patients in critical condition. ICUs often lack verified medical histories for incoming patients. A patient in distress or a patient who is brought in confused or unresponsive may not be able to provide information about chronic conditions such as heart disease, injuries, or diabetes. Medical records  may take days to transfer, especially for a patient  from another medical provider or system. Knowledge about chronic conditions can inform clinical decisions about patient care and ultimately improve patient outcomes.

During November’s American Diabetes Month various groups raised awareness about this disease that afflicts 34.2 million Americans or 10.5% of the population. And one in five people (7.3 million) who met the laboratory criteria for it aren’t aware they are living with the disease or how to manage it. People with diabetes are also at risk for more serious outcomes from COVID-19. Between February and May, a CDC study that analyzed 10,000 deaths found that 40 percent of those who have died from COVID-19 were living with diabetes. On a global scale, 463 million adults were living with diabetes in 2019, and 1 in 2 (232 million) people were undiagnosed.

The dataset and challenge
We will take a closer look at data similar to the WiDS 2020 Datathon data from MIT’s GOSSIS (Global Open Source Severity of Illness Score) Initiative, but instead of predicting patient survival, this year the WiDS Datathon will focus on creating models to classify whether patients have been diagnosed with a certain type of diabetes which could inform treatment in the ICU.

Who can participate in the datathon
We invite anyone from those new to data science to veterans of the field to participate. For those who have never tried machine learning, we will be releasing a series of guides to help you get started with the algorithms and dataset.

The WiDS Datathon aims to inspire women worldwide to learn more about data science, and to create a supportive environment for women to connect with others in their community who share their interests. Toward these ends, we open the datathon to individuals or teams of up to 4; at least half of each team must be women (people identifying as female). Participants can be students, faculty, government workers, members of NGOs, or industry members.

How it works
The datathon will run from early January-February 2021 on Kaggle, an online community of data scientists.

Labeled training and validation sets will be provided for model development; you will then upload your classifications for an unlabeled test set to Kaggle and these will be used to determine the public leaderboard rankings and the winners of the competition.

Winners will be announced at the WiDS Worldwide Conference held virtually on March 8, 2021. Beyond the leaderboard rankings, individuals and teams will also have an opportunity to submit papers about their work to be eligible for an Excellence in Research Award from the National Science Foundation Big Data Innovation Hubs.

Getting started
Make your plans to build a team, hone your data science skills, and join us in this year’s  challenge focused on social impact. We recommend you:

  • Sign up now to participate, and we will send you the link to download the dataset on Kaggle when the competition begins. 
  • Join the WiDS community mailing list to make sure you receive news about the WiDS Datathon.
  • Consider forming a team with new collaborators. Connect with potential teammates on the Kaggle forums or datathon-focused virtual workshops hosted by WiDS Ambassadors beginning in January, and through social media with #WiDSDatathon.

Lastly, be creative, and have fun! Good luck to all participants — we’re so excited to see what you create.


Picture