Artificial intelligence techniques have inspired a new COVID-19 forecasting model provide timely information at a more localized level.
The researchers say that officials and anyone in the public can use in their decision-making processes.
“We are all overwhelmed by the data, most of which is provided at national and state levels,” says Xifeng Yan, an associate professor of and chair in computer science at the College of Engineering at the University of California, Santa Barbara.
“Humans, even trained professionals, are not able to process the massive data as effectively as computer algorithms.”
“Parents are more interested in what is happening in their school district and if it’s safe for their kids to go to school in the fall,” Yan says. “However, there are very few websites providing that information. We aim to provide forecasting and explanations at a localized level with data that is more useful for residents and decision makers.”
“The challenges of making sense of messy data are precisely the type of problems that we deal with every day as computer scientists working in AI and machine learning,” says Yu-Xiang Wang, an assistant professor of and chair in computer science. “We are compelled to lend our expertise to help communities make informed decisions.”
‘Transforming’ COVID-19 forecasting
Yan and Wang developed an innovative forecasting algorithm based on a deep learning model called Transformer. The model is driven by an attention mechanism that intuitively learns how to forecast by learning what time period in the past to look at and what data is the most important and relevant.
“If we are trying to forecast for a specific region, like Santa Barbara County, our algorithm compares the growth curves of COVID-19 cases across different regions over a period of time to determine the most-similar regions. It then weighs these regions to forecast cases in the target region,” explains Yan.
In addition to COVID-19 data, the algorithm also draws information from the US Census to factor in hyper-local details when calibrating the forecast for a local community.
“The census data is very informative because it implicitly captures the culture, lifestyle, demographics, and types of businesses in each local community,” says Wang.
“Hopefully, the next time we are in such a situation, we will be better equipped to make the right decisions at the right time.”
“When you combine that with COVID-19 data available by region, it helps us transfer the knowledge learned from one region to another, which will be useful for communities that want data on the effectiveness of interventions in order to make informed decisions.”
The researchers’ models showed that, during the recent spike, Santa Barbara County experienced spread similar to what Mecklenburg, Wake, and Durham counties in North Carolina saw in late March and early April. Using those counties to forecast future cases in Santa Barbara County, the researchers’ attention-based model outperformed the most commonly used epidemiological models: the SIR (susceptible, infected, recovered) model, which describes the flow of individuals through three mutually exclusive stages; and the autoregressive model, which makes predictions based solely on a series of data points displayed over time.
The AI-based model had a mean absolute percentage error (MAPE) of 0.030, compared with 0.11 for the SIR model and 0.072 with autoregression. The MAPE is a common measure of prediction accuracy in statistics.
Fixing models for better data
Yan and Wang say their model forecasts more accurately because it eliminates key weaknesses associated with current models. Census data provides fine-grained details missing in existing simulation models, while the attention mechanism leverages the substantial amounts of data now available publicly.
“Humans, even trained professionals, are not able to process the massive data as effectively as computer algorithms,” says Wang. “Our research provides tools for automatically extracting useful information from the data to simplify the picture, rather than making it more complicated.”
The researchers plan to make their model and forecasts available to the public via a website and to collect enough data to forecast for communities across the country.
“We hope to forecast for every community in the country because we believe that when people are well informed with local data, they will make well-informed decisions,” says Yan.
They also hope their algorithm can be used to forecast what could happen if a particular intervention is implemented at a specific time.
“Because our research focuses on more fundamental aspects, the developed tools can be applied to a variety of factors,” adds Yan. “Hopefully, the next time we are in such a situation, we will be better equipped to make the right decisions at the right time.”
The researchers will present their work later this month during the Computing Research Association (CRA) Virtual Conference. Additional researchers from Cottage Hospital in Santa Barbara contributed to the work.
Source: UC Santa Barbara