The document discusses using data science to drive healthcare operations. It describes using models to close gaps in patient care by predicting which diabetic patients will develop complications in the next 6 months based on demographic data, medical history, medications and lab tests. The challenges are class imbalance, with few patients historically developing complications, and missing lab data. Gradient boosting decision trees are able to handle these issues better than logistic regression. Testing shows the model can identify high-risk patients to call with a 24% precision and 66% recall. A trial using the model to select patients for home visits found more complications than random selection, showing the approach can improve outcomes.