Using multiple data sources to study health outcomes in a vulnerable population