Streamlining Research Data Integration for
LSTM with Boomi MDH
Client Overview
The Liverpool School of Tropical Medicine (LSTM) is a globally renowned institution dedicated to improving health outcomes in some of the world’s most disadvantaged communities. Established in 1898 as the world’s first institution devoted exclusively to tropical medicine, LSTM continues to lead the way in tackling global health challenges.
Through pioneering research, education, and clinical initiatives, LSTM focuses on combating infectious, neglected, and tropical diseases. Its work spans critical areas such as malaria, tuberculosis, HIV/AIDS, and maternal and child health.
Collaborating with partners in over 70 countries, LSTM delivers innovative solutions that address some of the most pressing health issues of our time.
Business Objective
LSTM aimed to centralize and enhance the accuracy of research personnel data (staff and students), enforce data quality standards, and provide a consistent source of truth by integrating it into the Research Information Management System Elsevier Pure. They were looking for a solution to eliminate manual intervention, ensure data quality, and consolidate duplicate records.
Industry
Education
Platform
Boomi
Service
Integration
Challenges
Manual Configuration
Manual setup of AWS EKS risked errors and inconsistencies, necessitating full automation with Terraform.
EKS Learning
Internal team’s limited EKS expertise led to setup and troubleshooting challenges, needing better coordination.
Security Compliance
Implementing robust security measures and gaining approvals in a hyper-secure environment was complex.
Migration Disruptions
Migrating APIs risked downtime and integration issues, requiring phased migration and extensive testing.
Solutions
Centralized Master Data Management
LSTM lacked a single source of truth for person data, as student and staff information resided in separate systems (Salesforce and SQL Server), resulting in data inconsistency and duplication in Elsevier Pure (RIMS) To address this, we configured Boomi Master Data Hub (MDH) as the central repository to consolidate and manage all personal data. Staff data from the relational database management system and student data from Salesforce were ingested into MDH, offering a unified view of data to feed into Pure.
Data Quality and Matching Rules
Lack of data quality management resulted in multiple records for the same individuals, especially for those with dual roles (staff and student) or rehires. To identify duplicates and validate record accuracy, our Boomi experts implemented stringent matching rules and data quality controls in the Boomi Master Data Hub. After this, only accurate Golden Records were allowed into Pure for reliable tracking and reporting of individuals.
Automated Data Flow
Our client manually uploaded personal data into their Research Information Management System (Pure), relying on stored procedures to generate XML files, which were then uploaded manually. All the manual processes are prone to errors. We build automated integration processes in Boomi to fetch data from the source systems, validate and transform it, and then push it directly into Pure. This eliminates manual effort, reduces errors, and improves efficiency.
Single Unified Entity Model
Their Research Information Management System (Pure) worked with a single “Person” entity model with affiliated roles (student and staff). However, the student and staff data had different structures in their respective source systems. Variation in data structure caused duplication and confusion in Pure. We designed Boomi MDH to store all individuals as a single “Person,” regardless of their source system structure or role. Any person who was linked to more than one affiliation record earlier now aligns with Pure’s data model, eliminating any duplication or structural inconsistencies.
Custom Business Logic
LSTM requires specific business rules for conditions like - staff re-joining the institution, students also being staff, and automatic creation of missing affiliations. These solutions were not initially supported in the Boomi setup. We implemented custom business logic within the Boomi integration layer and MDH. Re-joining staff with the same original ID were merged and historical affiliations retained, dual role individuals linked to one person record, and for missing student affiliations were created dynamically during the export to Pure.
Tailored Handling of Complex Identity Scenarios
Some records could not be confidently matched or automatically resolved due to ambiguous data (e.g., the same name or email across different entities). If not managed well, this resulted in potential data corruption. Records that couldn’t be matched automatically were sent to quarantine in MDH to be reviewed by the Data Stewards. We set up rules, like checking for similar names or emails to avoid incorrect merges. Once the review is done, the correct affiliations are automatically linked to the person's record.
Results
Automated Integration
90-93% automated integration flows replaced the manual process of preparing and uploading XML files.
Reduced Duplication
Reduced data duplication with matching rules and MDH validations that only record per person exists in RIMS (Pure).
Enhanced Quality
Enhanced 100% data quality by building a quarantine in MDH to check that records met organizational standards before integration.
Consolidated Records
Consolidated person records in Pure helped LSTM to track research activities and affiliations accurately.
Technology Stack
Related Case Studies
Get in touch
Tell us what you're looking for and we'll get you connected to the right people.