Uganda Martyrs University Institutionalnal Repository (UMU-IR)
    • Login
    View Item 
    •   UMU Dissertations
    • Faculty of Science
    • Master of Science in Information Systems
    • Master of Science in Information Systems (Dissertations)
    • View Item
    •   UMU Dissertations
    • Faculty of Science
    • Master of Science in Information Systems
    • Master of Science in Information Systems (Dissertations)
    • View Item
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    A contextual framework for data cleaning in clinical research

    Thumbnail
    View/Open
    Pius Kavuma _SCI_MSCIS_2025_ Julius Muganji.pdf (18.43Mb)
    Date
    2025-09
    Author
    Kavuma, Pius
    Metadata
    Show full item record
    Abstract
    This study presents the design, development, and validation of a contextual data cleaning framework tailored for clinical research settings in low-resource environments, using the DPSP (Dihydroartemisinin-Piperaquine and Sulfadoxine-Pyrimethamine) trial at Masafu Hospital as a case study. The research was motivated by persistent data quality challenges—such as missing values, inconsistencies, human errors, and tool limitations—that often compromise the validity and reliability of clinical research outcomes. Employing a user-intervention methodology, the study integrated qualitative insights from data managers, clinical teams, and analysts with quantitative assessment techniques to ensure that the proposed framework aligns with real-world practices. The framework was structured into distinct phases, including data profiling, preprocessing, modular cleaning, enhancement, and quality scoring—each mapped to address specific data integrity issues. Validation on the DPSP dataset demonstrated a significant improvement in data accuracy (from 75% to 94%), completeness (from 68% to 90%), and consistency (from 70% to 93%), confirming the framework’s effectiveness and usability. SQL driven automation further improved scalability and reduced human error. The study contributes to the literature by offering a novel, context-sensitive approach that balances domain expertise with technical rigor. It recommends future work to expand the framework’s applicability to unstructured data types and to assess its operational integration and cost-effectiveness. Overall, the framework serves as a practical tool for improving data quality in clinical trials and enhancing the credibility of health research in resource-constrained settings.
    URI
    http://dissertations.umu.ac.ug/xmlui/handle/123456789/1876
    Collections
    • Master of Science in Information Systems (Dissertations) [52]

    UMU_DR copyright © 2022-2025  UMU_IR
    Contact Us | Send Feedback

    UMU_Library
     

     

    Browse

    All of DSpaceCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsThis CollectionBy Issue DateAuthorsTitlesSubjects

    My Account

    LoginRegister

    UMU_DR copyright © 2022-2025  UMU_IR
    Contact Us | Send Feedback

    UMU_Library