This module is designed to equip students with the essential skills to thrive in the rapidly evolving field of health data science. It integrates modern data and engineering techniques: efficient data querying and management within relational database systems using SQL; programming in Python and R to clean and preprocess data, addressing challenges such as missing values and duplicates; and the use of version control (Git) to ensure the reproducibility and traceability of data-related projects. The module is structured to provide a comprehensive understanding of data handling techniques, so that students are well prepared for the challenges of the health data science workflow. Assessment is coursework-based and consists of two assessments built around authentic data analysis problems encountered by health data scientists.