Data Scientist | ML Engineer | Analytics Specialist
Transforming complex data into actionable insights through cutting-edge machine learning, causal inference, and advanced analytics. Specializing in predictive modeling, uplift modeling, business intelligence, model deployment for real-world usability, and leveraging Gen AI for data-driven decision-making across healthcare, marketing, and marketplace domains.
I am a certified data scientist with a strong academic background in public health, uniquely positioned at the intersection of advanced analytics, business intelligence, and generative AI. My mission is to leverage data science methodologies and Gen AI capabilities to solve complex problems and drive strategic decision-making across diverse industries, while ensuring my solutions are accessible and usable by domain experts.
With expertise spanning machine learning, causal inference, statistical modeling, business intelligence, generative AI applications, and production model deployment, I specialize in developing predictive and causal models that translate complex data into actionable insights. My latest work extends into uplift modeling and interference detection — applying meta-learner algorithms (T-Learner, S-Learner, X-Learner) to estimate individual-level treatment effects in two-sided marketplaces, achieving a 5.02x ROI improvement through precision targeting. A key focus of my work is deploying machine learning models on platforms like Streamlit, creating intuitive interfaces that enable healthcare professionals, business analysts, and stakeholders to leverage advanced analytics without technical barriers. My portfolio spans causal ML for marketing optimization, healthcare prediction models (heart failure, fetal health outcomes), sales analytics, digital advertising performance analysis, and innovative Gen AI implementations.
End-to-end data science solutions spanning causal machine learning, predictive modeling, and business analytics across healthcare, marketing, and marketplace domains
Advanced ML system predicting heart failure mortality risk using 12 clinical parameters. Features sophisticated ratio-based feature engineering, Random Forest classification with balanced class weights, and an interactive Streamlit deployment for clinical decision support.
Comprehensive predictive analytics system for assessing fetal health outcomes using cardiotocography (CTG) data. Implements structured ML pipeline with automated preprocessing, multi-class classification, and real-time risk assessment interface.
End-to-end causal machine learning framework applied to a simulated food delivery marketplace. Implements T-Learner, S-Learner, and X-Learner meta-learners to estimate individual-level treatment effects (CATE), achieving a 5.02x ROI improvement and $140K annual savings over blanket targeting. Includes network interference detection using cluster-level ATE analysis and Intra-Cluster Correlation (ICC) to quantify spillover bias that corrupts standard A/B tests — moving beyond "who will buy" to "who will buy because of the promotion."
Through advanced data analysis, I identified critical trends in customer engagement and conversion rates, providing strategic recommendations that enhanced marketing ROI. My work has directly influenced the optimization of marketing efforts, driving higher customer satisfaction and business performance.
I conducted a comprehensive analysis of real-world internet sales data, utilizing SQL and Power BI. This project involved meticulous data cleaning, transformation, and modeling, resulting in actionable insights. I utilized DAX formulas to enhance the depth of analysis.
I conducted a comprehensive analysis of digital ad spending using Power BI. The objective was to identify key performance trends, optimize ad spend, and maximize ROI. I designed interactive dashboards that provided insights into ad channel performance, cost-per-click (CPC), and conversion rates.
Continuous learning and professional development
IBM via Coursera
12 Comprehensive Courses | October 2025
Cognifyz Technologies
Jan 2025 - Feb 2025
WorldQuant University
September 2025
Udemy - Kirill Eremenko
10 Total Hours | February 2022
Udemy - Ahmad Oyelowo
5 Total Hours | February 2022
Udemy - Jose Portilla
9 Total Hours | May 2022
Comprehensive toolkit for data-driven innovation
Open to collaborations, opportunities, and discussions about data-driven healthcare solutions