Programming for data processing
Python, pandas, NumPy; R, dplyr, tidyr, ggplot2; repeatable cleaning, transformation, validation and structured exports.
SQL & databases
PostgreSQL, MySQL; joins, CTEs, views, window functions, reporting-ready tables, source-to-output validation.
Dashboards & reporting
Power BI, Tableau, Excel; KPI reporting, drilldowns, trend views, dashboard usability and stakeholder-ready summaries.
Data quality & governance
Profiling, validation rules, standardisation, deduplication, anomaly checks, QA spot-checks, audit logs and documentation.
Analytics & modelling
RFM analysis, K-Means clustering, cohort-style comparisons, trend analysis, forecasting with SARIMAX, MAE/RMSE validation.
Data systems & delivery
FastAPI, Flask, Docker, Git, Pytest, Jira, Confluence, Bitbucket; basic automated tests and handover documentation.
AI/RAG evaluation exposure
LlamaIndex, Sentence-Transformers, pgvector, RAGAS, labelled evaluation datasets, citation audit checks and quality reporting.
Research data exposure
Public survey/population datasets, variable definition checks, comparability assessment, missingness review and methodology notes.
Stakeholder communication
Requirements clarification, plain-English issue explanation, written updates, reporting commentary and practical documentation.