Eashwar Subramanian

Data Analyst / Data Scientist • Tableau • Power BI • SQL • Python • Data Quality

About Me

I’m a Master of Data Science graduate from RMIT (Dec 2025), based in Melbourne.

I work across the full analytics lifecycle: collect data → clean/validate → structure it into report-ready tables → analyse trends/patterns → communicate insights clearly to stakeholders. I care a lot about accuracy, traceability, and repeatable outputs.

Recent work includes healthcare document ingestion (PDF/HTML), metadata governance and deduplication, database design in PostgreSQL, and building QA/audit checks to improve trust in reporting.

I’m comfortable collaborating with both technical and non-technical teams, clarifying requirements, and documenting assumptions so the data stays usable over time.

Melbourne, Australia
Open to Full-time & Contract
SQL • Python • Power BI/Tableau
Education
Master of Data Science
RMIT University — Dec 2025
Bachelor of Engineering
Electronics & Communication Engineering — 2023
Work Authorization
Details available upon request
What I’m Strong At
Data Quality: validation rules, deduplication, audit outputs
Data Systems: PostgreSQL, schema design, indexing
Automation: Python workflows, reliable pipelines
Analytics: dashboards, segmentation, forecasting
Evaluation: reproducible testing/monitoring for quality over time

Professional Experience

Data Science Intern
Cultural Infusion • Melbourne, Australia
Jan 2026 – Present
  • Build and maintain automated pipelines to collect and structure publicly available data (APIs + website/RSS sources) into analysis-ready datasets.
  • Apply data quality controls (deduplication, normalisation, timestamp validation, and QA flags) and document rules/assumptions to keep outputs reliable.
  • Develop text-processing workflows to convert unstructured content into consistent fields for trend and theme analysis over time.
  • Produce stakeholder-friendly summaries of “what changed / why it matters / what to do next”, and iterate based on feedback to improve signal-to-noise.
Data Science Intern
Solara Health • Melbourne, Australia
Jul 2025 – Nov 2025
  • Built ingestion pipelines for a healthcare content library (PDF/HTML), including metadata governance and SHA-256 deduplication to improve dataset reliability.
  • Developed automated download and parsing workflows with retries and content-type handling to improve consistency for downstream analysis and retrieval.
  • Designed PostgreSQL data structures for content (metadata + embeddings) with indexing patterns, and exposed curated datasets via a FastAPI service with basic automated tests (Pytest).
  • Implemented tenant-aware access controls using PostgreSQL Row-Level Security (RLS) to enforce segregation across multi-hospital deployments.
  • Built QA and audit tooling (flag logging, CSV exports, citation audit harness, labelled evaluation dataset) and tracked quality trends using reproducible metrics.
Data Analyst Intern
PrepInsta Pvt Ltd • Remote (India)
Dec 2023 – Feb 2024
  • Delivered 5+ stakeholder dashboards (Tableau, Excel) and performed source-to-dashboard cross-checks to improve metric consistency.
  • Optimised SQL query performance, reducing data retrieval time by ~25% for executive reporting.
  • Automated reporting workflows with Python (BeautifulSoup), cutting manual effort by ~30%.

Featured Projects

Selected work across Business Intelligence, analytics delivery, and applied data systems. Each project links directly to the proof (repo / live demo).

Healthcare RAG Evaluation (Sanitized)

Documentation + metrics only (no NDA code)

Sanitized case study capturing documented design decisions and evaluation artifacts without sharing proprietary code.

Evaluation Documentation Reproducibility
View Repo

Australian Retail Customer Segmentation (Python + Power BI)

RFM features • K-Means clustering • Dashboard delivery

Segmented Australian retail customers using RFM-style features and clustering, then delivered insights through a Power BI dashboard built on a customer-level dataset.

Python scikit-learn Power BI Data Quality
View Repo

Sales Analytics Dashboard (Power BI + MySQL)

Reproducible model • MySQL dump included

Power BI sales dashboard packaged with a MySQL database dump so the dataset can be restored locally and the model reproduced.

Power BI MySQL Reporting
View Repo

Australian Climate Forecast Dashboard (Flask + SARIMAX + Folium)

Time-series forecasting • Web dashboard

Flask dashboard that generates on-demand weekly SARIMAX forecasts for Australian locations and renders an interactive map.

Python Flask SARIMAX Folium
View Repo

Real Estate Data Cleaning (MySQL)

Standardization • Parsing • De-duplication

Data cleaning workflow for the Nashville housing dataset using MySQL statements (date standardization, address parsing, duplicate removal).

MySQL Data Cleaning Data Quality
View Repo

Technical Expertise

Data & Analytics
SQL querying • Data profiling • Dashboarding (Power BI/Tableau) • Stakeholder reporting • Exploratory analysis
Data Quality & Governance
Validation rules • Deduplication • Audit outputs • Metadata management • Tenant isolation (RLS concepts)
Engineering
Python automation • ETL workflows • PostgreSQL schema design • Indexing • APIs (FastAPI) • Testing (Pytest)
ML & Evaluation
Retrieval systems (RAG) • Embeddings • Monitoring/evaluation runs • Forecasting (SARIMA) • Clustering (K-Means)
Tools
Python (Pandas, NumPy) • SQL • Power BI/Tableau • Git • PostgreSQL • AWS (S3/EC2/Lambda exposure)

Let’s Connect

Get in Touch

Open to full-time and contract roles in Data Analytics / Data Science, with a focus on reliable data systems, data quality, and decision-ready reporting.

Send a Message
Resume PDF
* Required fields