visitor@portfolio:~$
visitor@portfolio:~$
cat about.txt
3+ years building production data pipelines across healthcare and enterprise analytics.
I care about data that's reliable - not just data that exists.

Currently at Innovaccer, working on EMR & payer analytics - building pipelines that process ~20k patients/month and keeping clinical data systems honest.

Outside of work, I build open source tools that solve real DE problems I've actually faced in production.
visitor@portfolio:~$
ls -la skills/
programming/
PythonSQLpandasNumPyMatplotlib
visualization/
Power BIDAXExcelGoogle Sheets
cloud/
SnowflakeAzure DatabricksADLSAzure Data Factory
orchestration/
Apache AirflowGit
domain/
EMR & ClaimsPayer AnalyticsHealthcare Data
visitor@portfolio:~$
cat experience.log
Innovaccer Inc.
Analytics Engineer  ·  Aug 2024 – Present  ·  Noida
  • Identified systematic patient misassignments - fully refactored a 200+ line SQL query, eliminating incorrect care program enrollments
  • Built production SQL logic assigning PCPs for ~20k patients/month against facility thresholds
  • Automated payer file monitoring via Python, eliminating 4hrs/week of manual checks with real-time alerts
  • Built NC Medicaid dashboard with 10+ visuals identifying coverage gaps, optimized to sub-5-second load times
MAQ Software
Analytics Engineer  ·  May 2022 – Sep 2023  ·  Noida & Hyderabad
  • Architected 200+ DAX measures using Tabular Editor - dashboards ranked top 10 across the org's BI portfolio
  • Optimised Azure Data Factory pipelines, contributing to 50% reduction in data extraction time
  • Led ETL transformation validation across 100+ Azure Databricks notebooks
visitor@portfolio:~$
ls projects/
sqlanalyzer
open source
Python  ·  sqlglot  ·  CLI  ·  AST
AST-powered Snowflake SQL complexity analyzer. Detects 8 performance anti-patterns including non-SARGable filters, cartesian joins, and SELECT * usage. Context-aware weighted scoring engine - penalises issues relative to query size, not hardcoded thresholds.
ehr-validator
in progress
Python  ·  pandas  ·  difflib
Pre-ingestion EHR schema validation and auto-healing tool. Detects column drift, fuzzy matches renamed columns above 0.85 similarity threshold, reorders files to match DDL, and quarantines malformed files - without stopping other files from processing.
visitor@portfolio:~$
contact --methods