About Me
Data Engineer with 15+ years of experience designing and delivering end-to-end data solutions. Focused on building and optimising data pipelines, integrating diverse data sources, and enabling data-driven insights in agile environments.
I build and deliver solutions across the full data lifecycle — from gathering stakeholder requirements and designing data models to developing pipelines, creating dashboards, and deploying production-ready solutions.
Domains: Oil & Gas, ESG, Credit Risk Management, Digital Marketing, Telecom, Retail, HR, and Finance.
Technical Skills
Data Engineering
Data Warehousing, Data Modelling (Dimensional, Data Vault), Knowledge Graph
ETL / ELT
Python, Spark, Scala, Airflow, Dagster, Airbyte, SSIS, Pentaho
SQL & Query Engines
SQL, SparkSQL, Trino, Athena, Delta/Databricks, ArcadeDB, Redshift
Visualisation
Superset, Power BI, SiSense, Metabase, Tableau, Oracle BI, SSRS
Cloud Platforms
Databricks, AWS, Microsoft Azure
Programming
Python, Scala, Spark, LinkML, Docker, Git
AI / ML
AI Agents, LLM Integration, LiteLLM, Ollama
Agile Practices
Scrum, Kanban, Requirement Gathering, Refinement, Estimation
Education
Master of Science (Data Science)
Liverpool John Moores University, UK
Bachelor of Technology (Electronic Engineering)
Rajasthan Technical University, IN
Experience
Dec 2022 – Present
Principal Data Engineer
Context Labs B.V. — Amsterdam, NL
- Design and develop data processing solutions using a Data Lakehouse architecture with knowledge graphs and ontology-driven data modelling
- Collaborate with stakeholders to gather and refine requirements, building end-to-end data pipelines for reporting and analytics
- Built an agentic data profiler using LLM-based agents, automating schema inference, validation, and anomaly detection
Tech: Python, Spark, Scala, Delta Lake, Dagster, Trino, Airbyte, Superset, LinkML, ArcadeDB, LiteLLM, Ollama, Azure
Mar 2021 – Nov 2022
Senior Data Engineer
ABN AMRO Bank N.V. — Amsterdam, NL
- Migrated risk data processing from legacy Excel/Access workflows to an automated cloud-based platform
- Implemented new data models, quality controls, and automated data integration pipelines
- Bridged business and technical teams to translate requirements into technical specifications
Tech: Azure, Databricks, Python, PySpark, SparkSQL, Delta Lake, Power BI
Sep 2019 – Feb 2021
Data Operations Lead
Mendix B.V. (Siemens) — Rotterdam, NL
- Built a data warehouse platform from scratch, defining architecture and technical direction for the Data Centre of Excellence
- Established data quality, completeness, and compliance processes
Tech: AWS (Athena, S3, Glue, Redshift), Python, Spark, Delta Lake, Airflow, Docker, Power BI
Oct 2017 – Aug 2019
Business Intelligence Architect
Crowd Media B.V. (now UNITH) — Amsterdam, NL
- Owned the full BI solution from requirements to dashboards and data pipeline management
- Guided teams in applying data quality checks and modelling techniques
Tech: AWS (Redshift, S3, RDS, SageMaker), Pentaho, Sisense, MySQL, Python
Nov 2015 – Sep 2017
Senior Analyst
Accenture B.V. — Amsterdam, NL
- Developed data models and interactive dashboards for telecom reporting and analytics
- Automated manual processes through Oracle APEX and BI solutions
Tech: OBIEE / Oracle BI, Oracle APEX, PL/SQL, Python
Oct 2014 – Oct 2015
Data Intelligence Engineer
EMC2 Pvt. Ltd. (now Dell EMC) — Bangalore, IN
- Built data models and ETL processes for HR and finance reporting
- Analysed storage rack logs to support data science proof of concept
Tech: MSBI (SSIS, SSAS, SSRS), T-SQL, SQL Server, Greenplum, Python
Apr 2011 – Oct 2014
Software Engineering Analyst
Accenture Technology Pvt. Ltd. — Bangalore, IN
- Built end-to-end ETL pipelines and dashboards using MSBI, SQL Server, and Tableau for multiple clients
- Designed dimensional models for enterprise data warehouses
Tech: MSBI (SSIS, SSAS, SSRS), T-SQL, SQL Server, Hadoop (Hive/Pig), Tableau
Certifications
Azure Data Scientist Associate
Azure Data Engineer Associate
Data Analyst Associate
Scrum Developer Certified (SDC)