Hello, I'm

Pawan Kumar
Principal Data Engineer

15+ years of experience designing and delivering end-to-end data solutions. Building data pipelines, integrating diverse data sources, and enabling data-driven insights.

About Me

Data Engineer with 15+ years of experience designing and delivering end-to-end data solutions. Focused on building and optimising data pipelines, integrating diverse data sources, and enabling data-driven insights in agile environments.

I build and deliver solutions across the full data lifecycle — from gathering stakeholder requirements and designing data models to developing pipelines, creating dashboards, and deploying production-ready solutions.

Domains: Oil & Gas, ESG, Credit Risk Management, Digital Marketing, Telecom, Retail, HR, and Finance.

Technical Skills

Technologies and tools I work with

Data Engineering

Data Warehousing Dimensional Modelling Data Vault Knowledge Graph

ETL / ELT

Python Spark Scala Airflow Dagster Airbyte SSIS Pentaho

SQL & Query Engines

SQL SparkSQL Trino Athena Delta/Databricks ArcadeDB Redshift

Visualisation

Superset Power BI SiSense Metabase Tableau Oracle BI

Cloud Platforms

Databricks AWS Microsoft Azure

AI / ML

AI Agents LLM Integration LiteLLM Ollama

Experience

My professional journey

2022
Principal Data Engineer
Context Labs B.V. — Amsterdam, NL
  • Design and develop data processing solutions using a Data Lakehouse architecture with knowledge graphs and ontology-driven data modelling
  • Collaborate with stakeholders to gather and refine requirements, building end-to-end data pipelines for reporting and analytics
  • Built an agentic data profiler using LLM-based agents, automating schema inference, validation, and anomaly detection
PythonSparkScalaDelta LakeDagsterTrinoAirbyteSupersetLinkMLArcadeDBLiteLLMAzure
2021
Senior Data Engineer
ABN AMRO Bank N.V. — Amsterdam, NL
  • Migrated risk data processing from legacy Excel/Access workflows to an automated cloud-based platform
  • Implemented new data models, quality controls, and automated data integration pipelines
  • Bridged business and technical teams to translate requirements into technical specifications
AzureDatabricksPythonPySparkSparkSQLDelta LakePower BI
2019
Data Operations Lead
Mendix B.V. (Siemens) — Rotterdam, NL
  • Built a data warehouse platform from scratch, defining architecture and technical direction for the Data Centre of Excellence
  • Established data quality, completeness, and compliance processes
AWSAthenaS3GlueRedshiftPythonSparkAirflowDockerPower BI
2017
Business Intelligence Architect
Crowd Media B.V. (now UNITH) — Amsterdam, NL
  • Owned the full BI solution from requirements to dashboards and data pipeline management
  • Guided teams in applying data quality checks and modelling techniques
AWSRedshiftS3SageMakerPentahoSisenseMySQLPython
2015
Senior Analyst
Accenture B.V. — Amsterdam, NL
  • Developed data models and interactive dashboards for telecom reporting and analytics
  • Automated manual processes through Oracle APEX and BI solutions
OBIEEOracle BIOracle APEXPL/SQLPython
2014
Data Intelligence Engineer
EMC2 Pvt. Ltd. (now Dell EMC) — Bangalore, IN
  • Built data models and ETL processes for HR and finance reporting
  • Analysed storage rack logs to support data science proof of concept
MSBISSISSSASSSRST-SQLGreenplumPython
2011
Software Engineering Analyst
Accenture Technology Pvt. Ltd. — Bangalore, IN
  • Built end-to-end ETL pipelines and dashboards using MSBI, SQL Server, and Tableau for multiple clients
  • Designed dimensional models for enterprise data warehouses
MSBISSISSSASSSRST-SQLHadoopHiveTableau

Education

Master of Science
Data Science
Liverpool John Moores University, UK
Bachelor of Technology
Electronic Engineering
Rajasthan Technical University, IN

Certifications

Azure Data Scientist
Azure Data Scientist Associate
Azure Data Engineer
Azure Data Engineer Associate
Power BI Data Analyst
Power BI Data Analyst Associate
Python PCAP
Certified Associate in Python (PCAP)
Scrum Developer Certified
Scrum Developer Certified (SDC)

Let's Connect

You can reach out to me using any of the channels below