// Kathmandu, Nepal (UTC+5:45) · Open to Remote

PRASHANTNEPAL

Data Engineer · Spark Optimization · Lakehouse Architecture · K8s Platforms

I build data infrastructure that survives production at terabyte scale — fast pipelines, Iceberg lakehouses, and Kubernetes platforms that actually work.

AVAILABLE FOR NEW OPPORTUNITIES
Let's Talk → ▶ Play the Game
80%
Pipeline SpeedupLIVE
Spark GainLIVE
99%
Job SuccessLIVE
3TB+
Daily VolumeLIVE
SCROLL
// 01 · Live Architecture

The lakehouse in motion.

● LIVE 3.2TB processed today
Bronze — Raw Ingestion
Silver — Transformed
Gold — Analytics Ready
DLQ — Failed Events
// 02 · Arsenal

Stack, ranked by depth.

[ EXPERT ] — daily production
⚡ Apache Spark
93%
Rewrote partitioning for 4× perf on TB workloads
☸ Kubernetes/OpenShift
91%
Multi-tenant Spark Operator, SCCs, network policies
🧊 Apache Iceberg
90%
Bronze→Silver→Gold medallion, ACID, time travel
🐍 PySpark / Python
92%
ETL pipelines, UDFs, performance tuning at scale
🌊 Apache Airflow
88%
DAG authoring, SLA monitoring, cross-system deps
🟠 Apache NiFi
86%
3TB+/day from SFTP, Oracle CDC, REST ingest
🐳 Docker & Helm
87%
Containerized platform services, chart templating
📦 Parquet / Avro / ORC
89%
128MB blocks, column pruning, 3× read throughput
🗃 Oracle / PostgreSQL / SQL
80%
128MB blocks, column pruning, 3× read throughput
[ SOLID ] — built and shipped
🔷 Trino
72%
Federated queries across Iceberg, Hive, Postgres
📡 Kafka
70%
Event streaming from Oracle CDC into Bronze layer
🗄 Hive Metastore
68%
Catalog for Spark + Trino, schema management
🧪 DBT
70%
Silver→Gold transformations, data quality tests
🪣 MinIO / S3
67%
Object storage backend for Iceberg data lake
📊 Superset
65%
Dashboards on Gold layer via Trino connector
☁ Azure
66%
VM provisioning for OpenShift CRC cluster nodes
🔑 Lakekeeper
68%
REST Iceberg catalog with fine-grained authz
[ LEARNING ] — working knowledge
⭐ StarRocks
45%
OLAP queries on flat tables, exploring adoption
🎯 Doris
42%
Real-time analytics, evaluating for BI workloads
☕ Java
48%
Spark internals reading, custom UDF extension
⚙ CI/CD
52%
GitHub Actions pipelines, ArgoCD GitOps basics
🔴 ODF / Ceph
40%
Distributed block/object storage on OpenShift
// 03 · Experience

Where I've shipped.

Feb 2025 — Present
DLytica Inc.
Client: Ooredoo
Kathmandu, Nepal
Data Engineer — Platform & ETL
4× Spark performance — rewrote partitioning, broadcast joins, shuffle configs. TB jobs: hours → minutes.
Spark Connect on OpenShift — centralized compute for 15+ devs, 70% faster onboarding.
Oracle → Iceberg lakehouse — bronze/silver/gold, ACID guarantees, 60% latency cut.
3TB+ daily ingestion — NiFi + Kafka from SFTP, Oracle CDC, REST APIs at 95% reliability.
85% downstream error reduction — automated reconciliation and data quality frameworks.
SparkOpenShiftIcebergTrinoAirflowNiFiKafkaDBT
Feb 2025 — Present
DLytica Academy
Tutor — Data Engineering
Mentored 10+ Data Engineering Fellows on Spark, K8s, DBT, Airflow, and Iceberg.
Feb 2023 — Jan 2024
Offer Sewa Pvt. Ltd.
Junior Software Engineer
REST APIs in Laravel/Python. 40% response time reduction via query indexing.
LaravelPythonPostgreSQL
// 04 · Impact

Numbers I own.

0%
Pipeline Runtime Cut
Terabyte Spark workloads from hours to minutes.
Spark · OpenShift
Spark Performance
Broadcast joins, partitioning, executor rewrite.
Spark SQL · PySpark
0%
Job Success Rate
K8s Spark Operator with tuned pod templates.
Kubernetes · Helm
0%
Error Reduction
Automated reconciliation, zero silent corruption.
Data Quality
Read Throughput
Parquet 128MB blocks, pruning, column projection.
Parquet · MinIO
0%
Faster Onboarding
Spark Connect unlocked 15 engineers overnight.
Spark Connect · DBT
// 05 · Interactive Terminal

Type help to explore.

prashant@data-platform:~$ · ↑↓ history · Tab to complete
╔══════════════════════════════════════════════╗ ║ Prashant Nepal · Data Platform Terminal ║ ╚══════════════════════════════════════════════╝ Type 'help' · ↑/↓ history · Tab to complete
prashant@platform:~$ 
// 06 · Data Pipeline Defender

Protect your lakehouse.

A/D or ← → to move · Space to fire · Enter to start · Beat the leaderboard 🏆

0SCORE
♦♦♦INTEGRITY
1WAVE
BRONZETIER
0BEST
How to play
← → / A D — move
Space / Enter — fire
NULL/CORRUPT — 10–15pts
SCHEMA DRIFT — 20pts
DUPLICATE — 25pts
Score × Wave multiplier
▶ BRONZE 0–299
▶ SILVER 300–699
▶ GOLD 700–1199
▶ PLATINUM 1200+
Leaderboard
// 07 · Contact
AVAILABLE FOR NEW OPPORTUNITIES
Let's build
something extraordinary.

Data engineering roles, platform architecture, distributed systems. Kathmandu — available globally.

Email
404.nepal.prashant@gmail.com
LinkedIn
nepalprashant
GitHub
nepalprashant
Phone · UTC+5:45
+977 982 432 1005

Download Resume

Full CV · opens in browser · Ctrl+P to save as PDF