// Kathmandu, Nepal (UTC+5:45) · Open to Remote

PRASHANTNEPAL

Data Engineer · Spark Optimization · Lakehouse Architecture · K8s Platforms

I build data infrastructure that survives production at terabyte scale — fast pipelines, Iceberg lakehouses, and Kubernetes platforms that actually work.

AVAILABLE FOR NEW OPPORTUNITIES

Let's Talk → ▶ Play the Game

80%

Pipeline SpeedupLIVE

4×

Spark GainLIVE

99%

Job SuccessLIVE

3TB+

Daily VolumeLIVE

SCROLL

// 02 · Arsenal

Stack, ranked by depth.

[ EXPERT ] — daily production

⚡ Apache Spark

93%

Rewrote partitioning for 4× perf on TB workloads

☸ Kubernetes/OpenShift

91%

Multi-tenant Spark Operator, SCCs, network policies

🧊 Apache Iceberg

90%

Bronze→Silver→Gold medallion, ACID, time travel

🐍 PySpark / Python

92%

ETL pipelines, UDFs, performance tuning at scale

🌊 Apache Airflow

88%

DAG authoring, SLA monitoring, cross-system deps

🟠 Apache NiFi

86%

3TB+/day from SFTP, Oracle CDC, REST ingest

🐳 Docker & Helm

87%

Containerized platform services, chart templating

📦 Parquet / Avro / ORC

89%

128MB blocks, column pruning, 3× read throughput

🗃 Oracle / PostgreSQL / SQL

80%

128MB blocks, column pruning, 3× read throughput

[ SOLID ] — built and shipped

🔷 Trino

72%

Federated queries across Iceberg, Hive, Postgres

📡 Kafka

70%

Event streaming from Oracle CDC into Bronze layer

🗄 Hive Metastore

68%

Catalog for Spark + Trino, schema management

🧪 DBT

70%

Silver→Gold transformations, data quality tests

🪣 MinIO / S3

67%

Object storage backend for Iceberg data lake

📊 Superset

65%

Dashboards on Gold layer via Trino connector

☁ Azure

66%

VM provisioning for OpenShift CRC cluster nodes

🔑 Lakekeeper

68%

REST Iceberg catalog with fine-grained authz

[ LEARNING ] — working knowledge

⭐ StarRocks

45%

OLAP queries on flat tables, exploring adoption

🎯 Doris

42%

Real-time analytics, evaluating for BI workloads

☕ Java

48%

Spark internals reading, custom UDF extension

⚙ CI/CD

52%

GitHub Actions pipelines, ArgoCD GitOps basics

🔴 ODF / Ceph

40%

Distributed block/object storage on OpenShift

// 03 · Experience

Where I've shipped.

Feb 2025 — Present

DLytica Inc.
Client: Ooredoo
Kathmandu, Nepal

Data Engineer — Platform & ETL

4× Spark performance — rewrote partitioning, broadcast joins, shuffle configs. TB jobs: hours → minutes.

Spark Connect on OpenShift — centralized compute for 15+ devs, 70% faster onboarding.

Oracle → Iceberg lakehouse — bronze/silver/gold, ACID guarantees, 60% latency cut.

3TB+ daily ingestion — NiFi + Kafka from SFTP, Oracle CDC, REST APIs at 95% reliability.

85% downstream error reduction — automated reconciliation and data quality frameworks.

SparkOpenShiftIcebergTrinoAirflowNiFiKafkaDBT

Feb 2025 — Present

DLytica Academy

Tutor — Data Engineering

Mentored 10+ Data Engineering Fellows on Spark, K8s, DBT, Airflow, and Iceberg.

Feb 2023 — Jan 2024

Offer Sewa Pvt. Ltd.

Junior Software Engineer

REST APIs in Laravel/Python. 40% response time reduction via query indexing.

LaravelPythonPostgreSQL

// 04 · Impact

Numbers I own.

Pipeline Runtime Cut

Terabyte Spark workloads from hours to minutes.

Spark · OpenShift

0×

Spark Performance

Broadcast joins, partitioning, executor rewrite.

Spark SQL · PySpark

Job Success Rate

K8s Spark Operator with tuned pod templates.

Kubernetes · Helm

Error Reduction

Automated reconciliation, zero silent corruption.

Data Quality

0×

Read Throughput

Parquet 128MB blocks, pruning, column projection.

Parquet · MinIO

Faster Onboarding

Spark Connect unlocked 15 engineers overnight.

Spark Connect · DBT

// 05 · Interactive Terminal

Type `help` to explore.

prashant@data-platform:~$ · ↑↓ history · Tab to complete

╔══════════════════════════════════════════════╗ ║ Prashant Nepal · Data Platform Terminal ║ ╚══════════════════════════════════════════════╝ Type 'help' · ↑/↓ history · Tab to complete

prashant@platform:~$

// 06 · Data Pipeline Defender

Protect your lakehouse.

A/D or ← → to move · Space to fire · Enter to start · Beat the leaderboard 🏆

0SCORE

♦♦♦INTEGRITY

1WAVE

BRONZETIER

0BEST

How to play

← → / A D — move

Space / Enter — fire

NULL/CORRUPT — 10–15pts

SCHEMA DRIFT — 20pts

DUPLICATE — 25pts

Score × Wave multiplier

▶ BRONZE 0–299

▶ SILVER 300–699

▶ GOLD 700–1199

▶ PLATINUM 1200+

Leaderboard

// 07 · Contact

AVAILABLE FOR NEW OPPORTUNITIES

Let's build
something extraordinary.

Data engineering roles, platform architecture, distributed systems. Kathmandu — available globally.

404.nepal.prashant@gmail.com

Download Resume

Full CV · opens in browser · Ctrl+P to save as PDF

PRASHANTNEPAL

The lakehouse in motion.

Stack, ranked by depth.

Where I've shipped.

Numbers I own.

Type help to explore.

Protect your lakehouse.

Download Resume

Type `help` to explore.