Amar Prakash Pandey - ᕦ(ò_óˇ)ᕤ
Home
About
Projects
Blog
2025
Apr, 07
From Bottlenecks to Balance: Dynamic Skew Join Fixes in Spark
Mar, 30
4TB RAM, Yet an OOM Error? Debugging a Spark Memory Mystery
Mar, 23
Deep Dive into Spark Jobs and Stages
2024
Nov, 05
Balancing the RUM Conjecture: Navigating Database Trade-Offs
Oct, 15
The CAP Theorem: Balancing the Big Three in Distributed Databases
May, 24
Fine-Tuning Shuffle Partitions in Apache Spark for Maximum Efficiency
May, 22
Handling Large Broadcast Joins in Apache Spark
2022
Jul, 19
Symptoms of Bad Code
Jan, 23
Docker - the right way
Jan, 04
GitOps - the easy way
2018
Jul, 28
Finger Detection and Tracking using OpenCV and Python
2017
Jul, 02
What is Google Summer of Code? How to prepare for it?
Tags
#spark
#big data
#optimization
#data engineering
#performance
#spark-joins
#Apache-Spark
#Spark
#Big-Data
#Data-Processing