Technical Handbook
In my tech journey, I’ve held diverse roles in both Data (Scientist, Analyst, Engineer) and Software Engineering (Fullstack, Backend, Data Platforms, Machine Learning). This varied background has taught me the value of continuous learning. From my undergraduate years to the present, I’ve kept honing my programming, research, and analytical skills, along with the tools and frameworks I rely on. The path isn’t always straightforward, but my commitment to learning, and to sharing that process openly, has resonated with my peers and with aspiring engineers and scientists.
I established this page to champion the spirit of lifelong learning, sharing insights I’ve gleaned from exploring diverse technologies. I hope these notes will aid your own learning journey.
Current Work & Personal Tech Stack: AWS, Docker, Kubernetes, Apache Pulsar, PySpark, Airflow, Flink, AWS Kinesis, Elasticsearch, PrestoSQL / Trino, GitLab CI/CD, Terraform, Python (Pandas, NumPy, Scikit-Learn, MLflow, PyTorch, TensorFlow, FastAPI, Flask, etc.)
Programming Languages
- ⭐ Python
- 🗄️ SQL - Structured Query Language
- 💡 JavaScript
- 📊 R
- 📝 YAML
- ⚡️ Spark
- 🔧 Scala
Computer Science Fundamentals
- 🚀 Software Engineering Concepts
- 📚 Data Structures
- 🔍 Algorithms
- 🧪 Object Oriented Programming
- 🎯 Patterns for Coding Interviews
- 🌟 Interview CheatSheet (Templates)
Systems Design & Infrastructure Concepts
- 🏛️ Fundamental Background (Computer and App Architecture)
- 🌍 Networking Topics
- 🤝 APIs - Application Programming Interface
- 🔄 Caching Basics & CDNs
- ⚖️ Load Balancers, Proxies & Hashing
- 🗄️ Storage (Databases, Replication & Sharding)
- 🗃️ Big Data - MapReduce & Message Queues
Full Stack Software Engineering
- 🐳 Docker
- ☸️ Kubernetes
- 📮 Postman & Protobuf
- 🐧 Linux/Unix and Terminal
- 🐙 Git and GitHub
- 🚀 CI/CD Fundamentals
- 🖥️ Microservices, VMs & Cloud Migration
Data Science Concepts & Tools
- 🔢 Linear Algebra Foundations
- ➗ Single and Multivariate Calculus Fundamentals
- 🎲 Probability Theory - Random Variables and Distributions
- 🎯 Inference - Hypothesis Testing & Confidence Intervals
- 📈 Time Series Analysis
- 🧪 A/B Testing
- 📊 Tableau
- 📚 PrestoSQL / Trino
- 🔧 Alteryx Software
Big Data Engineering Tools
- 🛠️ Big Data - Comprehensive Tech Stack
- 🐘 Hadoop, Hive and HDFS
- 🚀 Apache Spark
- 🌀 Apache Pulsar
- 🏔️ Apache Iceberg
- 🌬️ Apache Airflow
- 🔥 Apache Flink
- 📨 Apache Kafka
- 🌊 AWS Kinesis
- 🔍 Elasticsearch