Loading
Loading
Welcome to my portfolio
Specializing in AI Voice Agent platforms, real-time telephony integrations (Pipecat, Daily.co, Twilio), LLM orchestration, and Graph RAG pipelines
Featured Work
AI Voice Agents, distributed microservices, event-driven architectures, and real-time streaming applications
Express, FastAPI, Whisper.cpp, Neo4j, Graph RAG, OpenAI
A voice-enabled conversational agent for navigating product user manuals. Features a speech-to-text pipeline using Whisper.cpp integrated with Node.js via WebSocket, a RAG system using Neo4j as a knowledge graph, and a FastAPI backend integrating OpenAI LLMs for contextual response generation.
Next.js, TypeScript, Socket.IO, Google Gemini AI, Node.js, Express
An intelligent form assistant that guides users through data entry via natural conversation, automatically extracting and populating form fields in real-time. Built with WebSocket-based communication between a Next.js frontend and Node.js backend, integrating Google's Gemini AI for conversational form completion.
Go, WebSockets, Redis Pub/Sub, JWT, PostgreSQL
Cross-platform synchronization backend supporting 1,000+ concurrent users across macOS and Android. Features scalable messaging architecture using Redis Pub/Sub with JWT-based authentication, WebSockets for real-time communication, and PostgreSQL for data persistence.
Go, Docker, Nginx, Redis, Docker Compose
A distributed API gateway with a Go-based rate-limiting microservice, leveraging Redis and a custom Token Bucket algorithm to manage and protect API endpoints. Engineered a scalable microservices architecture using Docker and Nginx for load-balanced request distribution.
Python, Redis, Decorators
A Python decorator providing Redis-backed distributed caching with automatic invalidation. Detects code modifications via source code hashing and automatically purges stale cache entries across all application instances, eliminating manual cache management.
Insights & Knowledge
Deep dives into distributed systems, database internals, performance optimization, and scalability patterns
This article explores how Node.js streams work under the hood, tracing the complete journey of HTTP request data from network packets to your application code, and response data from your code back to the network. Covers internal buffering, backpressure mechanisms, and the multi-layered architecture that makes streams efficient.
A comprehensive guide to PostgreSQL's Multi-Version Concurrency Control (MVCC) system. Explores transaction IDs, system columns (xmin/xmax), and infomask flags through hands-on examples using pageinspect. Demonstrates how PostgreSQL maintains data consistency across concurrent transactions and explains the role of vacuum in cleaning up obsolete row versions.
This article discusses building a distributed rate limiter using Go, Redis, and Lua to control the flow of requests, prevent abuse, ensure fair use, and protect backend resources from overload.
An overview of fault-tolerant systems, which are designed to continue operating without interruption despite the failure of one or more of its components
An article that explains how traditional databases that use B-Trees data structure and indexes can limit the shelf life of Solid States Drives (SSD).
An explanation of native graph databases, which are optimized for storing and processing graph data. They use index-free adjacency, meaning that each node directly references its adjacent nodes.
A look into the Node.js event loop, a mechanism that allows Node.js to perform non-blocking I/O operations despite being single-threaded, by offloading operations to the system kernel whenever possible.
An article detailing SSTables (Sorted String Tables) and LSM-Trees (Log-Structured Merge-Trees), which are used in modern databases to optimize for high-speed read/write operations and handle massive data sets.
Deep dive into PostgreSQL covering indexes, shared buffers, and memory optimization. Learn why column order matters, how to size shared buffers properly, tackle large dataset challenges through partitioning, and understand MVCC's impact on query performance.
Get to Know Me

Full-Stack Software Engineer specializing in scalable GenAI applications, AI Voice Agent platforms, and production backend systems.
Building real-time conversational AI pipelines with Pipecat, Daily.co, and WebRTC telephony. Expert in LangChain/LangGraph orchestration, Graph RAG systems, and prompt engineering for voice agents achieving sub-500ms latencies.
Master of Science in Computer Science
California State University, Fullerton • 2022 – 2024
Technical Skills
Certifications
1K+
Daily Users Served
35%
API Response Gains
70+
Students Taught