Software Engineer

|
Software Engineer at Oracle building production Java services on OCI. I design AI agent systems, backend APIs, and scalable full-stack applications across enterprise and startup environments.

πŸ“‹ The Short Version:

  • Currently: Software Engineer at Oracle building Java REST APIs on OCI for enterprise SaaS
  • Previously: Anguliyam (multi-agent systems, RAG), Adobe (React/TS), EPAM (data engineering)
  • Core Focus: Multi-agent orchestration, enterprise retrieval systems, and production backend reliability
  • Impact: 60%+ SQL response improvement, 70% lower manual processing, and 0%β†’91% test coverage growth
  • Stack: Python, Java, TypeScript Β· AWS, GCP, OCI Β· FastAPI, Spring Boot, Next.js
Lahari Karrotu

What I Bring to the Table

Numbers from real work β€” not hypotheticals

πŸ“¦
4
Core Repos Highlighted
HealthScan, SmartBuy, Resume Tailor, Blinds
πŸ€–
3+
Agentic AI Projects
Orchestration, RAG, automation
🏒
4
Companies Shipped At
Oracle, Anguliyam, Adobe, EPAM
πŸš€
60%+
DB Improvement
Oracle DB 23ai response-time cut
βœ…
70%
Manual Work Reduced
Agent workflows at Anguliyam
⚑
+40%
RAG Relevance Lift
Hybrid retrieval pipelines
🎨
91%
Coverage Increase
Adobe feature module tests
πŸ‘₯
0
Rollbacks
Oracle sprint delivery track record

What I Build

A quick look at what keeps me busy

AI Healthcare Apps β€” prescription scanning, drug interaction checking, medical document processing
Full-Stack Platforms β€” eCommerce with AI assistants, task management, fitness tracking
Computer Vision Systems β€” window detection, virtual try-on, 3D overlay rendering
Voice Assistants β€” speech-to-text, wake-word detection, accessibility-first design
Production Backend APIs β€” Spring Boot, FastAPI, Redis caching, OAuth2, HL7/FHIR
Cloud & DevOps β€” Oracle Cloud, Azure, AWS, Jenkins CI/CD, Docker, Liquibase

About Me

The person behind the commits

I'm Lahari β€” a software engineer based in San Francisco focused on AI agent systems, backend APIs, and scalable full-stack products. At Oracle, I build production Java services on OCI with reliability, performance, and compliance constraints in mind.
Outside of work, I ship project-heavy systems: agentic healthcare workflows, enterprise-style RAG pipelines, multimodal shopping assistants, and practical AI tooling for job applications. I care about systems that hold up in production, not just demos.
I hold a Master's in Computer Science from Florida Tech and an AWS Solutions Architect certification. I like building clean, maintainable software that solves real business problems under real-world constraints.

Where I've Worked

Production code at 4 companies β€” here are the highlights

Software Engineer

Oracle

May 2025 – Present

  • β–ΈBuilding Java REST APIs on OCI for enterprise SaaS clients in healthcare and finance with strict uptime/compliance requirements
  • β–ΈResolved a critical Oracle DB 23ai query bottleneck under concurrent load β€” execution plan analysis + index strategy drove 60%+ response-time improvement
  • β–ΈAutomated manual data-load workflow using Python on OCI, removing repeated weekly engineer intervention
  • β–ΈMaintaining Jenkins CI/CD quality gates with JUnit coverage and consistent zero-rollback sprint delivery

Software Engineer β€” AI & Full-Stack Systems

Anguliyam

Jun 2024 – Apr 2025

  • β–ΈDesigned and deployed multi-agent orchestration systems with LangGraph and CrewAI, reducing manual processing by 70%
  • β–ΈBuilt production RAG with hybrid retrieval (dense vector + keyword), improving answer relevance by 40% over baseline
  • β–ΈImplemented human-in-the-loop governance with approval gates and audit logging for sensitive enterprise workflows
  • β–ΈIntegrated external tools, CRMs, and voice systems to trigger downstream actions beyond text-only responses

Software Engineer Intern

Adobe

Jan 2022 – Aug 2022

  • β–ΈBuilt React and TypeScript UI components for Adobe Experience Cloud, focusing on accessible and consistent interfaces
  • β–ΈIntegrated Adobe Experience Platform APIs with graceful handling for partial data and API error states
  • β–ΈAdded Jest + React Testing Library coverage, catching a regression and increasing module coverage from 0% to 91%
  • β–ΈContributed to sprint design/code reviews and technical documentation in a 20+ engineer cross-functional setup

Software Engineer Intern

EPAM Systems

Dec 2020 – Mar 2021

  • β–ΈBuilt Python and Java ETL modules with tests and documentation for client-facing data engineering workflows
  • β–ΈWrote SQL extraction/transformation/reporting queries powering downstream analytics and client financial reporting
  • β–ΈDelivered sprint commitments in a global Agile team using JIRA, Git branching, and peer-reviewed code practices

GitHub Activity

Live commit activity and open source contributions

Commit Activity

Active
GitHub Commit Activity Graph

Real-time commit activity over the past year

Real-Time Workflow Orchestration

Live visualization of production AI pipelines and distributed system workflows

uptime
99.9%
latency
<300ms
throughput
1K+ req/min
cache Hit Rate
95%

LLM Processing

Queries/min
0

Vector Search

Embeddings
0

API Gateway

Requests/sec
0

Frontend

Active Users
0
Distributed System
Real-time Processing

Live Workflow: This visualization demonstrates real-time AI pipeline orchestrationβ€”from LLM processing and vector search to API gateway and frontend delivery. Each step shows actual metrics and processing states, reflecting production system behavior with fault tolerance, monitoring, and scalable architecture.

System Design

Architecture, scalability, and distributed systems expertise

SmartBuy AI: Production E-Commerce Platform with AI Integration

A scalable full-stack eCommerce platform with AI-powered navigation, real-time inventory management, and optimized performance.

Architecture Overview

  • β€’Frontend: Next.js (SSR/SSG) with React, deployed on Vercel Edge Network for global CDN distribution
  • β€’Backend: FastAPI microservices with async/await for scalable request handling
  • β€’Database: PostgreSQL with connection pooling, fact/dimension tables for analytics, Redis cache layer (95% hit rate)
  • β€’Data Pipelines: Python + SQL ETL workflows with Airflow orchestration, data quality checks, and analytics-ready schemas
  • β€’AI Service: OpenAI API with request batching, response caching, and fallback mechanisms
  • β€’Vector Search: Pinecone for semantic product search with 50K+ product embeddings

Scalability & Performance

  • β€’Load Handling: Scalable architecture with horizontal scaling, connection pooling, and caching strategies
  • β€’Response Times: API latency <300ms (p95), AI queries <2s, page loads <1.2s
  • β€’Caching Strategy: Multi-layer (CDN, Redis, in-memory) reducing DB load by 80%
  • β€’Database: Read replicas for scaling, connection pooling (max 100 connections), query optimization
  • β€’Monitoring: Real-time metrics (Prometheus), error tracking (Sentry), APM (New Relic)

Key Design Decisions & Trade-offs

1. Microservices vs Monolith

Chose FastAPI microservices for independent scaling of AI service vs. core eCommerce logic, accepting added complexity for better resource utilization.

2. Vector DB Selection

Pinecone over self-hosted (FAISS/Milvus) for managed scalability and lower ops overhead, trading cost for reliability.

3. Caching Strategy

Multi-layer caching (CDN β†’ Redis β†’ DB) prioritized read performance, accepting eventual consistency for product data.

4. Failure Handling

Circuit breakers for AI API, graceful degradation to keyword search, retry logic with exponential backoff for improved reliability during service outages.

Scaling Strategy

Horizontal Scaling

FastAPI instances behind load balancer, auto-scaling based on CPU/memory (2-10 instances), stateless design for easy scaling.

Database Scaling

Read replicas for query distribution, connection pooling, query optimization, and planned sharding for 100K+ products.

Cost Optimization

AI request batching, response caching (60% cache hit rate), CDN for static assets, reducing infrastructure costs by 40%.

Why Work With Me

Here's what I think sets me apart

πŸš€

I Ship Real Products

I build complete systems, not isolated demos: HealthScan (agentic healthcare assistant), SmartBuy v2 (multimodal shopping agent), Blinds & Boundaries (AI try-on), and AI Resume Tailor (application workflow automation).

πŸ”§

I Fix What's Broken

At Oracle: fixed API failures (87%β†’98%), optimized a 6-second query to 2 seconds. At Adobe: found a Redux race condition nobody else could trace. I debug production issues with patience and precision.

🧠

I Learn Fast & Go Deep

I work across Java, Python, and TypeScript based on system needs. That range lets me move from DB performance debugging and CI/CD to agent orchestration and production UI delivery without handoff bottlenecks.

Projects

Selected shipped projects with production-grade architecture

SmartBuy v2 β€” Multimodal AI Shopping Agent
Live
Full-Stack β€’ 2024

SmartBuy v2 β€” Multimodal AI Shopping Agent

πŸŽ™οΈInteraction Mode
Voice + Webcam + Text
🧡Runtime
Persistent WebSocket

A real multimodal shopping agent with persistent WebSocket connectivity to Gemini Multimodal Live API for simultaneous voice, webcam context, and two-way interaction.

ReactTypeScriptGemini Multimodal Live APIWebSockets+3
ScanX (HealthScan) β€” AI Healthcare Assistant
Completed
AI/ML β€’ 2025

ScanX (HealthScan) β€” AI Healthcare Assistant

πŸ—οΈArchitecture
3-Engine Agent System
πŸ’ŠMedication Safety
RxNav/RxNorm Checks

Agentic AI healthcare assistant with a 3-engine architecture for prescription extraction, action planning, and browser automation with interaction safety checks.

Next.js 15TypeScriptFastAPIReact Native+7
AI Resume Tailor β€” Full-Stack Job Application Platform
Completed
Full-Stack β€’ 2025

AI Resume Tailor β€” Full-Stack Job Application Platform

🧱Architecture
Next.js API Routes Only
πŸ€–Model
Anthropic Claude

A Vercel-only Next.js App Router platform that parses job descriptions with Claude AI, tailors resume DOCX output via XML manipulation, and logs applications to Google Sheets.

Next.js 14TypeScriptAnthropic Claude APIGoogle Sheets API+2
Blinds & Boundaries β€” AI Virtual Try-On Platform
Live
AI/ML β€’ 2024

Blinds & Boundaries β€” AI Virtual Try-On Platform

🎯Detection Accuracy
88%
πŸ“ΈImages Processed
500+

Virtual try-on application for window blinds using computer vision and 3D rendering. Enables customers to visualize products in their actual space with realistic lighting and perspective. Deployed on Azure with real-time image processing.

ReactTypeScriptFastAPIPython+5
Blinds Pro - Advanced Visualization Platform
Completed
AI/ML β€’ 2024

Blinds Pro - Advanced Visualization Platform

⚑Processing Speed
2.0s
πŸ“ŠBatch Processing
10x

Enhanced version of the virtual try-on application with additional features for professional interior design and commercial applications.

TypeScriptReactNext.jsThree.js+3
Railway Predictive Maintenance System
Production
AI/ML β€’ 2024

Railway Predictive Maintenance System

🎯Model Accuracy
90%+
⚑Processing Latency
<200ms

Real-time predictive maintenance system for railway equipment using machine learning. Processes sensor data streams with TensorFlow LSTM models to predict equipment failures, enabling proactive maintenance and reducing downtime.

PythonTensorFlowApache SparkAWS Lambda+4
Auto Loan AI Processing System
Live
Full-Stack β€’ 2024

Auto Loan AI Processing System

🎯OCR Accuracy
95%+
⚑Processing Time
30-60s

AI-powered loan processing system using AWS Textract OCR and voice assistants, reducing processing time by 40% and improving accuracy.

ReactTypeScriptPythonFastAPI+5
Fitness Transformation Application
Completed
Full-Stack β€’ 2024

Fitness Transformation Application

πŸ“ˆUser Retention
78%
🎯Voice Recognition Accuracy
94%

AI-driven fitness application with personalized recommendations, voice navigation, and intelligent workout planning using LLM APIs.

ReactTypeScriptNode.jsOpenAI API+3
AI Job Hunter - Intelligent Job Search Platform
Completed
Full-Stack β€’ 2024

AI Job Hunter - Intelligent Job Search Platform

🎯Match Accuracy
88%
πŸ“ŠJob Discovery Rate
+60%

AI-powered job search and application platform that uses machine learning to match candidates with relevant opportunities and optimize application strategies.

TypeScriptReactNext.jsPython+4
Taskify Pro - Smart Task Management
Completed
Full-Stack β€’ 2023

Taskify Pro - Smart Task Management

⚑Real-time Sync Latency
<50ms
πŸ‘₯Concurrent Users
1000+

Enterprise-grade task management application with real-time collaboration, AI-powered prioritization, and advanced notification systems.

JavaScriptNode.jsExpress.jsMongoDB+3
Personal Portfolio & Showcase
Completed
Full-Stack β€’ 2023

Personal Portfolio & Showcase

πŸ†Lighthouse Score
100
⚑Page Load Time
0.8s

Comprehensive personal portfolio website showcasing professional projects, skills, and achievements with modern design and interactive features.

TypeScriptReactNext.jsTailwind CSS+1

Let's Work Together

I'm always open to interesting conversations β€” whether it's about a role, a collaboration, or just a cool technical problem. Let's chat.

Get In Touch

I'd love to hear from you! Let's discuss your next project.

Or email me directly at laharikarrothu@gmail.com

Professional Profiles

What I'm Currently Into

  • β€’ Building AI agents that can actually do useful things (not just chatbots)
  • β€’ Distributed systems design β€” event-driven architecture, message queues
  • β€’ Healthcare tech β€” making HL7/FHIR less painful for everyone
  • β€’ Open source contributions β€” looking for interesting projects to contribute to

Case Study: SmartBuy AI

How I shipped an AI shopping assistant end-to-end

Overview

SmartBuy AI embeds a conversational assistant inside a full-stack eCommerce experience. It handles search, recommendations, and navigation with a lightweight vector index and FastAPI services.

Problem

Users struggled to find products quickly and dropped off after slow search or poor recommendations.

Solution

Conversational assistant + semantic search backed by FastAPI, embeddings, and cached intents for sub-3s responses.

Tech Stack

Next.js, TypeScript, FastAPI, PostgreSQL, OpenAI API, Tailwind, Vercel

My Role

Architecture, API design, AI integration, frontend UX, deployment.

Results

  • β€’ AI query responses in <3s with cached intents
  • β€’ Faster product discovery via semantic search and voice/text navigation
  • β€’ Production-ready deployment with monitoring and uptime focus

Architecture (high level)

Client (Next.js) β†’ API Gateway (FastAPI) β†’ LLM Orchestrator (OpenAI, LangChain)
Vector Store (embeddings) for semantic search + intent caching
PostgreSQL for products/orders Β· Background jobs for sync/index updates
Vercel + Azure/AWS services Β· Observability via logs/metrics
Next.js UIFastAPI GatewayLLM (OpenAI)LangChain/RAGVector StorePostgreSQL

Architecture: ScanX Agent Loop

Frontend (Next.js) captures screenshots/specs β†’ uploads to FastAPI
FastAPI routes to Orchestrator (LangChain + OpenAI) to parse layout and tasks
Vector store (PostgreSQL/pgvector) for similarity + retrieval
Action planner emits task cards β†’ returned to UI for human-in-the-loop confirmation
Next.js UIFastAPI IngestLLM OrchestratorParser / PlannerVector StoreTask Cards UI

Tech Stack

Tools I actually use β€” not just list on a resume

Languages
PythonJavaTypeScriptJavaScriptSQL
Frameworks
FastAPISpring BootReactNext.jsReact NativeNode.js
AI/ML
LangGraphCrewAILangChainAWS BedrockOpenAI GPT-4oGemini ProRAG
Cloud
OCIAWSGCPAzure App ServiceAzure Blob StorageVercel
Databases
Oracle DB 23aiPostgreSQLRedisMongoDBSupabase
Observability
PrometheusSentryPostmanPlaywright
Tools
DockerJenkinsGitHub ActionsGitJIRAAgile/Scrum

Technical Expertise

Technologies and tools I work with

🧩Full-Stack & Backend

βš›οΈReact
πŸ”„Redux
πŸ“˜TypeScript
🟒Node.js
⚑FastAPI
πŸƒSpring Boot
🐘PostgreSQL
πŸ”΄Oracle Database
🧠Redis
πŸ”—REST APIs

πŸ€–AI / ML & Agents

🧠LangGraph
🀝CrewAI
⛓️LangChain
πŸ“šRAG Pipelines
✨Gemini Pro / GPT-4o
πŸ“Tesseract OCR

πŸ“‘Data Engineering & Pipelines

πŸ—„οΈSQL (PostgreSQL, MySQL)
πŸ”΄Oracle Database
πŸ”„ETL Pipelines (Python/Pandas)
βœ…Data Validation
πŸ“‹Liquibase Migrations
πŸ”EXPLAIN PLAN / ANALYZE

☁️Cloud & Infrastructure

πŸ”΄Oracle Cloud (OCI)
☁️AWS (Lambda, EC2, S3, Bedrock)
🌐GCP
πŸ”·Azure (App Service, Blob Storage)
🐳Docker
πŸ”„Jenkins CI/CD
πŸ› οΈGitHub Actions
⚑Vercel
πŸ“‘Prometheus + Sentry

Education

Academic background in Computer Science

Master of Science in Computer Science

Florida Institute of Technology

Aug 2022 – May 2024

GPA: 3.6/4.0

Bachelor of Science in Computer Science

KL University

2022

GPA: 3.8/4.0

Certifications & Badges

Professional certifications and verified achievements

I Write Too

Thoughts on AI, distributed systems, and what I'm learning

How I Think About Engineering

Deep dives into the problems I've solved and the trade-offs I've made

Backend Engineering

Scaling FastAPI for High-Throughput AI Workloads

How I optimized FastAPI microservices for scalable request handling, implementing connection pooling, async/await patterns, and request batching for AI API calls.

Async ArchitectureConnection PoolingRequest BatchingPerformance Optimization
Metrics:Optimized request handling, improved latency, cost-effective scaling
AI/ML Systems

Vector Database Optimization for Semantic Search

Deep dive into optimizing Pinecone vector search for 50K+ product embeddings, achieving 99.5% retrieval accuracy with &lt;500ms query time through embedding compression and indexing strategies.

Vector DatabasesEmbedding OptimizationSemantic SearchRAG Systems
Metrics:50K+ embeddings, 99.5% accuracy, &lt;500ms query time
Distributed Systems

Building Fault-Tolerant LLM Pipelines

Designing resilient AI workflows with circuit breakers, graceful degradation, retry logic, and fallback mechanisms for improved reliability during AI service outages.

Fault ToleranceCircuit BreakersGraceful DegradationError Handling
Metrics:Improved reliability, minimal monitoring overhead, robust error handling
Performance Engineering

Multi-Layer Caching Strategy for E-Commerce

Implementing CDN β†’ Redis β†’ Database caching layers, achieving 95% cache hit rate and reducing database load by 80% while maintaining data consistency.

Caching StrategiesCDN OptimizationRedisCache Invalidation
Metrics:95% cache hit rate, 80% DB load reduction, &lt;1.2s page loads
Observability

Real-Time Monitoring for Production AI Systems

Building comprehensive monitoring with Prometheus, Sentry, and custom metrics for LLM pipelines, enabling real-time alerting and performance tracking with &lt;100ms overhead.

MonitoringObservabilityAlertingPerformance Tracking
Metrics:Minimal monitoring overhead, real-time alerting, improved system reliability
Data Engineering

ETL Pipeline Design for Analytics at Scale

Architecting Python + SQL ETL pipelines processing 1,000+ structured records/day with data quality checks, fact/dimension table design, and Airflow orchestration, improving query performance by 35% and reducing data issues by 40%.

ETL PipelinesData ModelingSQL OptimizationData QualityAirflow
Metrics:1,000+ records/day, 35% query improvement, 40% fewer data issues