rishabh@web:~$ cat README.md

---

name: Rishabh Pandey

role: Production Engineer @ Meta

education: Purdue CS, Dec 2024

status: open to backend & applied AI roles

---

Rishabh Pandey

I'm a production engineer at Meta, working on the reliability of large distributed systems — observability, capacity, and incident response on a database tier powering an ads platform. Before that: event pipelines and cloud infrastructure at Geico, and internships at Geico and Texas Instruments.

These days the problems that pull me in sit at the intersection of backend systems and applied AI — I've shipped LLM pipelines at Meta and agentic projects on the side, and I'm looking to go deeper on both.

## start here

→experience/meta.mdwhat I do now →skills.yamllanguages & tooling →projects/things I've built →contact.jsonget in touch

rishabh@web:~$ cat skills.yaml

languages:

python

go

typescript

node

java

c/c++ sql

hack

infra:

docker

kubernetes

helm

terraform

github-actions

unix/linux

cloud:

aws — developer associate azure

gcp

ai_mlops:

fastapi

langchain

scikit-learn vector search eval / benchmarking

rishabh@web:~$ cat experience/meta.md

Meta — Production Engineer

Sep 2025 — present

Reliability of a distributed database tier powering an ads platform.

Owned the investigation that root-caused recurring database-throttling SEVs (~5–6 per half, ~30–40 unactioned alerts) to cross-team callers reusing shared functions and exhausting the owning team's QPS budget.

Instrumented per-callsite QPS observability via 70%/90% threshold detectors and method-signature attribution; designed and drove a callsite-forking remediation that segments ownership and isolates traffic per caller.

Built an applied ML pipeline on Llama 3.3 — structured summarization, embeddings, cosine-similarity clustering — grouping 8,000+ SEV follow-up tasks into actionable clusters for batch resolution.

Executed a subdomain migration to a dedicated tenant for a platform serving thousands of DAU; cut p95 latency 57% and shrank the fault-isolation domain.

→ next: experience/geico.md

rishabh@web:~$ cat experience/geico.md

Geico — Software Engineer

Jan — Sep 2025

Serverless event pipelines and contact-center observability on AWS.

Authored Python Lambda services that normalize contact events, batch-write to DynamoDB, and fan out via EventBridge with DLQs and exponential backoff — 500–700K events/day at ~90% test coverage on least-privilege Terraform modules.

Rolled out Amazon Connect queue observability across 800+ queues with SNS + Slack alerting; cut batch metrics latency 12× (120 s → 10 s) via request coalescing and concurrent SDK fan-out.

Software Engineering Intern

Summer 2024

Deployed JupyterHub on Azure Kubernetes Service with Azure AD SSO/RBAC and Spark/ADLS connectivity; the lake-first analytics workflow contributed to a ~25% reduction in team Snowflake spend.

→ next: experience/texas-instruments.md

rishabh@web:~$ cat experience/texas-instruments.md

Texas Instruments — SWE Intern

Summer 2023

Internal tooling for licensing operations.

Engineered an Oracle APEX + PL/SQL internal tool that replaced a 4–6 hour manual lookup with a ~5-second self-serve flow, used daily by licensing ops across 250+ products.

→ next: projects/paste-service.md

rishabh@web:~$ cat projects/paste-service.md

Local Paste Service

repo ↗

go

sqlite

launchd

A zero-dependency pastebin that lives entirely on your machine — a single 20 MB Go binary backed by SQLite in WAL mode for concurrent local storage. CLI-first workflow, automatic TTL expiration via a background garbage collector.

→ next: projects/interview-agent.md

rishabh@web:~$ cat projects/interview-agent.md

LinkedIn Interview Agent

repo ↗

fastapi

react gpt-4o

faiss

An agentic RAG interview platform: ingests a job description and resume, maps skills semantically with FAISS, and generates adaptive questions conditioned on the candidate's gaps. Instrumented end to end — p50/p95 latency and $/session tracking, with question generation at ~2 s p95.

→ next: contact.json

rishabh@web:~$ git log --oneline -- career

a3f9c21 (HEAD -> main) feat(meta): root-cause recurring db-throttling sevs to cross-team qps exhaustion

e8d04b7 feat(meta): per-callsite qps detectors + callsite-forking remediation

5c19f3e feat(meta): llama-3.3 pipeline clusters 8,000+ sev follow-ups for batch resolution

9b72a4d perf(meta): tenant migration cuts p95 latency 57%

f4c8e21 feat(geico): serverless event pipeline, 500–700k events/day at ~90% coverage

d2c7b90 perf(geico): batch metrics latency 120s → 10s (12×)

7a3d5e8 feat(geico): jupyterhub on aks with azure ad sso/rbac

1e9f6c2 feat(ti): 4–6h licensing lookup → ~5s self-serve flow

8c4b7d1 docs(purdue): b.s. computer science, dec 2024

0000000 init: hello, world

Full detail lives in the individual files — experience/ and projects/ in the tree.

rishabh@web:~$ cat contact.json

{

"email": "",

"github": "github.com/rishabhxpandey",

"linkedin": "linkedin.com/in/pandey-rishabh",

"response_time": "usually < 24h"

}

Email is fastest. Happy to talk about production engineering, reliability, infra tooling — or anything you've found on this site.