2026 flagship builds

Projects

The portfolio is now anchored around AI-heavy, backend-heavy projects that better reflect the direction I am taking professionally: model building, agent orchestration, secure autonomy, and workflow engineering.

2026 Flagship builds

2026 · Decoder-only language model

Phoenix 125M

A LLaMA-style 125M parameter model trained from scratch on a single RTX 3080 Ti with a custom tokenizer, data pipeline, and training loop.

The project is an end-to-end exercise in model building: corpus curation, tokenization, training stability, benchmarking, and open-source packaging.

✓ ~2B tokens processed, Apache 2.0 release, WinoGrande 0.507.

PyTorch Transformers Tokenization Benchmarking Distributed training

2026 · Multilingual language models

Sweta-Hi and Sweta-Kn

Hindi and Kannada pretraining efforts built on a LLaMA-style architecture with custom tokenizers and an end-to-end multilingual data pipeline.

This work is focused on underrepresented language coverage, practical training throughput, and evaluation quality ahead of release.

✓ Custom tokenizers, async data loading, distributed training, near release.

Multilingual NLP Data engineering Custom tokenizers Model evaluation

2026 · Agentic content pipeline

LinkedIn Post Swarm

A multi-agent publishing workflow that uses Claude, Ollama, Playwright, and Telegram for draft generation, review, approval, and scheduled publishing.

The system includes critic-revision loops, source aggregation, state management, retries, and escalation paths so autonomy stays controllable.

✓ Human-in-the-loop approvals, resilient retries, scheduled output.

Agent orchestration Prompt engineering Playwright Telegram Bot API Workflow reliability

2026 · Autonomous AI security orchestrator

Rudra

A multi-agent offensive security architecture built around strict scope guardrails, sandboxed execution, and auditable event-driven workflows.

The emphasis is on safe autonomy: typed validation, retry budgets, isolation boundaries, and guardrails that make the system usable for serious testing.

✓ Scope validation, sandbox strategy, structured audit trail design.

AI security Sandbox design Distributed systems Event-driven architecture API integration

2026 · AI-powered lead generation workflow

LocalLeads

An end-to-end backend system for business discovery, AI content generation, site assembly, deployment, and personalized outreach.

Operational controls include SQLite state tracking, Telegram approvals, deployment automation, and delivery flows aimed at production-style reliability.

✓ Business discovery, approval gates, deployment automation, outreach delivery.

Backend development Playwright SQLite Workflow engineering Operational automation

Earlier work

Before the current generation of work, I used smaller ML and web projects to build the habits that still matter now: experimentation, debugging, and shipping complete systems.

2023

Semantic Search Engine

An earlier information retrieval build that combined semantic search ideas with enterprise documentation use cases and set up later work in retrieval-heavy AI systems.

2022

Super Resolution

An image enhancement project built to understand GAN-based vision pipelines and experiment rigor in visual ML work.

2022

Photo to Monet-style art

A CycleGAN style-transfer exploration that taught me a lot about training instability, qualitative evaluation, and visual debugging.

2022

Library Management

A MERN-stack build that sharpened my full-stack fundamentals around CRUD, search, and practical product structure.

Open-source and public work

Explore models, code, and experiments

All flagship projects are documented on GitHub. Model weights and cards are published on Hugging Face.