2026 flagship builds
Projects
The portfolio is now anchored around AI-heavy, backend-heavy projects that better reflect the direction I am taking professionally: model building, agent orchestration, secure autonomy, and workflow engineering.
2026 · Decoder-only language model
Phoenix 125M
A LLaMA-style 125M parameter model trained from scratch on a single RTX 3080 Ti with a custom tokenizer, data pipeline, and training loop.
The project is an end-to-end exercise in model building: corpus curation, tokenization, training stability, benchmarking, and open-source packaging.
✓ ~2B tokens processed, Apache 2.0 release, WinoGrande 0.507.
2026 · Multilingual language models
Sweta-Hi and Sweta-Kn
Hindi and Kannada pretraining efforts built on a LLaMA-style architecture with custom tokenizers and an end-to-end multilingual data pipeline.
This work is focused on underrepresented language coverage, practical training throughput, and evaluation quality ahead of release.
✓ Custom tokenizers, async data loading, released on HuggingFace.
2026 · Fine-tuning · Text-to-SQL
SQLForge: Mistral 7B QLoRA
A 4-bit QLoRA fine-tune that turns Mistral 7B v0.3 into a reliable text-to-SQL model. The same 12 GB GPU used for Phoenix 125M, with a 3.75 GB VRAM headroom budgeted up front, and a schema-aware evaluation rebuild after the first WikiSQL run showed the metric was lying.
The project is a focused engineering exercise in capability lift on consumer hardware: model selection by VRAM math, LoRA rank tuning, instruction-template-correct loss masking, and an evaluation harness that does true execution-accuracy comparison against table rows rather than string match.
✓ +77.8 percentage point exact-match lift, 97.4 percent valid SQL, ~8.25 GB peak VRAM on a 12 GB card.
2026 · Agentic content pipeline
LinkedIn Post Swarm
A multi-agent publishing workflow that uses Claude, Ollama, Playwright, and Telegram for draft generation, review, approval, and scheduled publishing.
The system includes critic-revision loops, source aggregation, state management, retries, and escalation paths so autonomy stays controllable.
✓ Human-in-the-loop approvals, resilient retries, scheduled output.
2026 · Autonomous AI security orchestrator
Rudra
A multi-agent offensive security architecture built around strict scope guardrails, sandboxed execution, and auditable event-driven workflows.
The emphasis is on safe autonomy: typed validation, retry budgets, isolation boundaries, and guardrails that make the system usable for serious testing.
✓ Recon + Analyst agents complete. Scope validation, sandbox, and audit trail fully designed.
2026 · AI-powered lead generation workflow
LocalLeads
An end-to-end backend system for business discovery, AI content generation, site assembly, deployment, and personalized outreach.
Operational controls include SQLite state tracking, Telegram approvals, deployment automation, and delivery flows aimed at production-style reliability.
✓ ~25 businesses contacted, live deployment automation, Telegram approval gates.
2026 · Autonomous trading intelligence system
ATIS
A 6-tier autonomous system that ingests research papers and filings, builds a causal knowledge graph, backtests theses with walk-forward and Monte Carlo validation, and generates daily ranked swing trade signals on 600 NSE/BSE stocks.
The system is built on three principles: every signal traces to a validated thesis in the knowledge graph, every LLM reasoning step is verified against Neo4j facts, and the architecture self-improves through Elo-based thesis lifecycle management and agent decision auditing.
✓ 59 agents built, Rust hot path implemented, 85/100 system effectiveness score on free data alone.
Earlier work
Before the current generation of work, I used smaller ML and web projects to build the habits that still matter now: experimentation, debugging, and shipping complete systems.
2023
Semantic Search Engine
An earlier information retrieval build that combined semantic search ideas with enterprise documentation use cases and set up later work in retrieval-heavy AI systems.
2022
Super Resolution
An image enhancement project built to understand GAN-based vision pipelines and experiment rigor in visual ML work.
2022
Photo to Monet-style art
A CycleGAN style-transfer exploration that taught me a lot about training instability, qualitative evaluation, and visual debugging.
2022
Library Management
A MERN-stack build that sharpened my full-stack fundamentals around CRUD, search, and practical product structure.
Open-source and public work
Explore models, code, and experiments
All flagship projects are documented on GitHub. Model weights and cards are published on Hugging Face.