Portfolio
Featured Projects
Production-grade AI systems that demonstrate the intersection of cutting-edge ML research and practical engineering.
NeverAFK.ai
RAG-Powered Creator Support Platform
Production SaaS platform built on a LangGraph multi-agent RAG pipeline that uses GPT-4 to automate student support from indexed course content.
- Multi-agent RAG pipeline with semantic chunking
- Hybrid search (BM25 + vector) with reranking
- Serving 1000+ users in production
- Lemon Squeezy billing integration
Next.js 15, FastAPI, LangGraph, GPT-4, Pinecone, Supabase, OpenAI Whisper
01
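One common way to combine BM25 and vector results before reranking is reciprocal rank fusion (RRF). The sketch below is illustrative only: the source does not say which fusion method NeverAFK.ai uses, and the function name, the constant `k=60`, and the document IDs are assumptions.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists of document IDs into a single ranking.

    Each document scores 1 / (k + rank) per list it appears in; k=60 is a
    conventional default, assumed here rather than taken from the project.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties broken by ID so output is deterministic.
    return sorted(scores, key=lambda doc_id: (-scores[doc_id], doc_id))


# Hypothetical result lists from a keyword (BM25) index and a vector index.
bm25_hits = ["doc_a", "doc_b", "doc_c"]
vector_hits = ["doc_b", "doc_d", "doc_a"]

fused = reciprocal_rank_fusion([bm25_hits, vector_hits])
print(fused)
```

Documents that rank well in both lists rise to the top, and the fused list would then be handed to a reranker.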
LLM Council
Multi-Agent AI Consensus System
Three-stage agentic AI system in which GPT-4, Claude, and Gemini generate divergent answers, peer-review them, and then synthesize a consensus with citations.
- Multi-model orchestration with peer review
- Healthcare NER with ICD-10 codes (61% F1)
- Finance classification (100% accuracy)
- 80%+ Redis cache hit rate
Next.js 15, TypeScript, PostgreSQL, Drizzle ORM, Redis, GPT-4, Claude, Gemini
02
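The three stages (answer, peer review, synthesis) can be sketched roughly as follows. This is a minimal illustration, not the project's code: each model is represented by a plain callable, and `run_council`, `make_stub`, the prompts, and the member names are all hypothetical stand-ins for real API clients.

```python
from typing import Callable, Dict

Model = Callable[[str], str]  # takes a prompt, returns a completion


def run_council(question: str, members: Dict[str, Model], chair: Model) -> str:
    # Stage 1: every member answers the question independently.
    answers = {name: ask(question) for name, ask in members.items()}

    # Stage 2: every member reviews the answers written by the others.
    reviews = {}
    for name, ask in members.items():
        others = "\n".join(
            f"[{n}] {a}" for n, a in answers.items() if n != name
        )
        reviews[name] = ask(f"Review these answers to '{question}':\n{others}")

    # Stage 3: a chair model synthesizes a consensus, citing members by tag.
    briefing = "\n".join(
        f"[{n}] answer: {answers[n]}\n[{n}] review: {reviews[n]}"
        for n in members
    )
    return chair(f"Question: {question}\n{briefing}\nSynthesize a consensus.")


def make_stub(name: str) -> Model:
    """Offline stand-in for a real model client."""
    return lambda prompt: f"{name}-reply"


members = {n: make_stub(n) for n in ("model_a", "model_b", "model_c")}
seen_by_chair = []


def chair(prompt: str) -> str:
    seen_by_chair.append(prompt)
    return "consensus"


result = run_council("What is RAG?", members, chair)
```

Keeping the models behind a single callable type makes it straightforward to swap stubs for real clients or to cache responses.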
AI Agent Evaluation Framework
Anthropic/Toloka Partnership
Comprehensive evaluation framework for Claude AI agents across virtual environments with 15-20 integrated tools including Slack, Jira, and GitHub.
- Multi-step agentic workflow validation
- 97%+ accuracy gates in CI/CD
- MCP server integration
- Automated grading system
Python, Pytest, Docker, MCP, GitHub Actions, Claude
03
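An accuracy gate of this kind is often written as an ordinary Pytest test that fails the pipeline when graded accuracy drops below a threshold. The following is a sketch under assumptions: the 0.97 threshold mirrors the "97%+" figure above, while `accuracy`, `passes_gate`, and the graded results are hypothetical.

```python
ACCURACY_GATE = 0.97  # assumed threshold, mirroring the "97%+" gate


def accuracy(graded):
    """Fraction of graded tasks that passed (graded is a list of booleans)."""
    if not graded:
        raise ValueError("no graded results to evaluate")
    return sum(graded) / len(graded)


def passes_gate(graded, threshold=ACCURACY_GATE):
    return accuracy(graded) >= threshold


def test_agent_accuracy_gate():
    # Hypothetical grader output: 98 of 100 tasks completed correctly.
    graded = [True] * 98 + [False] * 2
    assert passes_gate(graded), f"accuracy {accuracy(graded):.2%} is below the gate"
```

Run under Pytest in a CI job, a failing assertion returns a non-zero exit code and blocks the merge.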
MetalQuery
Enterprise NLP-to-SQL System
Multimodal RAG chatbot for manufacturing KPI analysis with natural language to SQL conversion across 29 database tables.
- 90-100% query accuracy
- 12-layer security architecture
- Jailbreak & prompt injection prevention
- Role-based access control
FastAPI, Django, React, PostgreSQL, Groq Llama, ChromaDB
04
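One plausible layer in an NLP-to-SQL security stack is a check that model-generated SQL is a single read-only statement touching only tables the caller's role may see. The sketch below illustrates that idea only; it is not MetalQuery's architecture, and the role names, table names, and `is_query_allowed` are assumptions. A regex check like this is a coarse filter, and a production system would more likely rely on a real SQL parser plus database-level permissions.

```python
import re

# Hypothetical role-to-table allow-list.
ROLE_TABLES = {
    "analyst": {"production_kpi", "downtime_log"},
    "operator": {"production_kpi"},
}

WRITE_KEYWORDS = re.compile(
    r"\b(insert|update|delete|drop|alter|truncate|grant|create)\b", re.I
)
TABLE_REF = re.compile(r"\b(?:from|join)\s+([A-Za-z_][A-Za-z0-9_]*)", re.I)


def is_query_allowed(sql: str, role: str) -> bool:
    statement = sql.strip().rstrip(";")
    if ";" in statement:  # reject stacked statements
        return False
    if not statement.lower().startswith("select"):
        return False
    if WRITE_KEYWORDS.search(statement):
        return False
    tables = {t.lower() for t in TABLE_REF.findall(statement)}
    # Every referenced table must be on the role's allow-list.
    return bool(tables) and tables <= ROLE_TABLES.get(role, set())


print(is_query_allowed("SELECT AVG(yield) FROM production_kpi;", "operator"))
print(is_query_allowed("SELECT * FROM downtime_log", "operator"))
```

The first call is allowed and the second is rejected, because the hypothetical `operator` role has no access to `downtime_log`.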