Kush kush0o7

Kushagra Tandon

CS @ Arizona State University · Graduating May 2026
I build AI agents, break LLMs for a living, and occasionally predict electricity prices

Featured Projects

🔴 Redline — LLM Safety Evaluation Platform

Adversarial evaluation harness for LLM agents with automated safety grading

Proposer → Critic → Revision debate pipeline to expose model failure modes
Automated metrics: hallucination rate, refusal accuracy, policy compliance
Async evaluation via Redis/RQ with full trace storage in Postgres
Prompt-injection detection and controlled execution environments
Tech: Python · FastAPI · PostgreSQL · Redis · RQ · Docker

🛡️ Constitutional Safety Agent

Full-stack AI safety application enforcing configurable rules on LLM outputs

Classifier-based pre/post generation gating with configurable harm thresholds
Model-independent rule engine preventing jailbreak overrides
Red-team evaluation pipeline with structured trace logging
Tech: Python · FastAPI · React · Scikit-learn · Docker

🤖 VoiceAI Medical

Production-grade AI system for voice-driven patient scheduling

Real-time chat and voice workflows with hallucination-safe backend validation
Deployed on AWS EC2 with Nginx, Redis session persistence
Tech: Node.js · TypeScript · React · Vapi.ai · Groq · Redis · AWS

⚡ DE/LU Power Price Forecasting

Quantile forecasting model for EU electricity markets

LightGBM q10/q50/q90 model — 60% reduction in MAE vs baseline
Integrated Anthropic API with JSON audit logs and hard fallback paths
Tech: Python · LightGBM · Anthropic API · Pandas

🧠 QuantAI — Intent-Level Market Model

Multi-tenant backend for organizational intent inference

Semantic drift detection using embeddings to surface early signals
Tenant-scoped REST APIs with pgvector similarity and timeline queries
Tech: FastAPI · PostgreSQL · pgvector · SQLAlchemy · Docker

🌍 WorldweLivein

Macro scenario engine with probabilistic modeling

Monte Carlo simulations across geopolitical and economic scenarios
Integrated FRED and GDELT data sources with 1.03% MAE validation
Tech: React · TypeScript · Python · Monte Carlo

🐝 Hive — Outcome-Driven Agent Framework

Agent orchestration framework focused on measurable outcomes

Tech: Python

⚽ EPL Betting Model

Calibrated sports outcome predictor

XGBoost classifier · ROC-AUC 0.79 on 570 out-of-sample matches
Feature engineering with odds-derived metrics and ELO ratings
Tech: Python · XGBoost · Scikit-learn · Streamlit

📈 Momentum Strategy Lab

Systematic trading strategy with walk-forward validation

SMA crossover with ATR-based stops and portfolio simulation
Backtested across stocks, ETFs, and crypto
Tech: Python · Backtrader · yfinance

Skills

Languages: Python · C++ · TypeScript · Java · SQL
ML/AI: LightGBM · XGBoost · Scikit-learn · TensorFlow · Anthropic SDK · pgvector
LLM Systems: Adversarial evaluation · Red-team pipelines · Safety gating · Prompt-injection mitigation
Backend: FastAPI · Flask · Django · Redis · PostgreSQL · Docker · REST APIs
Frontend: React · TypeScript
Tools: Git · Docker · CI/CD · SQLAlchemy · Backtrader · AWS

Experience

Teleoperation Associate · Objectways Technologies LLC (2026 – Present)
Collecting and recording robot demonstration data using Trossen systems for ML training pipelines

Research Aide · ASU School of Earth & Space Exploration (Oct 2023 – May 2025)
Built data pipelines and virtual field modules for NSF-funded WORM Portal

Software Dev Intern · Hindustan Aeronautics Limited (Jun–Aug 2023)
Built internal dashboards for real-time aerospace telemetry monitoring

Open to roles in LLM evaluation, AI safety, ML engineering, quant systems, and forward deployed engineering

Provide feedback

Saved searches

Use saved searches to filter your results more quickly