Ethan Boneh

Ethan Boneh

EE & CS @ Stanford

About

Hi! I'm Ethan, an Electrical Engineering and Computer Science student at Stanford. I'm a research assistant at the Scaling Intelligence Lab (SAIL) and previously interned on Apple's Platform Architecture team. I'm interested in AI systems, GPU kernel optimization, and hardware-software co-design. My recent work spans evolutionary search methods, domain-specific languages for GPU programming, and autonomous semiconductor fabrication. Outside of work, I like old movies, magical realism, tennis, and playing surf rock.

Publications

DSL-Monkeys: Self-Generated In-Context Examples for Low-Resource GPU DSL Kernels

Nathan Paek, Simon Guo, Vishnu Sarukkai, Willy Chan, William Hu, Ethan Boneh, Simran Arora, Ludwig Schmidt, Kayvon Fatahalian, Azalia Mirhoseini

ICLR 2026 Workshop on Test-Time Updates

EvoX: Meta-Evolution for Automated Discovery

Shu Liu, Shubham Agarwal, Monishwaran Maheswaran, Mert Cemri, Zhifei Li, Qiuyang Mang, Ashwin Naren, Ethan Boneh, Audrey Cheng, Melissa Z. Pan, Alexander Du, Kurt Keutzer, Alexandros G. Dimakis, Koushik Sen, Matei Zaharia, Ion Stoica

arXiv Preprint

Multi-VT in Oxide-Semiconductor Transistors Leveraging Sub-1-nm Dipoles for Low-Refresh Energy Gain Cell Memory

Fabia Farlin Athena, Jimin Kang, Matthias Passlack, Nathaniel Safron, Didem Dede, Koustav Jana, Balreen Saini, Xinxin Wang, Shuhan Liu, Jonathan Hartanto, Ethan Boneh, Hugo J.-Y. Chen, Chi-Hsin Huang, Qing Lin, Donglai Zhong, Kaitlyn Leitherer, Paul C. McIntyre, Gregory Pitner, Iuliana P. Radu, H.-S. Philip Wong

IEEE Transactions on Electron Devices, 2025

Optimization is Key to High-Temperature Reliability in Oxide-Semiconductor FETs

Jack C. Evans, Fabia F. Athena, Koustav Jana, Balreen Saini, Shuhan Liu, Ethan Boneh, Paul C. McIntyre, H.-S. Philip Wong

Device Research Conference (DRC), 2025

Projects

Desktop Autonomous Chip Fabrication

TreeHacks 2026PyTorchClaude

AI-driven closed-loop system that autonomously designs, executes, evaluates, and improves semiconductor fabrication experiments. Uses evolutionary search and computer vision to optimize lithography parameters without human intervention.

KernelBench-Tinker

RLCUDAModal

End-to-end RL pipeline for GPU kernel optimization. Language models generate CUDA kernels, which are evaluated on cloud GPUs via Modal, with results converted into training rewards for distributed LoRA fine-tuning.

HelionEvolve

HelionGPUEvolutionary Search

Evolutionary optimization for GPU kernels using the Helion DSL. Applies automated search to discover high-performance kernel implementations.

MangoKart

CRISC-V3D Graphics

3D graphics engine and driving game on a RISC-V MangoPi board. Built a full rendering pipeline with LookAt projection, occlusion culling, and flat shading, controlled by a real steering wheel via gyroscope.

ViReal

UnityCUDANeRF

VR social platform that converts ordinary photos and videos into interactive 3D environments using neural radiance fields and real-time mesh generation. No expensive equipment needed.