About

Hello, I'm Gabe Mukobi! I am a Stanford M.S. student in Computer Science building a career in AI safety, alignment, and governance research. Additionally, as an AI safety field-builder, I lead the Stanford AI Alignment student group and research community.

Currently, my research focuses on technical AI governance: AI/ML research that improves our capacity to govern and regulate advanced AI systems but isn't strictly alignment or capabilities work. In particular, I've been working on evaluating, securing, and preventing the misuse of language model systems. Read my list of current research projects here and my agenda of research questions here.

My other interests include animal welfare, games, music, film, 3D art, photography, virtual reality, fantasy, and tea!

Gabriel Mukobi

Highlighted Publications:

Escalation Risks from Language Models in Military and Diplomatic Decision-Making

Juan-Pablo Rivera*, Gabriel Mukobi*, Ann-Katrin Reuel*, Max Lamparth, Chandler Smith, Jacquelyn Schneider

Accepted to the MASEC NeurIPS 2023 workshop.

Welfare Diplomacy: Benchmarking Language Model Cooperation

Gabriel Mukobi*, Hannah Erlebach, Niklas Lauffer, Lewis Hammond, Alan Chan, Jesse Clifton

In review at ICML 2024, accepted to the SoLaR NeurIPS 2023 workshop.

SuperHF: Supervised Iterative Learning from Human Feedback

Gabriel Mukobi*, Peter Chatain*, Su Fong*, Robert Windesheim, Gitta Kutyniok, Kush Bhatia, Silas Alberti

Accepted to the SoLaR NeurIPS 2023 workshop.

In-Progress Publications:

Towards Un-Adaptable Language Models

Gabriel Mukobi, anonymous co-authors.

Towards Societal AI Resilience

Gabriel Mukobi, anonymous co-authors.

Language Model Weight Watermarking for Controlled Release Through Canary Trap Detection

Gabriel Mukobi, anonymous co-authors.

TBA Machine Unlearning Project

Gabriel Mukobi, anonymous co-authors.

In review at ICML 2024.

Portfolio Sites

Sticks and Stones Software website screenshot

Software Portfolio

Visit my software development portfolio to see some of the AI projects, games, websites, and desktop software I've created!

Digital 3D Art

My ArtStation portfolio is where I post 3D art made with Blender, Houdini, Unreal Engine, and more!

ArtStation Portfolio Thumbnail
photography portfolio screenshot

Photography

A minimalistic photography portfolio arranged in a neat dynamic grid layout.

Mukobi Music

A website for my musical side, including information about my music, the groups I've played with, and more.

Mukobi Music website screenshot
film portfolio screenshot

Film

A minimalistic film portfolio to highlight short films, video essays, and other motion picture works I've made.

Contact

Please use one of the following methods to contact me. I will try to get back to you as soon as possible!