Algorithm Discovery With LLMs: Evolutionary Search Meets Reinforcement Learning
COLMAnja Surina , Amin Mansouri , Lars Quaedvlieg , Amal Seddas , Maryna Viazovska , Emmanuel Abbe , Caglar Gulcehre
Conference on Language Modeling (COLM) · 2025
AI researcher · London
I'm a big believer in iterative self-improvement and reinforcement learning, and I love building things like apps, tools, this site, etc!
Member of Technical Staff @ Jump Trading · previously Meta FAIR & EPFL
scroll · follow the search ↓
01 / About
In January 2026, I joined Jump Trading as a Member of Technical Staff (AI Researcher). I was previously a Research Scientist intern on the reinforcement learning team in the Core Learning & Reasoning pillar at Meta FAIR, and finished my MSc in Data Science at EPFL as a research scholar in the Caglar Gulcehre Lab for AI Research, on an EPFL Excellence Fellowship.
Off the clock: calisthenics parks 🤸 and unreasonably competitive GeoGuessr 🌎.
02 / Highlights
Jan 2026
Joined Jump Trading in London as a Member of Technical Staff (AI Researcher)! 🚀
Nov 2025
Interviewed with OpenAI for a Research Scientist position.
Sept 2025
Invited to do research at Azalia Mirhoseini’s Scaling Intelligence Lab at Stanford. 🌲
Jul 2025
Meta featured me for National Intern Day 2025! 🎉
Jan '26 – Present
AI research for trading, based in London.
Mar '25 – Sep '25
Learning a distribution of successor features for zero-shot reinforcement learning, on the RL team in the Core Learning & Reasoning pillar.
Oct '23 – Feb '25
Research scholar in the Caglar Gulcehre Lab for AI Research: evolutionary search with LLMs, AI for math, and in-context reinforcement learning with state space models.
Jul '23 – Jan '24
Self-supervised pre-training of transformer agents on expert trajectories (PASTA, RLJ 2024), evaluated across behavioral cloning, offline RL, sensor-failure robustness, and dynamics adaptation.
Nov '22 – Oct '23
Self-supervised learning for combinatorial optimization (NeurIPS 2023); RL + GNNs for scheduling.
Sep '22 – Aug '25
Master's Excellence Fellowship (awarded to ~3% of students). 5.7/6.0 GPA.
Feb '21 – Aug '22
Multi-camera multi-object tracking, plant-layout generation, and production-line throughput optimization.
Sep '19 – Jul '22
Graduated summa cum laude (9.5/10, ranked 1st of 104). University-wide Best Bachelor’s Thesis Award for “Multi-Agent Reinforcement Learning with Graph Neural Networks for Online Multi-Hoist Scheduling”.
Anja Surina , Amin Mansouri , Lars Quaedvlieg , Amal Seddas , Maryna Viazovska , Emmanuel Abbe , Caglar Gulcehre
Conference on Language Modeling (COLM) · 2025
Lars Quaedvlieg
arXiv preprint arXiv:2501.19063 · 2025