🧠🤖

Wannan (Winnie) Yang
Welcome to My Corner of the Web! 👋
I’m Winnie, a PhD candidate at New York University (NYU).
My work spans from biological brains (my works were published in Science and Nature to autonomous LLM research agents (current research at Meta). I’ve spent years understanding biological learning machines and currently building artificial ones. The brain taught me how intelligence works. AI taught me how to build it. Now I want to close this circle—using insights from biological intelligence to build better AI, then wielding that AI to decode biology itself.
Current Focus
My current interests include:
- Open-endedness: Evolving self-improving LLM Coding Agent using evolutionary algorithm (collaborating with Jenny Zhang)
- AI Research Scientist: In November 2025, I will join the AIRA Team at FAIR (manager: Yoram Bachrach), building toward a fully automated AI research scientist.
Past Research Highlight
In early 2024 I was beginning my transition from Neuroscience to AI. By then, progress in AI for several years had already exploded in a myriad of promise and hope. But what struck me in 2024 was that the rate of progress was continuing unabated, and my feelings of open-ended benefit for humanity turned to worries for our extinction as we barrel unprepared toward the direction of ASI. My main motivation is that in order for humanity to benefit from the plethora of benefits that AI brings, we must make sure the system is aligned, and we must build such safe systems now.
Within the broad goal of helping superalignment, my have conducted research revolve around building automated alignment research assistants – perhaps by better leveraging interpretability tools to monitor their internals.
One area, for instance, that I have worked on is to monitor/evaluate the trustworthiness/deception of both alignment assistant (supervisor) and the supervisee via interpretability tools (Preprint).
Using interpretability insight, I have devellped an effiicient post-training agorithm to reduce hallucination in LLMs (Preprint).
Delving into the black box is within my existing expertise as a neuroscientist (expertise in ‘internal oversight’ of the brain) as well as my recent research experience in LLM deception.