sun icon

YUNIAN PAN

Blog ▼
About Me 2022 2023 2024 2025
Statement/CV Contact Photos
© 2022-2025 all rights reserved
Reinforcement Learning with Human Feedback
Aug 25, 2025

Reinforcement Learning with Human Feedback

I would love to think of the ICML 2023 conference in Honolulu as a pivotal event for my P.h.D. research. During that time I was kind of talking to …
Bayesian Holonic (Global) Games
Jul 1, 2025

Bayesian Holonic (Global) Games

This past year I suffered a lot mentally and financially, my funding got paused and my girlfriend Sam (check out her spotify!) lost her boojee A.P.C. …
About Me
Aug 26, 2024

About Me

When you write an About Me, you want something permanent. Well, I go by ‘Union’ — a homophone of my Chinese name that I find neat …
What (on earth) is a Sufficient Statistic?
Apr 28, 2024

What (on earth) is a Sufficient Statistic?

It certainly bothered me (not any more) to hear people casually dropping the word “sufficient statistics” in their talks, often describing …
The Lorenz dynamics and Butterfly Effect
Apr 8, 2024

The Lorenz dynamics and Butterfly Effect

Chaotic behavior can emerge even in simple dynamics such as replicator, and Rock-Paper-Scissor oscillators, depending on the game settings. To begin …
Talagrand's Isoperimetry inequality
Mar 26, 2024

Talagrand's Isoperimetry inequality

This post is in celebration of Michel Talagrand winning Abel prize. Not to overly romanticize this but this is pretty much a come back story because …
A Variational Perspective On Gradient Descent
Feb 11, 2024

A Variational Perspective On Gradient Descent

In this post I just want to share a simple and elegant idea from a Control System Letter paper written by Maxim Raginsky et al.1. This idea largely …
 Some Notes on Dueling Bandits
Oct 11, 2023

Some Notes on Dueling Bandits

The dueling bandit problem natrually fits the description of a variety of recommendation systems that require ‘’learning on the …
Erdos-Szekeres Theorem
Jul 24, 2023

Erdos-Szekeres Theorem

This post is dedicated to Erdos-Szekeres Theorem, which says: ES Thm. (Monotone sequence) For any positive integer $n$, any sequence of $n^2 + 1$ …
Wardrop Equilibrium
Jan 30, 2023

Wardrop Equilibrium

I remember having those unpleasant lines in our high school canteen, flooded by the starving students queeing for their lunch, I would always pick a …
Monge Partition
Dec 31, 2022

Monge Partition

There are many service centers in our city, such as MTA subway station, Vaccination sites, Wifi hot-spots, Blue Bicycles, hospitals, parking lots …
Nesterov
Dec 31, 2022

Nesterov

I did a presentation in a group meeting to briefly review the lower complexity bound of first-order convex optimization; and how Nesterov proceed to …
Capacity of Deception
Nov 16, 2022

Capacity of Deception

This is pretty much scribed from Tor Lattimore’s Bandit Algorithms book,Bandit Algorithms. This book has been the most informational item for me …