Jingfeng Wu (@uuujingfeng)'s Twitter Profile
Jingfeng Wu

@uuujingfeng

Bsky: bsky.app/profile/uuujf.…

Postdoc @SimonsInstitute @UCBerkeley; alumnus of @JohnsHopkins @PKU1898; DL theory, opt, and stat learning.

ID: 1933510801

Website: https://uuujf.github.io

Joined: 04-10-2013 07:50:15

98 Tweets

1.1K Followers

1.1K Following

GD with a LARGE stepsize induces an oscillatory loss, which may sound scary, but the oscillation eventually accelerates optimization, provably.

Core proof in <= 5 pages, which I'm very proud of :)

New paper w/ Peter Bartlett, Matus Telgarsky, Bin Yu

arxiv.org/abs/2402.15926