DPO論文(Your Language Model is Secretly a Reward Model)とその後続論文(... is Secretly a Q-Function)の解説動画。
youtu.be/s4OqzfDyjXY
【AI論文解説】RLHF不要なLLMの強化学習手法Direct Preference Optimization(+α) - YouTube
Want to hear a friend in a noisy café? We designed deep learning-based headphones that let you isolate the speech from a specific person just by *looking* at them for a few seconds. CHI'24 honorable mention award.
Paper: arxiv.org/abs/2405.06289
Code: github.com/vb000/LookOnce…
We are holding a competition for the automatic evaluation of text-to-audio generation. We will evaluate the semantic alignment between text and audio. Please join us!
Pre-registration:forms.gle/s5pNf72Z7KWCfS…
*Pre-registration does not obligate you to participate in competition