Tao Li (@tao__li)'s Twitter Profile
Tao Li

@tao__li

Research Engineer @GoogleDeepMind | Formerly @GoogleAI | PhD @UtahNLP @UUtah | Intern @allen_ai Aristo, @Amazon A9, @PhilipsNA.

ID: 1135286288204959744

Link: https://www.cs.utah.edu/~tli/ · Joined: 02-06-2019 20:45:18

46 Tweets

175 Followers

461 Following

Vivek Gupta (@keviv9)'s Twitter Profile Photo

Excited to share our #ACL2023NLP (#NLProc) paper on "Information Synchronization Across Multilingual Semi-Structured Tables"! 📚🔍

We explore the problem of Information Synchronization across multilingual tables and construct a large-scale dataset InfoSync. #NLPResearch  -1/n
Vivek Gupta (@keviv9)'s Twitter Profile Photo

Excited to share our #ACL2023NLP (#NLProc) work on "Evaluating Inter-Bilingual Semantic Parsing for Indian Languages" appearing in @5thnlp4convai workshop. We propose IE-SEMPARSE, an Inter-bilingual Seq2seq Semantic parsing dataset for 11 Indian languages. - 1/n
Maitrey Mehta (@my_tray)'s Twitter Profile Photo

I’ll be presenting our work “Verifying Annotation Agreement without Multiple Experts: A Case Study with Gujarati SNACS” as a virtual poster on Tuesday at #ACL2023 and in-person at LAW on Thursday. Joint work with Vivek Srikumar. Paper: tinyurl.com/3989kvnp (1/3)

AK (@_akhaliq)'s Twitter Profile Photo

A Zero-Shot Language Agent for Computer Control with Structured Reflection

paper page: huggingface.co/papers/2310.08…

Large language models (LLMs) have shown increasing capacity at planning and executing a high-level goal in a live computer environment (e.g. MiniWoB++). To perform a…
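
The abstract is cut off above, but its setup invites a concrete picture of how reflective retries in a live environment might be wired. Below is a minimal sketch under stated assumptions: the helpers (call_llm, the env object) are illustrative stand-ins, not the paper's implementation, and the structured critique format (step index, bad action, fix) shows the general idea rather than the exact scheme.

```python
# A minimal sketch of a reflective retry loop for a computer-control agent.
# All names here (call_llm, Lesson, env) are illustrative stand-ins and not
# the paper's actual implementation.
from dataclasses import dataclass

def call_llm(prompt: str) -> str:
    """Placeholder for a real LLM call; always answers '0' for the demo."""
    return "0"

@dataclass
class Lesson:
    step: int           # which step of the failed trajectory is to blame
    wrong_action: str   # the action that was taken there
    better_action: str  # the correction to apply on the next attempt

def attempt(env, goal: str, lessons: list[Lesson]) -> tuple[bool, list[str]]:
    obs, actions = env.reset(), []
    hints = "\n".join(f"At step {l.step}, do {l.better_action} instead of "
                      f"{l.wrong_action}." for l in lessons)
    while not env.done:
        action = call_llm(f"Goal: {goal}\nObs: {obs}\nHints:\n{hints}\nAction:")
        actions.append(action)
        obs = env.step(action)
    return env.success, actions

def solve(env, goal: str, max_attempts: int = 3) -> bool:
    lessons: list[Lesson] = []
    for _ in range(max_attempts):
        success, actions = attempt(env, goal, lessons)
        if success:
            return True
        # Reflection is *structured*: the critique must name a step, the bad
        # action, and a concrete fix, rather than free-form self-criticism.
        k = int(call_llm(f"Trajectory: {actions}\nIndex of first bad step:"))
        fix = call_llm(f"Better action for step {k}?")
        lessons.append(Lesson(step=k, wrong_action=actions[k],
                              better_action=fix))
    return False
```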
Maitrey Mehta (@my_tray)'s Twitter Profile Photo

New preprint 🚨

"Do LLM predictors provide structurally consistent outputs in the zero- and few-shot regime?"

Our new work "Promptly Predicting Structures: The Return of Inference" shows that they do not, and we show how to fix it.

(1/n) 🧵
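
The tweet doesn't spell out the fix, but the "return of inference" framing points at a classic recipe: decode under structural constraints instead of trusting raw per-token LLM outputs. Here is a minimal sketch of that recipe, using Viterbi decoding over BIO tags with made-up scores; it illustrates constrained inference in general, not the paper's exact method.

```python
# Illustrative sketch: restoring structural consistency with inference.
# Decode under a constraint (valid BIO transitions) instead of taking the
# per-token argmax of hypothetical LLM label scores.
import math

TAGS = ["O", "B", "I"]

def valid(prev: str, curr: str) -> bool:
    # "I" may only continue a span: it cannot follow "O".
    return not (curr == "I" and prev == "O")

def constrained_decode(scores: list[dict[str, float]]) -> list[str]:
    """Viterbi over per-token log-scores with transition constraints."""
    # A sentence cannot start inside a span, so "I" is banned at position 0.
    best = {t: (scores[0][t] if t != "I" else -math.inf, [t]) for t in TAGS}
    for step in scores[1:]:
        nxt = {}
        for t in TAGS:
            cands = [(s + step[t], path + [t])
                     for p, (s, path) in best.items() if valid(p, t)]
            nxt[t] = max(cands, key=lambda c: c[0])
        best = nxt
    return max(best.values(), key=lambda c: c[0])[1]

# Raw argmax on these scores yields ["O", "I", "I"], an invalid structure;
# constrained decoding repairs it to ["O", "B", "I"].
print(constrained_decode([
    {"O": -0.1, "B": -2.0, "I": -3.0},
    {"O": -2.5, "B": -2.0, "I": -0.2},
    {"O": -3.0, "B": -2.5, "I": -0.1},
]))
```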
Google DeepMind (@googledeepmind)'s Twitter Profile Photo

Introducing SIMA: the first generalist AI agent to follow natural-language instructions in a broad range of 3D virtual environments and video games. 🕹️ It can complete tasks similar to a human, and outperforms an agent trained in just one setting. 🧵 dpmd.ai/3TiYV7d

Haoyu Wang (@haoyu_wang_97)'s Twitter Profile Photo

Excited to introduce our new paper BLINK! It’s a new benchmark for MLLMs, focusing on visual perception capabilities. We show that there’s still a gap between SOTA MLLMs and human performance in 14 tasks that can be solved by humans within a blink~

fly51fly (@fly51fly)'s Twitter Profile Photo

[AI] Devil's Advocate: Anticipatory Reflection for LLM Agents
H Wang, T Li, Z Deng, D Roth, Y Li [Google DeepMind] (2024)
arxiv.org/abs/2405.16334

- Proposes a novel approach that integrates introspection into LLM agents to enhance their consistency and adaptability in solving…
Haoyu Wang (@haoyu_wang_97)'s Twitter Profile Photo

Multiple Reflections NOT helping much? Tired of changing plans and NOT seeing utmost effort in their execution?

Introducing Devil’s Advocate 😈: Equipping LLM Agents with *Anticipatory* Reflection before action execution 

#LLM #Agent #AI #ML 

arxiv.org/pdf/2405.16334…
Tao Li (@tao__li)'s Twitter Profile Photo

Reflection doesn't have to be post-hoc. We show that an agent can benefit from "anticipatory" reflection on failures before they happen. An extra bonus of doing so is that reflective trials can now run in parallel.
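
A minimal sketch of the contrast: post-hoc reflection chains trials serially because each trial consumes the previous failure's lesson, while anticipatory reflection critiques the plan before execution, so trials are independent and can run in parallel. The helpers (call_llm, execute) are stand-ins, not the paper's code.

```python
# Illustrative contrast between post-hoc and anticipatory reflection; the
# helpers (call_llm, execute) are stand-ins, not the paper's implementation.
from concurrent.futures import ThreadPoolExecutor

def call_llm(prompt: str) -> str:
    return "stubbed model output"   # placeholder for a real LLM call

def execute(plan: str) -> bool:
    return False                    # placeholder for running the plan

def posthoc_trial(task: str, lessons: list[str]) -> bool:
    # Classic self-reflection: act first, reflect only after failure. Each
    # trial consumes the previous trial's lesson, so trials run serially.
    plan = call_llm(f"Task: {task}\nLessons: {lessons}\nPlan:")
    if execute(plan):
        return True
    lessons.append(call_llm(f"The plan failed: {plan}\nWhat went wrong?"))
    return False

def anticipatory_trial(task: str) -> bool:
    # Devil's-advocate style: before acting, ask the model how the plan
    # could fail, then revise it. Nothing depends on an earlier trial...
    plan = call_llm(f"Task: {task}\nPlan:")
    critique = call_llm(f"Plan: {plan}\nHow might this fail?")
    revised = call_llm(f"Plan: {plan}\nAnticipated failure: {critique}\n"
                       f"Revised plan:")
    return execute(revised)

# ...which is why anticipatory trials can be launched in parallel:
with ThreadPoolExecutor() as pool:
    results = list(pool.map(anticipatory_trial, ["book the flight"] * 4))
```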

Haoyu Wang (@haoyu_wang_97)'s Twitter Profile Photo

NeurIPS’24 authors: if you are desk rejected due to a missing checklist, add your email to this appeal letter! docs.google.com/document/d/16_… Getting desk rejected after filling out the checklist form carefully in OpenReview… This does not make any sense!

Google AI (@googleai)'s Twitter Profile Photo

Congratulations to the authors of the “Rich Human Feedback for Text-to-Image Generation” paper, which received the #CVPR2024 Best Paper Award. Check out the paper at: arxiv.org/pdf/2312.10240
Zhichao Xu Brutus (@zhichaoxu_ir)'s Twitter Profile Photo

Are compressed LLMs less toxic and biased against different demographic groups❓ In this new 📜, we study 4 pruning methods and 3 quantization methods and evaluate on 7 bias/toxicity benchmarks. arxiv.org/abs/2407.04965 The (un)surprising answer is: they are not less toxic/biased

Qingyao Ai (@qingyaoai)'s Twitter Profile Photo

Thrilled to know that our paper on Scaling Laws for Dense Retrieval has won the #SIGIR2024 Best Paper Award! 🏆Our study reveals a power-law scaling of dense retrieval models, which can help optimize training and resource allocation. Huge thanks and congrats to all collaborators!
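
A power law is linear in log-log space, which is what makes it usable for planning compute and model size. Here is a minimal sketch with made-up numbers; the paper's exact functional form, variables, and data may differ.

```python
# Minimal sketch of fitting a power law, L(N) ≈ a * N^(-b), to model-size /
# loss pairs. The numbers below are hypothetical, purely for illustration.
import numpy as np

sizes = np.array([1e7, 1e8, 1e9, 1e10])      # parameter counts (hypothetical)
losses = np.array([0.52, 0.34, 0.22, 0.145]) # retrieval loss (hypothetical)

# A power law is linear in log-log space: log L = log a + b * log N,
# where b is negative.
b, log_a = np.polyfit(np.log(sizes), np.log(losses), 1)
a = np.exp(log_a)
print(f"L(N) ≈ {a:.3f} * N^({b:.3f})")

# Extrapolate to pick a model size for a target loss: the kind of
# resource-allocation question a scaling law lets you answer.
target = 0.10
print(f"N for L={target}: {np.exp((np.log(target) - log_a) / b):.3e}")
```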
Jeff Dean (@jeffdean)'s Twitter Profile Photo

Got a picture that isn't quite right? Try our native image generation in Gemini Flash 2.0. "Can you remove the stuff on the couch?". "Can you make the curtains light green?" "Can you put a unicorn horn on the person in the green pants?" Editing in human language, not image…

Google DeepMind (@googledeepmind)'s Twitter Profile Photo

Think you know Gemini? 🤔 Think again. Meet Gemini 2.5: our most intelligent model 💡 The first release is Pro Experimental, which is state-of-the-art across many benchmarks - meaning it can handle complex problems and give more accurate responses. Try it now →