Alexandre L.-Piché (@alexpiche_) Twitter Tweets • TwiCopy

Alexandre L.-Piché

@alexpiche_

Searching for Q* at @ServiceNowRSRCH, Prev. PhD @MilaMontreal & Research intern at @DeepMind.

ID: 394263815

Link: http://alexpiche.github.io
Joined: 19-10-2011 20:28:01

122 Tweets

1.1K Followers

4.4K Following

Nicolas Chapados

Thrilled to share the release of StarCoder2! ServiceNow, Hugging Face, and NVIDIA have partnered to deliver a family of open-access code LLMs to help developers everywhere tap the power of GenAI to build software better. Check out model checkpoints on the Hugging Face Hub!

Likes: 41, Replies: 0, Retweets: 17

Alexandre Lacoste

@alex_lacoste_

a year ago

How capable are web agents at solving knowledge work tasks? 🤔 Are LLMs up to the challenge? 🤖 Introducing WorkArena: a benchmark where agents meet the world 𝘸𝘪𝘭𝘥 web of enterprise software 🌐🖥️ Paper: bit.ly/4a7FiFV Website: bit.ly/3VkdJ87 🧵 1/7

Likes: 132, Replies: 7, Retweets: 51

Alexandre L.-Piché

@alexpiche_

a year ago

We can tweak the target accuracy to obtain different behaviors. High target accuracy: ReSearch is very cautious and produces fewer claims on average. Low target accuracy: ReSearch is less cautious, produces more claims, and yet is *still* more accurate than the default behavior.

Likes: 2, Replies: 1, Retweets: 1

Rosie Zhao

@rosieyzh

a year ago

In our new work on evaluating optimizers for LLM training, we perform a series of experiments to investigate the role of adaptivity in optimizers like Adam in achieving good performance and stability. A thread: 🧵

Likes: 185, Replies: 6, Retweets: 30

Alexandre Lacoste

@alex_lacoste_

a year ago

Most of our team is at #ICML2024, reach out if you want to meet. We'll be presenting WorkArena and BrowserGym: Poster Session 2 on Tuesday, Hall C 4-9 #610 arxiv.org/abs/2403.07718

Likes: 24, Replies: 5, Retweets: 16

Alexandre Drouin

@alexandredrouin

9 months ago

Interested in time series forecasting and LLMs? We are looking for visiting researchers to work on context-aided forecasting (example below):
* Benchmarking
* Multimodal Foundation Models
* Agentic forecasting assistants
When: Jan '25 - 8 months
Details: bit.ly/sc25q1

Likes: 23, Replies: 0, Retweets: 21

🇺🇦 Dzmitry Bahdanau

@dbahdanau

8 months ago

🚨 New agent framework! 🚨 My team at ServiceNow Research is releasing TapeAgents: a holistic framework for agent development and optimization. At its core is the tape: a structured agent log. Repo: github.com/ServiceNow/Tap… Paper: servicenow.com/research/TapeA… Why you should care: 🧵

Likes: 154, Replies: 5, Retweets: 40

Krishnamurthy (Dj) Dvijotham

@djdvij

8 months ago

The dominant paradigm in AI alignment is to learn from human feedback. But what form should this feedback take? Does a simple thumbs up/down suffice? Finer-grained attributes? Our paper ojs.aaai.org/index.php/AIES… led by the amazing Katie Collins at #AIES studies these questions

Likes: 37, Replies: 1, Retweets: 12

Krishnamurthy (Dj) Dvijotham

@djdvij

8 months ago

I am also hiring for my new team at ServiceNow Research, please reach out if you are at the conference and interested in building the future of secure AI for the enterprise. We have openings for interns, engineers and researchers

Likes: 11, Replies: 1, Retweets: 6

Alexandre Lacoste

@alex_lacoste_

8 months ago

@AnthropicAI Early results with Claude 3.5 Sonnet for our new paper. We're probably not even using it right yet and its performance is through the roof, leaving o1-mini in the dust (o1-preview results are coming). See github.com/ServiceNow/Bro… for a growing number of web UI benchmarks.

Likes: 19, Replies: 0, Retweets: 7

🇺🇦 Dzmitry Bahdanau

@dbahdanau

2 months ago

I am excited to open-source PipelineRL - a scalable async RL implementation with in-flight weight updates. Why wait until your bored GPUs finish all sequences? Just update the weights and continue inference! Code: github.com/ServiceNow/Pip… Blog: huggingface.co/blog/ServiceNo…

Likes: 507, Replies: 6, Retweets: 114
