Jr Kibs (@jrkibs) Twitter Tweets • TwiCopy

elvis

a month ago

Evaluating LLM-based Agents This report has a comprehensive list of methods for evaluating AI Agents. Don't ignore evals. If done right, they are a game-changer. Highly recommend it to AI devs. (bookmark it)

thumb_up_off_alt882

chat_bubble_outline24

repeat171

shareShare

Chubby♨️

@kimmonismus

a month ago

Today I read that AI agents are already being introduced into law firms around the world, initially to perform repetitive tasks and later to take on real legal work. In the meantime, there are already numerous use cases on Reddit, such as those in which AI helps to conduct

thumb_up_off_alt616

chat_bubble_outline38

repeat68

shareShare

Jr Kibs

@jrkibs

a month ago

The trajectory that will lead us here.

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

Machine Learning Street Talk

@mlstreettalk

a month ago

Very interesting work by Sakana AI - they have designed a MoE / novel test time inference framework inspired by MCTS which finds the best "switching path" of frontier models which at depth 0 generates a code solution and from depth >0 iteratively edits the existing solution

thumb_up_off_alt104

chat_bubble_outline4

repeat26

shareShare

Chubby♨️

@kimmonismus

a month ago

4 out of the top 10 YouTube channels are now AI-generated So AI even took the YouTube influencer dream, eh?

thumb_up_off_alt483

chat_bubble_outline44

repeat44

shareShare

Ai2

@allen_ai

a month ago

Today we released SciArena, an open evaluation platform where researchers can compare and vote on foundation models for scientific literature tasks. 👇

thumb_up_off_alt95

chat_bubble_outline2

repeat12

shareShare

Jr Kibs

@jrkibs

a month ago

Here’s a show that gives us clues on how we might deal with Alien Intelligence. AI Safety folks, this one’s for you. I invite you to watch this show carefully. I invite you to be like Mitsuki : think like them in order to hack them. Anthropic Amanda Askell Jan Leike

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

Maitreyee Wairagkar

@maitreyee_w

a month ago

Check out the new nature Research Briefings article “Brain implant decodes neural activity to produce expressive speech” which summarizes our recent voice-synthesis neuroprosthesis paper. It also gives a sneak peek into the story behind this paper. doi.org/10.1038/d41586…

Check out the new <a href="/Nature/">nature</a> Research Briefings article “Brain implant decodes neural activity to produce expressive speech” which summarizes our recent voice-synthesis neuroprosthesis paper. It also gives a sneak peek into the story behind this paper. doi.org/10.1038/d41586…

thumb_up_off_alt105

chat_bubble_outline6

repeat21

shareShare

Tom Yeh

@proftomyeh

a month ago

Context Engineering by hand ✍️ This exercise shows you how it goes far beyond prompt engineering. Do you think this new AI buzzword will stick around?

thumb_up_off_alt175

chat_bubble_outline4

repeat35

shareShare

Jr Kibs

@jrkibs

a month ago

"You might have heard rumors of companies looking to acquire us. We are flattered by their attention but are focused on seeing our work through." Zuck, this one's for you.

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

Jr Kibs

@jrkibs

a month ago

Freelancers using Codex can easily handle 3 to 4 clients. It’s just insane. Especially when you pair it with Claude (for design), oh my Gosh. You can build apps that look like they came straight out of a sci-fi movie. I literally feel like we’re in the middle of a takeoff.

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

Jr Kibs

@jrkibs

a month ago

Just ask Claude: Impress me. As Sama put it so well : The Gentle Singularity.

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

Jr Kibs

@jrkibs

a month ago

It’s Karpathy who sets the direction for the whole community. In early 2024, he popularized “prompt engineering” A year later, he made “vibe coding” mainstream. And now ? Everyone’s talking about 'context engineering' ever since he tweeted about it a few days ago.

thumb_up_off_alt3

chat_bubble_outline2

repeat0

shareShare

Chubby♨️

@kimmonismus

a month ago

Further empirical data proving the significant job losses caused by AI. This fact does not seem to have been established yet.

thumb_up_off_alt544

chat_bubble_outline49

repeat80

shareShare

Jr Kibs

@jrkibs

a month ago

Here, it’s not the answers that matter; it’s the questions being asked. Each one invites us to dig deep, to respond based on what we believe to be true. They push us to question the very nature of our reality.

thumb_up_off_alt1

chat_bubble_outline0

repeat0

shareShare

Rohan Paul

@rohanpaul_ai

a month ago

this story is going wildy viral on reddit. ChatGPT flagged a hidden gene defect that doctors missed for a decade. ChatGPT ingested the patient’s MRI, CT, broad lab panels and years of unexplained symptoms. It noticed that normal serum B12 clashed with nerve pain and fatigue,

thumb_up_off_alt660

chat_bubble_outline31

repeat88

shareShare