Atakan Tekparmak (@atakantekparmak) 's Twitter Profile
Atakan Tekparmak

@atakantekparmak

BSc Artificial Intelligence graduate, University of Groningen. I keep (re)posting daily AI news, papers and threads (with a focus on LLMs).

ID: 1345374773916921856

Link: https://atakantekparmak.github.io/ | Joined: 02-01-2021 14:22:05

1.1K Tweets

497 Followers

578 Following

Yu Su @ACL (@ysu_nlp) 's Twitter Profile Photo

People into agents, let me pitch something to you:

🌟 An agent that works across every platform (web, desktop & mobile)
🌟 Visual perception only, no messy & often incomplete HTML or a11y tree
🌟 SOTA performance across 6 agent benchmarks

Sounds too good to be true? Continue ⬇️
Pliny the Liberator 🐉 (@elder_plinius) 's Twitter Profile Photo

AI RED-TEAMING PLINY-AGENTS HAVE ARRIVED! 🦾 Here's the video of liberated Claude autonomously jailbreaking Perplexity to produce a meth synthesis recipe! And it was all done from a SINGLE PROMPT in less than 10 minutes 😻 What a time to be alive!

Maziyar PANAHI (@maziyarpanahi) 's Twitter Profile Photo

The base model for Pixtral just dropped on Hugging Face! 🔥 And here’s the big news: it’s licensed under Apache 2.0! 🚀 huggingface.co/mistralai/Pixt…

Yuling Gu (@gu_yuling) 's Twitter Profile Photo

⚠️ Introducing SimpleToM, exposing a jarring gap in the Theory-of-Mind capabilities of current frontier LLMs:
😲 They fail to implicitly apply mental state inferences, even when they can easily infer these states for two-sentence stories. 😲
📜 arxiv.org/abs/2410.13648
1/
Niels Rogge (@nielsrogge) 's Twitter Profile Photo

A new video LLM by Meta dropped on the hub, and it's the new SOTA for open-source video understanding

> builds on top of SigLIP/DINOv2 and Qwen2/Llama 3.2
> includes a 3B parameter model for on-device use cases

Weights: huggingface.co/collections/Vi…
Demo: huggingface.co/spaces/Vision-…
Atakan Tekparmak (@atakantekparmak) 's Twitter Profile Photo

Gemini-1.5-flash is a gift from Google. What else offers 1,500 free requests per day for you to try, and is this good for the (rumoured) size and price?

Varun (@varun_mathur) 's Twitter Profile Photo

This is a game-changer announcement by Apple around cryptography. It is the “HTTPS moment for AI” in some ways.

Here is what this means: your private confidential data can be pooled with other data sources and used to securely improve your UX and that of the wider community
Alexander Doria (@dorialexander) 's Twitter Profile Photo

Releasing my detailed commented introduction to LLM sampling colab.research.google.com/drive/18-2Z4TM… We get back to the basics and slowly build up to a reproduction of the adaptive temperature strategy from "Softmax is not enough" (from Petar Veličković et al.)

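For readers who want the gist before opening the notebook: below is a minimal sketch (not the notebook's code) of the adaptive-temperature idea from "Softmax is not enough" — sharpen the distribution when its Shannon entropy is high. The entropy threshold and the entropy-to-inverse-temperature map here are illustrative placeholders, not the paper's fitted polynomial.

```python
import torch
import torch.nn.functional as F

def adaptive_temperature_sample(logits: torch.Tensor,
                                entropy_threshold: float = 0.5,
                                alpha: float = 1.0) -> int:
    """Sample a token, sharpening the softmax when its entropy is high.

    Illustrative sketch: the paper fits a polynomial from entropy to an
    inverse temperature; here a simple linear map (alpha) stands in for it.
    """
    probs = F.softmax(logits, dim=-1)
    entropy = -(probs * torch.log(probs.clamp_min(1e-12))).sum()

    if entropy > entropy_threshold:
        # Higher entropy -> larger inverse temperature beta >= 1 -> sharper distribution.
        beta = 1.0 + alpha * (entropy - entropy_threshold)
        probs = F.softmax(logits * beta, dim=-1)

    return torch.multinomial(probs, num_samples=1).item()

# Toy usage with random logits over a 50k-token vocabulary.
token_id = adaptive_temperature_sample(torch.randn(50_000))
```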
Marcel Binz (@marcel_binz) 's Twitter Profile Photo

Excited to announce Centaur -- the first foundation model of human cognition. Centaur can predict and simulate human behavior in any experiment expressible in natural language. You can readily download the model from Hugging Face and test it yourself: huggingface.co/marcelbinz/Lla…
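
The Hugging Face link above is truncated, so the repo id below is a hypothetical placeholder; with the transformers library, downloading and testing the released checkpoint would look roughly like this (assuming it is a Llama-style causal LM, as the link suggests).

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Hypothetical placeholder -- substitute the actual model id from the (truncated) link above.
MODEL_ID = "marcelbinz/<centaur-model-id>"

tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
model = AutoModelForCausalLM.from_pretrained(MODEL_ID, device_map="auto")

# Describe an experiment in natural language and let the model predict the participant's choice.
prompt = ("You are choosing between two gambles. Option A: 50% chance of $100. "
          "Option B: $40 for sure. You choose Option")
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=5)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```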

TuringPost (@theturingpost) 's Twitter Profile Photo

Google DeepMind, Google AI and KAIST AI introduce new methods to turn large LLMs into smaller models:

- Recursive Transformers that reuse layers multiple times
- Relaxed Recursive Transformers with LoRA
- Continuous Depth-wise Batching for speeding up processing

Details 🧵
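
As a rough illustration of the first bullet (not the paper's implementation): a recursive transformer keeps only a few unique blocks and loops over them several times, so effective depth grows without adding parameters. The "relaxed" variant would additionally attach small per-loop LoRA adapters to the shared layers; that part is only noted in a comment here.

```python
import torch
import torch.nn as nn

class RecursiveTransformer(nn.Module):
    """Toy sketch: reuse a small stack of unique layers `n_loops` times."""

    def __init__(self, d_model=256, n_heads=4, n_unique_layers=3, n_loops=4):
        super().__init__()
        self.layers = nn.ModuleList(
            nn.TransformerEncoderLayer(d_model, n_heads, batch_first=True)
            for _ in range(n_unique_layers)
        )
        self.n_loops = n_loops  # effective depth = n_unique_layers * n_loops

    def forward(self, x):
        for _ in range(self.n_loops):
            # A "relaxed" recursive transformer would add a per-loop LoRA delta
            # to each shared layer here, so iterations are not forced to be identical.
            for layer in self.layers:
                x = layer(x)
        return x

model = RecursiveTransformer()
out = model(torch.randn(2, 16, 256))  # (batch, seq_len, d_model)
print(out.shape)
```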
jack morris (@jxmnop) 's Twitter Profile Photo

just open-sourced the training and evaluation code for cde, our state-of-the-art small text embedding model

includes code for lots of hard stuff:
* efficient clustering large datasets
* contrastive training for SOTA retrieval models
* our custom two-stage model architecture that
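
For the contrastive-training bullet, here is a generic in-batch-negatives (InfoNCE-style) loss of the kind retrieval models are typically trained with; this is a standard sketch, not the cde repo's actual loss.

```python
import torch
import torch.nn.functional as F

def in_batch_contrastive_loss(query_emb, doc_emb, temperature=0.05):
    """In-batch-negatives contrastive loss for retrieval training.

    query_emb, doc_emb: (batch, dim). Row i of doc_emb is the positive for row i
    of query_emb; every other row in the batch serves as a negative.
    """
    q = F.normalize(query_emb, dim=-1)
    d = F.normalize(doc_emb, dim=-1)
    scores = q @ d.T / temperature               # (batch, batch) similarity matrix
    labels = torch.arange(q.size(0), device=q.device)
    return F.cross_entropy(scores, labels)       # diagonal entries are the positives

loss = in_batch_contrastive_loss(torch.randn(32, 768), torch.randn(32, 768))
```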
MetaGPT (@metagpt_) 's Twitter Profile Photo

🌟 Excited to open-source SELA, a powerful experimentation system integrating MCTS with LLM agents. Across 20 datasets, SELA achieves a 75% win rate against AIDE (OpenAI's top pick in MLE-Bench) and beats traditional AutoML methods developed over years.
💻 Code:
Vaibhav (VB) Srivastav (@reach_vb) 's Twitter Profile Photo

🚨Meta released MobileLLM - 125M, 350M, 600M, 1B model checkpoints! 🔥

Notes on the release:

Depth vs. Width: Contrary to the scaling law (Kaplan et al., 2020), depth is more critical than width for small LLMs, enhancing abstract concept capture and final performance

Embedding
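
To make the depth-vs-width point concrete, here is a back-of-the-envelope parameter count (attention plus a 4x FFN per block, ignoring embeddings, biases and norms): a deeper-narrower configuration and a shallower-wider one can land at roughly the same budget, and MobileLLM's finding is that the deeper one tends to perform better at these small scales. The specific widths and depths below are illustrative, not MobileLLM's actual configs.

```python
def block_params(d_model: int) -> int:
    """Rough per-block parameter count: 4*d^2 (attention) + 8*d^2 (FFN with 4x expansion)."""
    return 12 * d_model * d_model

def model_params(d_model: int, n_layers: int) -> int:
    return block_params(d_model) * n_layers

# Two roughly equal-budget configs: deep-and-narrow vs shallow-and-wide (illustrative numbers).
deep_narrow = model_params(d_model=512, n_layers=30)    # ~94M block params
shallow_wide = model_params(d_model=800, n_layers=12)   # ~92M block params
print(f"deep-narrow:  {deep_narrow / 1e6:.1f}M")
print(f"shallow-wide: {shallow_wide / 1e6:.1f}M")
```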
Logan Kilpatrick (@officiallogank) 's Twitter Profile Photo

Say hello to Grounding with Google Search, available in the Gemini API + Google AI Studio! You can now access real time, fresh, up to date information from Google Search when building with Gemini by enabling the Grounding tool. developers.googleblog.com/en/gemini-api-…
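
A minimal sketch of enabling the Grounding tool over the Gemini REST API with the requests library. The tool field name (google_search_retrieval) and the response layout follow the announcement as I understand it and should be treated as assumptions; check the linked blog post for the exact schema.

```python
import os
import requests

API_KEY = os.environ["GEMINI_API_KEY"]
URL = (
    "https://generativelanguage.googleapis.com/v1beta/"
    f"models/gemini-1.5-flash:generateContent?key={API_KEY}"
)

# Assumed request shape: a normal prompt plus the Google Search grounding tool.
payload = {
    "contents": [{"parts": [{"text": "Who won the most recent F1 grand prix?"}]}],
    "tools": [{"google_search_retrieval": {}}],
}

resp = requests.post(URL, json=payload, timeout=30)
resp.raise_for_status()
print(resp.json()["candidates"][0]["content"]["parts"][0]["text"])
```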

Vaibhav (VB) Srivastav (@reach_vb) 's Twitter Profile Photo

Fuck it - it’s raining smol LMs - SmolLM2 1.7B - beats Qwen 2.5 1.5B & Llama 3.2 1B, Apache 2.0 licensed, trained on 11 Trillion tokens 🔥

> 135M, 360M, 1.7B parameter model
> Trained on FineWeb-Edu, DCLM, The Stack, along w/ new mathematics and coding datasets
> Specialises in