Nash (@nash_function) 's Twitter Profile
Nash

@nash_function

mind over matter
AI Engineer, Startup Founder. Cybersecurity geek. Coding for fun.

ID: 1617255174241939456

calendar_today22-01-2023 20:19:39

109 Tweet

29 Takipçi

192 Takip Edilen

Alvaro Cintas (@dr_cintas) 's Twitter Profile Photo

🚨LLAMA 3.1 IS HERE! The open source AI model that you can fine-tune, distill and deploy anywhere. It is available in 8B, 70B and 405B versions. Here is everything you need to know and how to try it:

🚨LLAMA 3.1 IS HERE!

The open source AI model that you can fine-tune, distill and deploy anywhere. It is available in 8B, 70B and 405B versions. 

Here is everything you need to know and how to try it:
elvis (@omarsar0) 's Twitter Profile Photo

Transformer Explainer Really cool interactive tool to learn about the inner workings of a Transformer model. Apparently, it runs a GPT-2 instance locally in the user's browser and allows you to experiment with your own inputs. This is a nice tool to learn more about the

Arfur Rock (@arfurrock) 's Twitter Profile Photo

🚨New unicorn alert — Etched, word's first transformer ASIC Closed an unannounced $85M at $1.5B, following two other stealth rounds at $500M then $750M. The $750M round was just ~2 months ago.

Garry Tan (@garrytan) 's Twitter Profile Photo

Every single one of these engineers just has to spend literally just one day making projects with Windsurf and it will be like they strapped on rocket boosters Don’t try to build a building with your bare hands. Use heavy machinery!

Tanishq Mathew Abraham, Ph.D. (@iscienceluvr) 's Twitter Profile Photo

Inference-Time Scaling for Generalist Reward Modeling New paper from DeepSeek! "We propose a novel approach, Self-Principled Critique Tuning (SPCT), to foster effective inference-time scalability for generalist reward modeling, resulting in DeepSeek-GRM models. And we further

Inference-Time Scaling for Generalist Reward Modeling

New paper from DeepSeek!

"We propose a novel approach, Self-Principled Critique Tuning (SPCT), to foster
effective inference-time scalability for generalist reward modeling, resulting in
DeepSeek-GRM models. And we further
Ahmad Al-Dahle (@ahmad_al_dahle) 's Twitter Profile Photo

Introducing our first set of Llama 4 models! We’ve been hard at work doing a complete re-design of the Llama series. I’m so excited to share it with the world today and mark another major milestone for the Llama herd as we release the *first* open source models in the Llama 4

Introducing our first set of Llama 4 models!

We’ve been hard at work doing a complete re-design of the Llama series. I’m so excited to share it with the world today and mark another major milestone for the Llama herd as we release the *first* open source models in the Llama 4
Together AI (@togethercompute) 's Twitter Profile Photo

Introducing Open Deep Research! A fully open-source Deep Research tool that: • writes comprehensive reports • does multi-hop search and reasoning • generates cover images & pod-casts! We’re releasing everything: evaluation dataset, code and blog.🔥 Example output report👇

Chip Huyen (@chipro) 's Twitter Profile Photo

I’m slowly beginning to accept that my productivity, when working with AI coding agents, is limited by my human brain. AI can do many tasks in parallel, but I can only track the context of a few, so I only run a few tasks at a time. I am the bottleneck.