Knut Jägersberg(@JagersbergKnut) 's Twitter Profileg
Knut Jägersberg

@JagersbergKnut

Content Strategy & AI

@[email protected]

https://t.co/xnBUK02hWS

ID:1010498049058201600

linkhttps://www.linkedin.com/in/knut-jägersberg calendar_today23-06-2018 12:21:23

76,6K Tweets

5,5K Followers

4,6K Following

Philip Vollet(@philipvollet) 's Twitter Profile Photo

Our friends at Unbody launched today on Product Hunt! Let's support them! Unbody is an AI-native content API that automates your full pipeline.

Connect your data and start building with a single line of code.

Build with Weaviate • vector database

producthunt.com/posts/unbody

account_circle
Junyang Lin(@JustinLin610) 's Twitter Profile Photo

I guess you might have tried the demo (huggingface.co/spaces/Qwen/Qw…). Now the weights of Qwen1.5-110B are out! Temporarily only the base and chat models, AWQ and GGUF quantized models are about to be released very soon!

Blog: qwenlm.github.io/blog/qwen1.5-1…

Hugging Face:…

account_circle
Binyuan Hui(@huybery) 's Twitter Profile Photo

🤠 Qwen1.5-110B model weights released. Qwen2 is on the way, let's take it step by step!
👇🏻 Enjoy it!
hf.co/spaces/Qwen/Qw…

account_circle
Rohan Paul(@rohanpaul_ai) 's Twitter Profile Photo

'Quantizing Llama 3 8B seems more harmful compared to other models'

The 8B model is packed so full of information it's tensores can no longer be as robustly mathematically/ structurally encoded compared to the older 7Bs.

Similar thoughts were explored in the paper - 'How Good…

account_circle
Alignment Lab AI(@alignment_lab) 's Twitter Profile Photo

dont you just love how the industry standard for the major labs is to release a model thats very good, then over the following weeks, make it almost entirely useless and tell no one at all.

i think theres a word for that particular type of thing.

Anthropic 's claude is now…

account_circle
clem 🤗(@ClementDelangue) 's Twitter Profile Photo

Could replace 'serendipity' by 'impact'.

Aka 'The amount of impact that will occur in your life is directly proportional to the degree to which you do something you're passionate about combined with the total number of people to whom this is effectively communicated.'…

account_circle
InternLM(@intern_lm) 's Twitter Profile Photo

🥳Multi-modal Phi-3-mini is here! -Phi-3-mini outperforms LLaVA-v1.5-7B and matches the performance of LLaVA-Llama-3-8B in multiple benchmarks.
😊For easy applications, weights are provided.
👉github.com/InternLM/xtuner
AK

🥳Multi-modal Phi-3-mini is here! #LLaVA-Phi-3-mini outperforms LLaVA-v1.5-7B and matches the performance of LLaVA-Llama-3-8B in multiple benchmarks. 😊For easy applications, #GGUF weights are provided. 👉github.com/InternLM/xtuner @_akhaliq #Phi3
account_circle
AK(@_akhaliq) 's Twitter Profile Photo

Meta presents Layer Skip

Enabling Early Exit Inference and Self-Speculative Decoding

We present LayerSkip, an end-to-end solution to speed-up inference of large language models (LLMs). First, during training we apply layer dropout, with low dropout rates for

Meta presents Layer Skip Enabling Early Exit Inference and Self-Speculative Decoding We present LayerSkip, an end-to-end solution to speed-up inference of large language models (LLMs). First, during training we apply layer dropout, with low dropout rates for
account_circle