Knut Jägersberg (@JagersbergKnut) Twitter Tweets • TwiCopy

Knut Jägersberg

@JagersbergKnut

Content Strategy & AI

https://t.co/xnBUK02hWS

ID:1010498049058201600

https://www.linkedin.com/in/knut-jägersberg

Joined 23-06-2018 12:21:23

76.6K Tweets

5.5K Followers

4.6K Following

Philip Vollet

@philipvollet

1 day ago

Our friends at Unbody launched today on Product Hunt! Let's support them! Unbody is an AI-native content API that automates your full pipeline.

Connect your data and start building with a single line of code.

Build with Weaviate • vector database

producthunt.com/posts/unbody

6 likes · 0 replies · 3 retweets

nat://TheAIObserverX

@TheAIObserverX

1 day ago

I find this puzzling: Why does Anthropic seem to be crawling the web more extensively than Google?

2 likes · 0 replies · 1 retweet

Junyang Lin

@JustinLin610

1 day ago

I guess you might have tried the demo (huggingface.co/spaces/Qwen/Qw…). Now the weights of Qwen1.5-110B are out! For now only the base and chat models are available; AWQ and GGUF quantized models will be released very soon!

Blog: qwenlm.github.io/blog/qwen1.5-1…

Hugging Face:…

Binyuan Hui

@huybery

1 day ago

🤠 Qwen1.5-110B model weights released. Qwen2 is on the way, let's take it step by step!
👇🏻 Enjoy it!
hf.co/spaces/Qwen/Qw…

34 likes · 0 replies · 8 retweets

Rohan Paul

@rohanpaul_ai

1 day ago

'Quantizing Llama 3 8B seems more harmful compared to other models'

The 8B model is packed so full of information that its tensors can no longer be as robustly mathematically/structurally encoded as those of the older 7Bs.

Similar thoughts were explored in the paper - 'How Good…

17 likes · 0 replies · 5 retweets

Alex Yanko 🇺🇦

@LeopolisDream

1 day ago

Fine-tune Llama 3 on a million-scale dataset on a consumer GPU using QLoRA and DeepSpeed

medium.com/@sumandas0/fin…

Knut Jägersberg

@JagersbergKnut

1 day ago

Time flies, so I've updated my performance LLM collections

huggingface.co/KnutJaegersberg

20 likes · 0 replies · 6 retweets

Alignment Lab AI

@alignment_lab

1 day ago

don't you just love how the industry standard for the major labs is to release a model that's very good, then over the following weeks, make it almost entirely useless and tell no one at all.

i think there's a word for that particular type of thing.

Anthropic's claude is now…

16 likes · 0 replies · 1 retweet

clem 🤗

@ClementDelangue

2 days ago

Could replace 'serendipity' by 'impact'.

Aka 'The amount of impact that will occur in your life is directly proportional to the degree to which you do something you're passionate about combined with the total number of people to whom this is effectively communicated.'…

38 likes · 0 replies · 6 retweets

InternLM

@intern_lm

2 days ago

🥳 Multi-modal Phi-3-mini is here! #LLaVA-Phi-3-mini outperforms LLaVA-v1.5-7B and matches the performance of LLaVA-Llama-3-8B in multiple benchmarks.
😊For easy applications, #GGUF weights are provided.
👉github.com/InternLM/xtuner
AK #Phi3

AK

@_akhaliq

1 day ago

Meta presents LayerSkip

Enabling Early Exit Inference and Self-Speculative Decoding

We present LayerSkip, an end-to-end solution to speed up inference of large language models (LLMs). First, during training we apply layer dropout, with low dropout rates for