Rudi Ranck, PhD(@rudiranck) 's Twitter Profileg
Rudi Ranck, PhD

@rudiranck

AI Research Scientist & Entrepreneur —

I delve into the depths of data, discovering hidden patterns and unlocking the secrets that lie within. ✨

ID:1518885011826610176

linkhttp://rudi-ai.com calendar_today26-04-2022 09:30:48

411 Tweets

447 Followers

2,9K Following

Rudi Ranck, PhD(@rudiranck) 's Twitter Profile Photo

So, just a thought, I think the selection criteria should at least balance representatives from the corporate world with those from academia.

account_circle
Rudi Ranck, PhD(@rudiranck) 's Twitter Profile Photo

I was not expecting for this one. Phi-3 with 3.8b beating Llama 3 8b instruct on most benchmark metrics. Reading their technical report right now: export.arxiv.org/abs/2404.14219

I was not expecting for this one. Phi-3 with 3.8b beating Llama 3 8b instruct on most benchmark metrics. Reading their technical report right now: export.arxiv.org/abs/2404.14219
account_circle
Rudi Ranck, PhD(@rudiranck) 's Twitter Profile Photo

Excellent website to compare API providers for LLMs,
including price, throughput, variance and latency.
Here for Llama 3 8b:

artificialanalysis.ai/models/llama-3…

account_circle
Rudi Ranck, PhD(@rudiranck) 's Twitter Profile Photo

It's really impressive this real-time feature.

However, I would much prefer the kid to imagine without it.

I haven't fully processed it yet, but it somehow seems to alleviate the integral role of the imagination and passively relying on the visual system.

account_circle
Rudi Ranck, PhD(@rudiranck) 's Twitter Profile Photo

In Human Preferences Evaluation:
Microsoft's WizardLM 2 70B shows signs that it could be better than Llama 3 70B. Check the comparison against Mistral for both.

Currently the page is out - they are running a new 'toxicity testing'
web.archive.org/web/2024041601…

In Human Preferences Evaluation: Microsoft's WizardLM 2 70B shows signs that it could be better than Llama 3 70B. Check the comparison against Mistral for both. Currently the page is out - they are running a new 'toxicity testing' web.archive.org/web/2024041601…
account_circle