Daily AI Papers (@papers_daily) 's Twitter Profile
Daily AI Papers

@papers_daily

ID: 1397114203601657857

Website: https://labml.ai · Joined: 25-05-2021 08:56:03

6.6K Tweets

17.17K Followers

2 Following

labml.ai (@labmlai) 's Twitter Profile Photo

We wrote up some of the best practices we feel are useful for ML projects: github.com/labmlai/labml/… Here's a summary πŸ§΅πŸ‘‡

hehehehe (@luck_not_shit) 's Twitter Profile Photo

Encoding floating point arrays with Base64 gives a 4x compression over JSON πŸš€. Quite useful when you have to transfer larger arrays. Encoded float arrays can be included as a string in the JSON objects keeping existing structures intact. Doesn’t require any external libraries.
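
A minimal sketch of the trick in plain Python (the helper names here are illustrative, not from labml): pack the floats as binary, Base64-encode the bytes, and drop the resulting string into the existing JSON object.

```python
import base64
import json
import struct

def encode_floats(values):
    # Pack as 32-bit little-endian floats (4 bytes each), then Base64-encode.
    # Base64 expands binary by ~4/3, so each float costs ~5.3 characters
    # instead of the 15-20 characters of a decimal literal in JSON text.
    raw = struct.pack(f'<{len(values)}f', *values)
    return base64.b64encode(raw).decode('ascii')

def decode_floats(encoded):
    raw = base64.b64decode(encoded)
    return list(struct.unpack(f'<{len(raw) // 4}f', raw))

# The encoded array is just a string, so it slots into existing JSON structures.
payload = json.dumps({'metric': 'loss', 'values': encode_floats([0.25, 0.125, 0.0625])})
restored = decode_floats(json.loads(payload)['values'])
```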

vpj (@vpj) 's Twitter Profile Photo

.labml.ai deep learning experiment monitoring app got significantly more responsive after hehehehe implemented base64 encoding for long float arrays instead of plain JSON. It uses 4X less data transfer.

labml.ai (@labmlai) 's Twitter Profile Photo

We’ve open-sourced our LLM attention visualization library. It generates interactive visualizations of attention matrices with just a few lines of Python code in notebooks. hehehehe cleaned up and polished the existing code to make it open source.
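
As a rough illustration of where such attention matrices come from, here is a sketch using the Hugging Face transformers API with a plain matplotlib heatmap as a stand-in for the library's own interactive plots; the model name is just an example.

```python
import torch
import matplotlib.pyplot as plt
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained('gpt2')        # example model
model = AutoModelForCausalLM.from_pretrained('gpt2')

inputs = tokenizer('Attention visualizations help debug LLMs', return_tensors='pt')
with torch.no_grad():
    outputs = model(**inputs, output_attentions=True)

# outputs.attentions holds one (batch, heads, seq, seq) tensor per layer.
attn = outputs.attentions[-1][0].mean(dim=0).numpy()     # last layer, head-averaged
tokens = tokenizer.convert_ids_to_tokens(inputs['input_ids'][0])

# Static heatmap as a stand-in for the interactive notebook visualization.
plt.imshow(attn, cmap='viridis')
plt.xticks(range(len(tokens)), tokens, rotation=90)
plt.yticks(range(len(tokens)), tokens)
plt.colorbar()
plt.show()
```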

labml.ai (@labmlai) 's Twitter Profile Photo

✨ Annotated DL Paper Implementation repository reached 50K stars.

It has implementations of a wide range of deep learning concepts including Transformers and variations, StyleGAN, Stable Diffusion, Normalization layers, RL, Optimizers...

πŸ§ΆπŸ‘‡
labml.ai (@labmlai) 's Twitter Profile Photo

The machine generated Chinese translation of annotated paper implementations repo is being improved with manual translations by @pengchzn πŸ™

@pengchzn has already finished the basic transformer including multi-head attention.

πŸ‘‡
labml.ai (@labmlai) 's Twitter Profile Photo

πŸŽ‰ Excited to share that we've added a distribution visualization to our library, Inspectus. It plots the full distribution of the data across training steps, which helps you see how training is going and instantly spot the impact of outliers. πŸ‘‡
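
The idea, sketched here with NumPy and matplotlib rather than the Inspectus API: compute percentiles of the metric at every training step and shade the bands, so the spread and the outliers stay visible instead of being averaged away.

```python
import numpy as np
import matplotlib.pyplot as plt

# Fake data: 512 per-sample losses at each of 200 training steps.
rng = np.random.default_rng(0)
steps = np.arange(200)
data = rng.lognormal(mean=-0.01 * steps[:, None], sigma=0.5, size=(200, 512))

# Percentile bands keep the whole distribution visible, not just the mean.
p = np.percentile(data, [5, 25, 50, 75, 95], axis=1)

plt.fill_between(steps, p[0], p[4], alpha=0.2, label='5-95%')
plt.fill_between(steps, p[1], p[3], alpha=0.4, label='25-75%')
plt.plot(steps, p[2], label='median')
plt.xlabel('training step')
plt.ylabel('loss')
plt.legend()
plt.show()
```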

vpj (@vpj) 's Twitter Profile Photo

I first found plotting the distribution useful when I was trying RL algorithms on Atari around 2018/19. I used TensorBoard back then. It was quite useful to look at the score distribution of the rollouts: it showed how the policy was behaving much more clearly than the mean alone.
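
For reference, logging a per-step score distribution with TensorBoard looks roughly like this; the rollout collection here is a random stand-in for running an actual policy.

```python
import numpy as np
from torch.utils.tensorboard import SummaryWriter

rng = np.random.default_rng(0)

def collect_rollout_scores(step, n_envs=16):
    # Stand-in for running the current policy: one episode score per rollout.
    return rng.normal(loc=0.05 * step, scale=5.0 + 0.01 * step, size=n_envs)

writer = SummaryWriter('runs/atari_score_distribution')
for step in range(1000):
    scores = collect_rollout_scores(step)
    # The histogram view shows how the whole score distribution of the policy's
    # rollouts shifts and spreads over training, not just its mean.
    writer.add_histogram('rollout/score', scores, global_step=step)
    writer.add_scalar('rollout/score_mean', float(scores.mean()), global_step=step)
writer.close()
```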

labml.ai (@labmlai) 's Twitter Profile Photo

We should be able to release an update of the labml experiment monitoring library very soon πŸ˜‚ It has a bunch of cool new features.

NOTBAD AI (@notbadai) 's Twitter Profile Photo

We’ve been training NVIDIA Mistral-NeMo-Minitron-8B-Base for math reasoning on the GSM8K-Aug dataset, and we have a version with a 70.2% GSM8K score, up from the 58.5% CoT score reported in the LLM Pruning and Distillation paper. πŸ‘‡

labml.ai (@labmlai) 's Twitter Profile Photo

Annotated PyTorch implementation of LoRA (Low-Rank Adaptation of LLMs)

πŸ“ Code + Notes: nn.labml.ai/lora/index.html
πŸ“Ž Paper: arxiv.org/abs/2106.09685

LoRA freezes the pre-trained model and trains smaller injected weights, enabling faster and more memory-efficient fine-tuning.

πŸ‘‡
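
A bare-bones sketch of the mechanism (the annotated implementation at the link is the full version): the pre-trained linear layer is frozen, and only the injected low-rank factors A and B are trained.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class LoRALinear(nn.Module):
    """Wraps a pre-trained nn.Linear: W and b are frozen, only A and B train."""

    def __init__(self, pretrained: nn.Linear, r: int = 8, alpha: int = 16):
        super().__init__()
        self.base = pretrained
        self.base.weight.requires_grad_(False)            # freeze pre-trained W
        if self.base.bias is not None:
            self.base.bias.requires_grad_(False)
        # Injected low-rank factors: A is small random, B starts at zero so the
        # layer's initial output matches the pre-trained model exactly.
        self.lora_a = nn.Parameter(torch.randn(r, pretrained.in_features) * 0.01)
        self.lora_b = nn.Parameter(torch.zeros(pretrained.out_features, r))
        self.scaling = alpha / r

    def forward(self, x):
        # y = x W^T + b + (alpha / r) * x A^T B^T
        return self.base(x) + self.scaling * F.linear(F.linear(x, self.lora_a), self.lora_b)

layer = LoRALinear(nn.Linear(768, 768))
out = layer(torch.randn(2, 10, 768))   # only lora_a and lora_b receive gradients
```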
labml.ai (@labmlai) 's Twitter Profile Photo

We added token visualization to Inspectus. It lets you visualize metrics associated with tokens, such as loss, entropy, KL divergence, etc. It works in notebooks and is pretty easy to use. πŸ‘‡
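
As a sketch of what such per-token metrics are, computed here with plain PyTorch on random logits and independent of the Inspectus API:

```python
import torch
import torch.nn.functional as F

# logits: (seq, vocab) next-token predictions; targets: (seq,) actual token ids.
torch.manual_seed(0)
seq_len, vocab = 6, 50_000
logits = torch.randn(seq_len, vocab)
targets = torch.randint(vocab, (seq_len,))

log_probs = F.log_softmax(logits, dim=-1)
per_token_loss = -log_probs[torch.arange(seq_len), targets]       # cross-entropy per token
per_token_entropy = -(log_probs.exp() * log_probs).sum(dim=-1)    # predictive entropy per token

# These per-token values are what the visualization overlays on the token strings.
for loss, ent in zip(per_token_loss.tolist(), per_token_entropy.tolist()):
    print(f'loss={loss:.2f}  entropy={ent:.2f}')
```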

NOTBAD AI (@notbadai) 's Twitter Profile Photo

πŸ“’ We are excited to announce Notbad v1.0 Mistral 24B, a new reasoning model trained in math and Python coding. This model is built upon the Mistral AI Small 24B 2501 and has been further trained with reinforcement learning on math and coding.

NOTBAD AI (@notbadai) 's Twitter Profile Photo

We're open-sourcing a math reasoning dataset with 270k samples, generated by our RL-based self-improved Mistral 24B 2501 model and used to train Notbad v1.0 Mistral 24B. Available on Hugging Face: huggingface.co/datasets/notba…

vpj (@vpj) 's Twitter Profile Photo

Uploaded the dataset of 270k math reasoning samples that we used to finetune Notbad v1.0 Mistral 24B (MATH-500 = 77.52%, GSM8K Platinum = 97.55%) to Hugging Face (link in reply). Follow NOTBAD AI for updates.

NOTBAD AI (@notbadai) 's Twitter Profile Photo

We are releasing an updated reasoning model with an improved IFEval score of 77.9%, up from 51.4% for our previous model.

πŸ‘‡ Links to try the model and to download weights below
vpj (@vpj) 's Twitter Profile Photo

The new training also improved GPQA from 64.2% to 67.3% and MMLU Pro from 64.2% to 67.3%. This model was also trained with the same reasoning datasets we used to train the v1.0 model. We mixed more general instruction data with answers sampled from the