hardmaru (@hardmaru)'s Twitter Profile
hardmaru

@hardmaru

Building Collective Intelligence @SakanaAILabs 🧠

ID:2895499182

Link: https://otoro.net/ml
Joined: 10-11-2014 11:05:07

26.7K Tweets

285.6K Followers

1.4K Following

hardmaru (@hardmaru)

The key to using image generation AI successfully is to produce images that don’t look like the typical AI-generated images.

hardmaru (@hardmaru)

Tokyo is ranked the sixth most walkable city in the world

I personally bicycle and walk far more than I drive, measured by total distance traveled over the past couple of years.
timeout.com/tokyo/news/tok…

hardmaru (@hardmaru)

I’m looking forward to the tens of thousands of PEFT’ed Llama-3 70B and 400B models with various new superpowers that will be created and released by the community.

SethBling (@SethBling)

Almost 10 years ago I released MarI/O, a neural network that taught itself to play video games using NeuroEvolution of Augmenting Topologies (NEAT).

Check out my new interview with Ken Stanley, the author of NEAT:

youtu.be/5zg_5hg8Ydo

hardmaru (@hardmaru)

Parameter-Efficient Fine-Tuning for Large Models: A Comprehensive Survey

PEFT algorithms are useful for LLMs with high parameter counts, since even fully fine-tuning these models can be computationally expensive and resource-intensive.

arxiv.org/abs/2403.14608

lmsys.org (@lmsysorg)

Congrats to Microsoft on the open release of Phi-3, their next generation of fast and capable models!

We've collected 6K+ votes for Phi-3 and pushed a new leaderboard release. The model is definitely showing great potential for its size. Excited to see more community fine-tunes!

hardmaru (@hardmaru)

Japan’s New Appetite for Risk Lures Venture Investors to Tokyo

“If we had started Sakana AI in the Bay Area, it would have been a strategic blunder, because we would have looked more like everyone else. It would have been very hard to differentiate.”

theinformation.com/articles/japan…

hardmaru (@hardmaru)

We are looking to hire an HR & Accounting manager to help Sakana AI expand, as we grow our team!

If you are interested, or know someone who might be, please check out our careers website for more information: sakana.ai/careers

Maxime Labonne (@maximelabonne)

🎉 Great news for model merging!

Charles Goddard implemented an evolutionary technique à la Sakana AI in MergeKit (cc hardmaru)

He also released an excellent tutorial on how to use it with lm-evaluation-harness and vllm.

📝 Article: blog.arcee.ai/tutorial-tutor…

Sakana AI (@SakanaAILabs)

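We have released EvoSDXL-JP, an image generation model built with the Evolutionary Model Merge method proposed by Sakana AI. The model supports Japanese and generates images 10 times faster than previous Japanese models.

Blog → sakana.ai/evosdxl-jp
Demo → huggingface.co/spaces/SakanaA…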

Robert Lange (@RobertTLange)

🦎Can we teach Transformers to perform in-context Evolutionary Optimization? Surely! We propose Evolutionary Algorithm Distillation for pre-training Transformers to mimic teachers 🧑‍🏫

🎉 Work done at Google DeepMind 🗼 with Yingtao Tian & Yujin Tang 🤗

📜: arxiv.org/abs/2403.02985

Edoardo Cetin (@edo_cet)

Super excited to share that I joined Sakana AI as a Research Scientist!

Looking forward to working with an amazing team to develop new nature-inspired methods and tackle some of AI's most relevant challenges ^^
