Anwesha Chowdhury (@anweshac1211)'s Twitter Profile
Anwesha Chowdhury

@anweshac1211

AI Research Engineer | Open Source Contributor |
CTO of an AI Fashion Startup based in UK

ID: 1658986663803228161

Link: https://achowdhury1211.github.io
Joined: 18-05-2023 00:03:55

163 Tweets

38 Followers

378 Following

Anwesha Chowdhury (@anweshac1211)'s Twitter Profile Photo

interesting read on parallelism
some takeaways:
- expert parallelism is the same as data parallelism for non-moe layers
- for dp: the input batches of data are divided among the GPUs
- for model parallelism: the input batches are replicated on each core.
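The batch-handling contrast in those takeaways can be sketched as follows. This is a minimal NumPy illustration, not an actual framework API: "devices" are simulated as list entries, and the function names are assumptions made up for the example.

```python
import numpy as np

def data_parallel_shards(batch: np.ndarray, n_devices: int):
    """Data parallelism: the input batch is divided among the devices."""
    return np.array_split(batch, n_devices)  # each device gets a slice

def model_parallel_shards(batch: np.ndarray, n_devices: int):
    """Model parallelism: the input batch is replicated on each device."""
    return [batch.copy() for _ in range(n_devices)]  # each device gets it all

batch = np.arange(8).reshape(8, 1)  # a batch of 8 examples

dp = data_parallel_shards(batch, 4)
mp = model_parallel_shards(batch, 4)

print([s.shape[0] for s in dp])  # each DP device holds 2 examples
print([s.shape[0] for s in mp])  # each MP device holds all 8 examples
```

Concatenating the data-parallel shards recovers the original batch, which is why gradients from the shards must be averaged before the weight update.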
A.I.Warper (@aiwarper)'s Twitter Profile Photo

Fun workflow I was playing with last night
1) Kontext to remove Thor from the shot
2) Photopea to place Shrek
3) Kontext + Relight lora to blend him into the shot
4) Wan2.2 i2V to animate
Very addicting... Prompts are written up in the corner. Wan 2.2 prompt below 👇

Ostris (@ostrisai)'s Twitter Profile Photo

Trained a sidechain LoRA to compensate for the quantization precision loss when quantizing Qwen Image to 3 bit. It works well. This can be active during training and should allow us to fine tune Qwen Image on <24GB of VRAM. This can be done to all models.
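The idea behind that sidechain LoRA can be sketched numerically: quantize a weight matrix to 3 bits, then fit a low-rank correction to the quantization error. This is a hedged toy illustration, not Ostris's actual method; the quantizer is a naive symmetric one, and a truncated SVD stands in for the trained LoRA factors `A` and `B`.

```python
import numpy as np

rng = np.random.default_rng(0)
W = rng.standard_normal((64, 64)).astype(np.float32)  # a stand-in weight matrix

# Naive symmetric 3-bit quantization (8 levels); real schemes are per-group/per-channel.
levels = 2 ** 3
scale = np.abs(W).max() / (levels / 2 - 1)
W_q = np.round(W / scale).clip(-(levels // 2), levels // 2 - 1) * scale

# Low-rank approximation of the residual error via truncated SVD,
# standing in for a trained sidechain LoRA: W ≈ W_q + A @ B with rank r.
err = W - W_q
U, S, Vt = np.linalg.svd(err, full_matrices=False)
r = 8
A = U[:, :r] * S[:r]   # (64, r)
B = Vt[:r]             # (r, 64)

before = np.linalg.norm(err)
after = np.linalg.norm(W - (W_q + A @ B))
print(after < before)  # the low-rank correction reduces the reconstruction error
```

Because the correction lives in small low-rank factors, the 3-bit base weights stay frozen and cheap to hold in VRAM, which is what makes fine-tuning under 24GB plausible.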