Mark Woodward (@datasciencemw) Twitter Tweets • TwiCopy

Mark Woodward

@datasciencemw

+ Follow

Data Scientist at SIL International | Machine Learning | NLP for low-resource languages.

ID: 1412358446247186432

linkhttps://github.com/woodwardmw calendar_today06-07-2021 10:33:28

279 Tweet

292 Followers

510 Following

Mark Woodward

@datasciencemw

3 years ago

How long until people start choosing their remote working location based on timezones where ChatGPT isn't over capacity during working hours?

thumb_up_off_alt1

chat_bubble_outline0

repeat0

shareShare

I'd love to have an email assistant that not only helped me compose emails, but was fine-tuned on the thousands of emails that I've written so it could do so while sounding like me, rather than sounding like a generic English speaker.

thumb_up_off_alt1

chat_bubble_outline0

repeat0

shareShare

Mark Woodward

@datasciencemw

3 years ago

Today I learned that in Python the string '\n' has length 1, not 2...!

thumb_up_off_alt17

chat_bubble_outline2

repeat0

shareShare

Mark Woodward

@datasciencemw

3 years ago

I enjoy doing the bnomial machine learning question each day. I think this latest one must still have been in my subconscious last night, because I dreamed about it and had an insight in my sleep into a machine learning project I'm advising someone on! today.bnomial.com/?20230121

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

Mark Woodward

@datasciencemw

3 years ago

My Saturday afternoon project was going to be attempting to run Stable Diffusion via Modal. It turns out it only took me 10 minutes, because they already have the code, and it was so straightforward...! modal.com/docs/guide/ex/…

thumb_up_off_alt47

chat_bubble_outline2

repeat8

shareShare

Sahil Patel

@saaaahiiiil

3 years ago

I'm currently pursuing an MA in Linguistics from the University of Mumbai. We have a mandatory year long field linguistics course where every year, we try to document either an endangered or a lesser studied language. A few things I learnt from this course, a 🧵

thumb_up_off_alt362

chat_bubble_outline12

repeat40

shareShare

Cristiano Giardina

@crisgiardina

3 years ago

Tired of waiting for GPT-4-32k? Bing Chat Creative Mode uses a GPT-4 model with a context window of ~13,500 tokens! Here's how you can use it & how I found this out!

thumb_up_off_alt955

chat_bubble_outline18

repeat111

shareShare

Yann LeCun

@ylecun

3 years ago

MMS: Massively Multilingual Speech. - Can do speech2text and text speech in 1100 languages. - Can recognize 4000 spoken languages. - Code and models available under the CC-BY-NC 4.0 license. - half the word error rate of Whisper. Code+Models: github.com/facebookresear… Paper:

thumb_up_off_alt5,5K

chat_bubble_outline168

repeat1,1K

shareShare

Adam Azzam

@aaazzam

3 years ago

You can do constrained sampling in ChatGPT and micromanage its output. The catch? You only get one token. But you can do a lot with that one token. You can make it a classifier, a logic gate, or have it choose tools deductively. Here's how 🧵 From my talk this week Chroma

thumb_up_off_alt131

chat_bubble_outline5

repeat25

shareShare

Mark Woodward

@datasciencemw

3 years ago

Realizing (again) how much more useful this app is if I switch to the Following tab. Posts from people I've chosen to follow, which only have a handful of likes and comments, are generally way more informative than posts with 10k likes which have been optimized for the algorithm.

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

Towards Data Science

@tdatascience

3 years ago

In a new deep dive, Rahul Nayak walks us through the process of building an AI research agent capable of answering questions about the intricate details within the Mahabharata, the longest epic poem ever composed. buff.ly/3L3e60W

thumb_up_off_alt12

chat_bubble_outline0

repeat3

shareShare

Jeremy Howard

@jeremyphoward

2 years ago

For those that hope (or worry) that LLMs will do breakthrough scientific research, I've got good (or bad) news: LLMs are particularly, exceedingly, marvellously ill-suited to this task. (if you're a researcher, you'll have noticed this already) Here's why🧵

thumb_up_off_alt3,3K

chat_bubble_outline106

repeat519

shareShare