Mark Woodward (@datasciencemw) 's Twitter Profile
Mark Woodward

@datasciencemw

Data Scientist at SIL International | Machine Learning | NLP for low-resource languages.

ID: 1412358446247186432

linkhttps://github.com/woodwardmw calendar_today06-07-2021 10:33:28

279 Tweet

292 Followers

510 Following

Mark Woodward (@datasciencemw) 's Twitter Profile Photo

How long until people start choosing their remote working location based on timezones where ChatGPT isn't over capacity during working hours?

Mark Woodward (@datasciencemw) 's Twitter Profile Photo

I'd love to have an email assistant that not only helped me compose emails, but was fine-tuned on the thousands of emails that I've written so it could do so while sounding like me, rather than sounding like a generic English speaker.

Mark Woodward (@datasciencemw) 's Twitter Profile Photo

I enjoy doing the bnomial machine learning question each day. I think this latest one must still have been in my subconscious last night, because I dreamed about it and had an insight in my sleep into a machine learning project I'm advising someone on! today.bnomial.com/?20230121

Mark Woodward (@datasciencemw) 's Twitter Profile Photo

My Saturday afternoon project was going to be attempting to run Stable Diffusion via Modal. It turns out it only took me 10 minutes, because they already have the code, and it was so straightforward...! modal.com/docs/guide/ex/…

Sahil Patel (@saaaahiiiil) 's Twitter Profile Photo

I'm currently pursuing an MA in Linguistics from the University of Mumbai. We have a mandatory year long field linguistics course where every year, we try to document either an endangered or a lesser studied language. A few things I learnt from this course, a 🧵

Cristiano Giardina (@crisgiardina) 's Twitter Profile Photo

Tired of waiting for GPT-4-32k? Bing Chat Creative Mode uses a GPT-4 model with a context window of ~13,500 tokens! Here's how you can use it & how I found this out!

Tired of waiting for GPT-4-32k?

Bing Chat Creative Mode uses a GPT-4 model with a context window of ~13,500 tokens!

Here's how you can use it & how I found this out!
Yann LeCun (@ylecun) 's Twitter Profile Photo

MMS: Massively Multilingual Speech. - Can do speech2text and text speech in 1100 languages. - Can recognize 4000 spoken languages. - Code and models available under the CC-BY-NC 4.0 license. - half the word error rate of Whisper. Code+Models: github.com/facebookresear… Paper:

Adam Azzam (@aaazzam) 's Twitter Profile Photo

You can do constrained sampling in ChatGPT and micromanage its output. The catch? You only get one token. But you can do a lot with that one token. You can make it a classifier, a logic gate, or have it choose tools deductively. Here's how 🧵 From my talk this week Chroma

Mark Woodward (@datasciencemw) 's Twitter Profile Photo

Realizing (again) how much more useful this app is if I switch to the Following tab. Posts from people I've chosen to follow, which only have a handful of likes and comments, are generally way more informative than posts with 10k likes which have been optimized for the algorithm.

Towards Data Science (@tdatascience) 's Twitter Profile Photo

In a new deep dive, Rahul Nayak walks us through the process of building an AI research agent capable of answering questions about the intricate details within the Mahabharata, the longest epic poem ever composed. buff.ly/3L3e60W

Jeremy Howard (@jeremyphoward) 's Twitter Profile Photo

For those that hope (or worry) that LLMs will do breakthrough scientific research, I've got good (or bad) news: LLMs are particularly, exceedingly, marvellously ill-suited to this task. (if you're a researcher, you'll have noticed this already) Here's why🧵