Shreya Shankar (@sh_reya) Twitter Tweets • TwiCopy

1 week ago

i am giving a guest lecture on scaling up vibe checks for your custom LLM pipelines, which is essential to constructing a good fine-tuning dataset!! super stoked for it, and excited to be in the company of the awesome folks lecturing ☺️

sign up here: maven.com/s/course/76981…

thumb_up_off_alt47

repeat5

account_circle

Shreya Shankar

1 week ago

scores super high on faithfulness (h/t valerie)

thumb_up_off_alt16

repeat0

account_circle

Ian Arawjo (@[email protected])

@IanArawjo

2 weeks ago

Ethan Mollick Why does everyone see these tools then say 'prompt engineering is dead'? It's just been emphasized even more! And how do we evaluate the prompt is better? 'Automatically generates good prompts' --how do we know it's good? It 'works pretty well' --what tests did you perform?

thumb_up_off_alt13

repeat1

account_circle

Shreya Shankar

2 weeks ago

a chief reason for fine-tuning is to get rid of slop in the outputs of your pipelines

thumb_up_off_alt34

repeat1

account_circle

Shreya Shankar

2 weeks ago

Agreed. daniel bashir is a fantastic interviewer. I remember being so impressed by all the preparation that went into my episode. I’ve never seen anything like it

thumb_up_off_alt16

repeat3

account_circle

Shreya Shankar

2 weeks ago

i hate when i have a UX idea that i think is really good on a sunday night so i rush to implement it & get bogged down with debugging my bad typescript & emerge with a working solution hours later only to find that it was a bad idea & i should have just gone to bed early

thumb_up_off_alt59

repeat0

account_circle

Ian Arawjo (@[email protected])

@IanArawjo

2 weeks ago

We will talk about our work, “Who Validates the Validators? Aligning LLM-Assisted Evaluation of LLM Outputs with Human Preferences”, at HEAL (well, at least I will be there 😎). Excited to chat with all the amazing people working in this space!

thumb_up_off_alt22

repeat3