Shreya Shankar (@sh_reya) Twitter Tweets • TwiCopy

1 day ago

huge fan of using AI and copilots to help write common data science code patterns but this marketing is so funny. like 1 million lines of data science code can be one Jupyter notebook with the cell outputs

Likes: 43

Retweets: 3

Madelon Hulsebos

@MadelonHulsebos

4 days ago

Why is finding the right dataset for analytics use-cases still time-consuming?

We surveyed ~90 practitioners on why, what, and how of dataset search, and surface key challenges and desired system capabilities 🚀!

paper: madelonhulsebos.com/assets/dataset…
talk: HILDA 2024 ACM SIGMOD’24✨!

Shreya Shankar

1 week ago

would be very curious to know if and how people are using chatgpt memory & custom instructions features to actually teach the LLM things

Likes: 17

Retweets: 0

Shreya Shankar

1 week ago

gpt 4o is way worse than gpt 4 at latex-wrangling for me. e.g., it gives bullets in md format instead of itemize env. i thought about abusing the memory feature (“remember that bullets in latex should start with \begin{itemize}”) but im lazy af and just switched back to gpt4

Likes: 22

Retweets: 1

Shreya Shankar

1 week ago

i am giving a guest lecture on scaling up vibe checks for your custom LLM pipelines, which is essential to constructing a good fine-tuning dataset!! super stoked for it, and excited to be in the company of the awesome folks lecturing ☺️

sign up here: maven.com/s/course/76981…

Likes: 46

Retweets: 5

Shreya Shankar

1 week ago

scores super high on faithfulness (h/t valerie)

Likes: 16

Retweets: 0

Ian Arawjo

@IanArawjo

2 weeks ago

Ethan Mollick Why does everyone see these tools then say 'prompt engineering is dead'? It's just been emphasized even more! And how do we evaluate the prompt is better? 'Automatically generates good prompts' --how do we know it's good? It 'works pretty well' --what tests did you perform?

Likes: 13

Retweets: 1

Shreya Shankar

2 weeks ago

a chief reason for fine-tuning is to get rid of slop in the outputs of your pipelines

Likes: 34

Retweets: 1

Shreya Shankar

2 weeks ago

Agreed. daniel bashir is a fantastic interviewer. I remember being so impressed by all the preparation that went into my episode. I’ve never seen anything like it

Likes: 16

Retweets: 3

Shreya Shankar

2 weeks ago

i hate when i have a UX idea that i think is really good on a sunday night so i rush to implement it & get bogged down with debugging my bad typescript & emerge with a working solution hours later only to find that it was a bad idea & i should have just gone to bed early

Likes: 59

Retweets: 0

Ian Arawjo

@IanArawjo

2 weeks ago

We will talk about our work, “Who Validates the Validators? Aligning LLM-Assisted Evaluation of LLM Outputs with Human Preferences”, at HEAL (well, at least I will be there 😎). Excited to chat with all the amazing people working in this space!

Likes: 22

Retweets: 3