Oscar Le (@oscarle_x)'s Twitter Profile
Oscar Le

@oscarle_x

Cofounder & CEO of SnapEdit. Top AI photo editor app, 40M users, 4M MAU. Ph.D. in Computer Science, expertise in AI. #ai #snapedit

ID: 752515594386280448

Link: https://snapedit.app/
Joined: 11-07-2016 14:51:13

913 Tweets

3.3K Followers

4.4K Following

Sadly, gpt-oss-120B may be even worse than gpt-4o-mini in terms of knowledge (and 4o-mini, according to rumour, has only 40B parameters). Here is my favorite question, and 4o-mini answers it correctly without even thinking. All 70B+ models always answer this question correctly.

LLMs are now in an incremental improvement phase, so this is a good time to build LLM wrappers. You don't have to worry that your startup will become a bullet point in the slides of the next Google I/O.

Coding with LLMs may lead to the rise of the multitasking brain. Before, we valued single-tasking because it produced superior quality. But now that LLMs do the heavy lifting, someone who can oversee 10 coding sessions at the same time may well be 5x more efficient than others.

I kind of like GPT-5-thinking, because I don't see it as much different from o3 at all. It's just like o3 with a different name. The only difference is that when thinking, it only shows a headline, not a paragraph of thinking anymore. So sometimes I thought it was frozen.

I guess OpenAI couldn't improve the performance of GPT-5 much, so they focused on reducing hallucination to have good things to report. And that trend will continue. There is a lot of room for reducing hallucination that AI labs haven't explored.

I'm curious whether big AI companies play the espionage game like Cold War countries. Like spying on when they release new models and how good the benchmarks are before the announcement...

When receiving a gig, you can most of the time say yes and figure out a solution within a week by assembling components that are already available. That requires reading a lot and trying things out a lot, but it can normally be done within the timeframe. The solution will be sloppy but

People think VR is the future, but VR never gives us the true feeling of immersion in a new world. What truly immerses us in a new world is dreaming. Better to develop a dream machine than VR.

When I was in school, I learned that all you need to do is sit in the front row. Professors will notice and remember you, and you will get a decent grade. I mostly sat at the back of the class though. But every time I sat in the front, I got a good grade.

Why are you mulling that much, brother? With these kinds of 10-minute waits, running 5 Claude Code sessions in parallel is the only way to go.

With Cursor I'm aware of every change to my code, but with Claude Code I find it keeps adding code until I lose control. Maybe one reason is that Cursor's diff review is easier to follow. Another reason is that the terminal output in Claude Code is hidden under Ctrl+R.

My current setup: VS Code + Claude Code + Continue_dev
- I plan the design, architecture, and implementation decisions... CC is bad at this. I break the work down into steps and tell CC to note them down in md files.
- CC implements each step and the tests. When everything is clear, CC can implement 1000s

Before LLMs, novels and high-quality journals actually used em-dashes a lot. I don't think normal people use them much because they are hard to type. Maybe that is the reason LLMs think the em-dash is the hallmark of high-quality writing and churn out em-dashes whenever they find a chance.

We are starting research on "body shaping". We had been avoiding working on this for a long time because this feature could create unrealistic expectations, and Chinese apps are already very good at it. But surveys of our users say otherwise. So here we are.

The more time I spend with Claude Code, the more it feels like a mildly toxic relationship. Claude keeps hiding errors from me, and when I find out, it just says sorry, you are right, I shouldn't do that. I don't want to micromanage though; that is even more toxic.

We are updating our onboarding. Now we use animated videos to guide users to try out the right tool for them. For mobile apps, D1 retention is usually <30%, so onboarding is one of the most important things to A/B test frequently. Good things may come out if you try enough.

I'm quite sure it is because either (1) they are too optimistic and let LLMs do things they are not good at, or (2) they are too pessimistic and let LLMs do things that provide too little value to notice. Failing the first time is okay; they need to keep trying until they find a sweet spot.
