anton (@abacaj)'s Twitter Profile
anton

@abacaj

Software engineer. Hacking on large language models

ID:70514287

Joined: 31-08-2009 22:06:04

10.8K Tweets

36.1K Followers

518 Following

AK (@_akhaliq):

From Words to Numbers

Your Large Language Model Is Secretly A Capable Regressor When Given In-Context Examples

We analyze how well pre-trained large language models (e.g., Llama2, GPT-4, Claude 3) can do linear and non-linear regression when given in-context examples,

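The in-context regression setup described above can be sketched as a simple prompt builder. This is a minimal illustration, not code from the paper; the function name and "Input:/Output:" formatting are assumptions.

```python
def regression_prompt(examples, query_x):
    """Format (x, y) training pairs as in-context examples, then
    ask the model to complete the output for a new input."""
    lines = [f"Input: {x:.2f}\nOutput: {y:.2f}" for x, y in examples]
    lines.append(f"Input: {query_x:.2f}\nOutput:")
    return "\n\n".join(lines)

# Noise-free linear data y = 2x + 1; a capable model should
# complete the final "Output:" with a value near 9.00.
prompt = regression_prompt([(1.0, 3.0), (2.0, 5.0), (3.0, 7.0)], 4.0)
```

The model's completion for the final line is then parsed as its regression prediction.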
anton (@abacaj):

one thing I realized (which wasn't so obvious to me) is that there are plenty of people who don't really want to prompt models like gpt-4/claude, even though the models are usable on their own (without a wrapper)

people would rather have a guided workflow (questions) that then…

Aran Komatsuzaki (@arankomatsuzaki):

Google presents Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention

A 1B model fine-tuned on passkey instances of up to 5K sequence length solves the 1M-length problem

arxiv.org/abs/2404.07143

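A minimal sketch of the compressive-memory recurrence at the core of Infini-attention, assuming the linear-attention form the paper uses (with σ = ELU + 1). The function name, shapes, and epsilon are illustrative, and the real model combines this retrieved state with local attention via a learned gate, which is omitted here.

```python
import numpy as np

def infini_memory_step(M, z, Q, K, V, eps=1e-8):
    """One segment of compressive-memory retrieval and update.
    M: (d, d) memory matrix, z: (d,) normalizer,
    Q, K, V: (seq, d) projections for the current segment."""
    sigma = lambda x: np.where(x > 0, x + 1.0, np.exp(x))  # ELU(x) + 1
    # Retrieve from memory written by earlier segments.
    retrieved = (sigma(Q) @ M) / ((sigma(Q) @ z) + eps)[:, None]
    # Write this segment's key-value associations into memory.
    M_new = M + sigma(K).T @ V
    z_new = z + sigma(K).sum(axis=0)
    return retrieved, M_new, z_new
```

Because M and z are fixed-size regardless of how many segments have been processed, the context the memory summarizes can grow without bound at constant cost per step.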
anton (@abacaj):

anyone have success with really long context (gemini 1M tokens)? I think just throwing in extra context without really improving model performance isn't very useful. LLMs are known to be thrown off by more text; the more context you have, the higher the chance you'll put in irrelevant…

Bob (@futuristfrog):

Here is how I solved Taelin's A::B Challenge for 10k

twitter.com/VictorTaelin/s…

1. I referenced kenshin9000's prompt as a starting point
platform.openai.com/playground/p/O…

2. The first thing I tried was swapping the # for tags, e.g. A# to <A, so it can form pairs like <A A> (Hopefully…
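For reference, the A::B rewrite system the challenge asks the model to execute can be solved directly in a few lines (rules as stated in Taelin's challenge: equal-facing symbols annihilate, different ones swap past each other; the function name is mine):

```python
# Rewrite rules: applied to any adjacent pair X# #Y.
RULES = {
    ("A#", "#A"): [],
    ("B#", "#B"): [],
    ("A#", "#B"): ["#B", "A#"],
    ("B#", "#A"): ["#A", "B#"],
}

def reduce_ab(tokens):
    """Apply rewrite rules until no adjacent pair matches."""
    tokens = list(tokens)
    changed = True
    while changed:
        changed = False
        for i in range(len(tokens) - 1):
            pair = (tokens[i], tokens[i + 1])
            if pair in RULES:
                tokens[i:i + 2] = RULES[pair]
                changed = True
                break
    return tokens

reduce_ab("B# A# #B #A B#".split())  # -> ["B#"]
```

The challenge, of course, was to get an LLM to perform this reduction reliably via prompting alone, not to write the solver.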
