changhiskhan(@changhiskhan) 's Twitter Profileg
changhiskhan

@changhiskhan

Working on LanceDB, serverless OSS vector search for multimodal data. Formerly TubiTV/Cloudera/Pandas. @lancedb

ID:48747910

linkhttps://github.com/lancedb/lancedb calendar_today19-06-2009 16:17:47

2,6K Tweets

1,7K Followers

1,0K Following

LanceDB(@lancedb) 's Twitter Profile Photo

Existing closed-source API-based LLM developer tools are opaque and the developers deserve transparency, hackability, and control over their tools.

Learn how Continue's open-source approach, powered by LanceDB, is changing the game for LLM-powered dev tools ⚒️🚀

Read the

Existing closed-source API-based LLM developer tools are opaque and the developers deserve transparency, hackability, and control over their tools. Learn how @continuedev's open-source approach, powered by LanceDB, is changing the game for LLM-powered dev tools ⚒️🚀 Read the
account_circle
Prashant Dixit(@Prashant_Dixit0) 's Twitter Profile Photo

OpenAI launched GPT-4o last week with six benchmarks on which they have performed comparisons with GPT4 Turbo, Gemini Pro 1.5, and Claude-3 Opus.

Here I have Benchmarked GPT-4o on NIAN(Needle in a Needlestack)

Blog - blog.lancedb.com/benchmarking-g…
LanceDB
Other Benchmark👇

account_circle
Hamilton(@hamilton_os) 's Twitter Profile Photo

Doing ? Using Hugging Face ? We have some updates this week:

sf-hamilton==1.63.0:
* One Hugging Face data loader & two data savers [ , LanceDB]
* Example & blog post doing with LanceDB using data & models from Hugging Face

Doing #RAG? Using @huggingface ? We have some updates this week: sf-hamilton==1.63.0: * One @huggingface data loader & two data savers [#parquet, @lancedb] * Example & blog post doing #NER with @lancedb using data & models from @huggingface
account_circle
changhiskhan(@changhiskhan) 's Twitter Profile Photo

I’m very excited about the TypeScript embedding function registry being baked in LanceDB . In python, this feature abstract away embedding generation, just declare the function as part of the table schema! Comments/requests are welcome!

github.com/lancedb/lanced…

account_circle
changhiskhan(@changhiskhan) 's Twitter Profile Photo

For those of you asking: yes She is the correct spelling for my last name. Just remember: He is my pronoun. She is my proper noun.

account_circle
changhiskhan(@changhiskhan) 's Twitter Profile Photo

Building LanceDB with Lei Xu has been such a fun journey. It's about to get even awesome-r with our new funding round led by CRV. Huge thanks to our customers and our community for their trust!

blog.lancedb.com/new-funding-an…

account_circle
Andy Pavlo (@andy_pavlo@discuss.systems)(@andy_pavlo) 's Twitter Profile Photo

Key Findings:
• Dictionary encoding is effective for most data types.
• Simple encoding schemes are preferable.
• Don't use block compression (Snappy, zstd).
• Don't require deserializing all meta-data to read one column.
• GPUs struggle with Parquet/ORC.

Key Findings: • Dictionary encoding is effective for most data types. • Simple encoding schemes are preferable. • Don't use block compression (Snappy, zstd). • Don't require deserializing all meta-data to read one column. • GPUs struggle with Parquet/ORC.
account_circle
changhiskhan(@changhiskhan) 's Twitter Profile Photo

Great post from Weston Pace to continue the discussion from x.com/criccomini/sta…

Fun read if you like nerding out about data formats / performance: blog.lancedb.com/file-readers-i…

account_circle
changhiskhan(@changhiskhan) 's Twitter Profile Photo

People have been trying to create a super app a la WeChat in the US market for years. Seems like OpenAI will become that super app but with way more capabilities?

account_circle