Just wrote a new blog post: how I spent a year making the world's fastest Parquet loader in JavaScript, and all the optimizations that went into it.
TLDR: Hyparquet can load a parquet file from S3 in 155ms. While duckdb-wasm took 3466ms the same file!
Shots fired at the @ApacheIceberg Meetup:
"What I see is a community that's gotten way too complacent with complexity. I think there's a way you can do this a LOT more simply."
...so I ported Iceberg to JavaScript.
Announcing the Hyperparam OSS Universe!
We wrote a suite of open-source libraries for working with large AI Datasets (eg- parquet files) entirely in JavaScript.
This is the way to build faster, more interactive, and more scalable data applications in the browser.
Announcing the Hyperparam OSS Universe!
We wrote a suite of open-source libraries for working with large AI Datasets (eg- parquet files) entirely in JavaScript.
This is the way to build faster, more interactive, and more scalable data applications in the browser.
duckdb is great! but a couple points for hyparquet:
- duckdb wasm is ~40mb (often larger than the data being loaded)
- bundling wasm is a pain
- hyparquet is tiny (10kb) and easy to deploy
- if you want to minimize time-to-displayed-data in the browser, hyparquet usually wins.
duckdb is great! but a couple points for hyparquet:
- duckdb wasm is ~40mb (often larger than the data being loaded)
- bundling wasm is a pain
- hyparquet is tiny (10kb) and easy to deploy
- if you want to minimize time-to-displayed-data in the browser, hyparquet usually wins.
OpenAI: A Lesson in How NOT to Write a Parquet File
This week @OpenAI released new datasets on @huggingface -- great news for open data! But their specific parquet files makes me sad...
They released a 300mb parquet file that contains 400 rows in a single parquet rowgroup. When…
OpenAI: A Lesson in How NOT to Write a Parquet File
This week @OpenAI released new datasets on @huggingface -- great news for open data! But their specific parquet files makes me sad...
They released a 300mb parquet file that contains 400 rows in a single parquet rowgroup. When…
Just found this very cool project OpenTimes where you pick a starting point, and it gives you travel time to all points around you.
Uses hyparquet to efficiently fetch geospatial data and visualize it. I absolutely love this kind of browser engineering!
10K Followers 2K FollowingEx Founder with 2 exits | Previously Led Product Design, Visual, and Brand @ Agora, High Circle, Copper, PayPal, DraftKings + more.
175 Followers 4K FollowingJunior@Nankai University | Major in CS | Research in CV, GenAI | Full Stack Developer | Beginner in Crypto | Runner, Cyclist, Gym-goer | Rap enthusiast
5K Followers 812 FollowingDev Advocate at @Fused_io
Interested in maps, satellite images & the people building all of it
Youtube: https://t.co/k2JTlXKSp6
Podcast : @MindsBehindMaps
38K Followers 5K FollowingViews of a Transhuman neo-Buddhist from the future on sociology, artificial intelligence, mathematics, philosophy, neonoir film, and the post-singularity era.
384 Followers 381 FollowingPhD student working on bioinformatics, #WebGL-powered cancer genome visualization, and tumor evolution at @HautaniemiLab @helsinkiuni | @[email protected]
5 Followers 460 FollowingAngel Investors Community has a 1-month free trial for new members. Welcome to join and verify our trading strategy
https://t.co/O9PCuzZQg8
272 Followers 2K Followingtravel, yoga exercise🧘♀️, golf🏌️♀️, shopping, reading📖, looking at food, I like watching anime , etc....👍
And I have an open attitude towards everyone
64K Followers 1K FollowingCo-founder and CTO at @huggingface 🤗. ML/AI for everyone, building products to propel communities fwd. @Stanford + @Polytechnique
139 Followers 2K FollowingStudying requires concentration and concentration. Even if you have a high IQ and a good teacher, if you study half-heartedly, you will get nothing.
343 Followers 5K FollowingServo de Cristo, Doutor em Sistemas Computacionais (UFRJ) e Tecnologista em Computação (MB), atuando em projetos de P&DT na área de Defesa.
10K Followers 2K FollowingEx Founder with 2 exits | Previously Led Product Design, Visual, and Brand @ Agora, High Circle, Copper, PayPal, DraftKings + more.
365K Followers 8 FollowingVercel provides the developer tools and cloud infrastructure to build, scale, and secure a faster, more personalized web. Creators of @nextjs, @v0, and @aisdk.
396K Followers 50 FollowingTypeScript is a language for application-scale JavaScript development. It's a typed superset of JavaScript that compiles to plain JavaScript.
43K Followers 3K FollowingWe're in a race. It's not USA vs China but humans and AGIs vs ape power centralization.
@deepseek_ai stan #1, 2023–Deep Time
«C’est la guerre.» ®1
259K Followers 188 FollowingEurostat is the statistical office of the European Union. We provide high quality statistics and data on Europe.
Lawful good.
#AskEurostat
20K Followers 2 FollowingDuckDB is an analytical in-process SQL database management system. "DuckDB" and the DuckDB logo are registered trademarks of the DuckDB Foundation.
64K Followers 1K FollowingCo-founder and CTO at @huggingface 🤗. ML/AI for everyone, building products to propel communities fwd. @Stanford + @Polytechnique
83K Followers 631 FollowingLow-cost, high performance inference platform, powered by the Groq LPU. Delivering instant access to leading AI models with GroqCloud™.
92K Followers 207 FollowingLMArena: Open Platform for Community-driven AI Benchmarking. Graduated from UC Berkeley / @lmsysorg. We’re hiring: https://t.co/1OkfLq2n0I
242K Followers 21 FollowingWe’ll help you make it like nobody’s business. Multimodal media generation and editing tools to get your idea to production. Self-deploy? 👍 Need a partner? 🤝
50K Followers 5K FollowingCofounder and Head of Post Training @NousResearch, prev @StabilityAI
Github: https://t.co/LZwHTUFwPq
HuggingFace: https://t.co/sN2FFU8PVE
163K Followers 325 FollowingCEO of @abacusai, the world’s first AI super assistant and general-purpose agent, DeepAgent, for enterprises and professionals. ex-GM, AWS and Google
94K Followers 8K Followingdespite all my ragie I'm still just a wagie in a cagie
working on DL Software: https://t.co/FVn3NRNrLe
https://t.co/CgaoMfhUHd
637K Followers 35 FollowingWe're an AI safety and research company that builds reliable, interpretable, and steerable AI systems. Talk to our AI assistant @claudeai on https://t.co/FhDI3KQh0n.