Jan Lukavský @janl_apache
Data Engineer, @ApacheBeam committer and PMC member, open source enthusiast, IoT, author of 'Building Big Data Pipelines with Apache Beam'. #streamingdata Joined December 2021
Tweets 387
Followers 126
Following 210
Likes 779
An idea on the Fermi paradox: When a civilization reaches the phase of exponential (technological) growth, it quickly loses the ability to understand the world, starts voting for populists and decays. Are we living it?
A small number of people are posting text online that’s intended for direct consumption not by humans, but by LLMs (large language models). I find this a fascinating trend, particularly when writers are incentivized to help LLM providers better serve their users! People who post…
Can #streamprocessing supersede the performance of relational databases? Yes, it can! ♥️
Great, the @ApacheCon EU videos are out. ♥️ Here is one where I tried to explain how to use #streamprocessing to get ACID transactions out of virtually any eventually consistent database. I'd love to hear any comments or ideas on this! youtube.com/watch?v=UPSLVc…
Cruise scales autonomous driving with Apache Beam, managing petabytes of data monthly! 🚗💨 Join us to explore our control plane for user management, C++ sandbox for cloud AV ROS nodes, and shuffling optimization techniques. #BeamSummit #ApacheBeam beamsummit.org
Another paper pointing out in detail what we've known for a while: LLMs (used via prompting) cannot make sense of situations that substantially differ from the situations found in their training data. Which is to say, LLMs do not possess general intelligence to any meaningful…
My second book is published! “Streaming Databases” Unifying Batch and Stream Processing. Putting the database back together, @martinkl . #streamingprocessing learning.oreilly.com/library/view/-…
All set! My bus to Bratislava is only 90 minutes late, time for some final polishing of the slides for Tuesday, but I'm already excited to be part of @ApacheCon EU! Who am I gonna meet there? 🙋‍♂️🙂
Kafka is not a database. Kafka is a commit log. Databases use commit logs to be able to reconstruct themselves to a consistent state => Kafka can be used to reconstruct a database. In the limit case, Kafka can serve as a database. But it is not a database. 🫠
OK, this is funny. 😄 But current "AI" is merely a function. A complex one, right. But how can 'sin(x)' be "safe" or "unsafe"? It is about how people use it. And we should already have laws for that. 🤷‍♂️
We're back! Sorry for the interruption. It turns out they don't let 10 year olds have X accounts. But we hope you'll all celebrate our 10th birthday with us! 🎂 kubertenes.cncf.io
Oh my. Sounds pretty much like duck typing to me. Please let's not turn Java into Python. 🙏 youtube.com/watch?v=_afECX…
I really think people *should* pick the right tools. But what do I personally think would be the most beneficial? If one could make this decision at runtime, not at implementation time.
While people are designing systems for 4 9s, there are still companies voluntarily shutting down their systems for two consecutive days. Fascinating. @Ceskasporitelna
Here is an idea. What we call 'consciousness' can be rephrased as the 'inability to go back in time'. Every input to a living brain changes it forever. As opposed to 'functions', which return the same values for the same inputs. Always. Wrong? @ylecun
I just got confirmation that my session "Enhancing Flexibility and Productivity with Access Patterns and Storage-Agnostic Abstractions" has been accepted for @ApacheCon EU 2024. Thanks guys, I'm absolutely excited to be part of this great event!
This is perhaps one of the most important charts on AI for 2024. It was built by the amazing research team at @CathieDWood’s @ARKInvest. We can see that open source local models are on the path to overtake massive (and expensive) cloud-based closed models.

Riley @66RQd5h4P18uU4
29 Followers 1K Following
@decode @i_mranjan
3 Followers 145 Following journaling the backend of things // pipes, logs, latency. all of it. tgt: 2025
代孕三代试管VX... @Jessicacarina02
2 Followers 123 Following Coffee lover, travel enthusiast, and always eager to learn something new. Living for simple moments and meaningful stories
Chase Annes @chase_carder01
20 Followers 213 Following single bani ☺️am loyal and honest as well in for anything serious and fun❤️
Anneli Nieminen @AnneliNiem86177
0 Followers 22 Following
Wang Bob @WangBob14
15 Followers 291 Following Father, Database developer, Indie hacker, toB SaaS, Football, Kickboxer
petr sadek @petrsad
3 Followers 109 Following
navarro @adnmercantil
126 Followers 1K Following
Charlie Smith @xtxsn85928297
74 Followers 601 Following Macro trends in blockchain adoption worldwide.
Jane Sims @qllps67289959
13 Followers 507 Following Empower Yourself, Elevate Your World: Courage in Action, Wisdom in Every Step
Juliette Lam @cfuoo31952399
15 Followers 124 Following Fundamental and technical insights on Web3 startups.
Mary Chan @ufcne22364149
9 Followers 76 Following Decoding the intricacies of the blockchain-based economy.
Datavin3 @datavin3
15 Followers 123 Following Welcome to Datavin3, your go-to destination for Apache Hop resources and tutorials.
kaibo @KaiboZhou
27 Followers 294 Following
Michael Benjamin @_m_benjamin
62 Followers 451 Following Heart in Hawaii, Freezing in Boston. Real-time and streaming / live data expert, especially in fintech
Ralph Matthias Debusm... @RalphMDebusmann
73 Followers 191 Following Former NLP researcher and SAP and Bosch software engineer, former CTO of https://t.co/Y2xJ9jVHsu, now Technologist/Lead Enterprise Kafka Engineer at Migros.
shuang @sandy87418
9 Followers 100 Following
Frank Ren @renq654321
39 Followers 298 Following freelance data engineer. previously @BytedanceTalk , @meituan
Ben Holfeld @BenHolfeld
96K Followers 45K Following Immortal/German/SF/CA/USA/QuantumPhysics/Longevity/AI/Robotics/SF AI Studio/Building with OpenAI/MS/xAI/Google/Nvidia e/acc /dd Disclaimer below.
lurk @lurk70824490
9 Followers 98 Following
FlowersForSmith @Technopublic24
112 Followers 3K Following Lead Chris's Odyssey to a liberal future. Like reading. Try to be a writer. Name is from the book "Flowers for Algernon" and Smith is Agent Smith in the Matrix.
Bruno Volpato @bruflow
12K Followers 665 Following Senior Software Engineer at @datadoghq working on cool query stuff. Husband.
Suman Shil @sumanshil
34 Followers 1K Following
Dazey @1nfoverload
254 Followers 1K Following Computing, Writing, Traveling, Language Learning https://t.co/JVhMiw85yE
Zdena Tyson @ZdenekTison
54 Followers 124 Following Software Engineer with expertise in Streaming, AI, and Cloud Computing. Passionate about tech and innovation, with interests as a hobby investor, biker and DJ.
Benjamin Buick @BenjaminBuick
250 Followers 137 Following King of Streams. Pro humanitate, ad astra.
zeus_simon @zeus_mayu
9 Followers 142 Following
Vivek Chandela @vvkcnd
11 Followers 364 Following Software Engineer fascinated by storage and streaming systems.
martin kouřil @xkouril
2 Followers 197 Following
kang @dreamworldcn
7 Followers 113 Following
skddmk @skddmk
17 Followers 1K Following
Love Angel @SetiawanAlmando
441 Followers 2K Following Engaged in the new energy wind power industry, personal team, have their own thinking, optimistic, cheerful, talkative, I hope we can learn from each other.
नितिन त... @ntripathi
391 Followers 4K Following
Marek Simunek @MarekSimunek1
1 Followers 10 Following
Štěpán Svoboda @StepanSvo
137 Followers 670 Following Data Scientist, Econometrics @UvA_Amsterdam Also urbanism, 🇪🇺
Talat UYARER @talatuyarer
362 Followers 802 Following Software Engineer, Member of @TheASF, Works @GoogleCloud. Tweets are MY OWN personal opinions and do not represent my employer and organizations I belong to.
Luke Pham @REMEMBER_70411
54 Followers 4K Following
MavenCode @mavencode
354 Followers 2K Following AI, ML, Distributed Programming, Scala, Python, Go-lang Consulting Company
Ismail Chaida 👨... @Ismail_CHAIDA
383 Followers 5K Following Software & Data/Kotlin/Scala Engineer | Views are my own
DeepSeek @deepseek_ai
973K Followers 0 Following Unravel the mystery of AGI with curiosity. Answer the essential question with long-termism.
Demis Hassabis @demishassabis
488K Followers 146 Following Nobel Laureate. Co-Founder & CEO @GoogleDeepMind - working on AGI. Solving disease @IsomorphicLabs. Trying to understand the fundamental nature of reality.
Asim Hussain @jawache
13K Followers 188 Following #Entheogen Advocate 🍄🌿 Executive Director - @gsfcommunity 🎙️ https://t.co/zEMrzDpqcN 📝 https://t.co/ajzoC83a4X 💚 https://t.co/LFqOSg69G2
Nitish ⚡️ @nitishmutha
4K Followers 347 Following Co-founder and CTO @GenieAI - Building the world’s best AI Legal Drafter. @UCL alum.
Alexis Conneau @alex_conneau
35K Followers 189 Following Co-founder and CEO https://t.co/efv72CKpAG (@WaveFormsAI) - Ex @OpenAI GPT-4o/AVM Audio Research Lead - #Her #TARS - Ex @AIatMeta, @Polytechnique (X11)
Gunnar Morling 🌍 @gunnarmorling
64K Followers 290 Following Technologist @Confluentinc · Ex-lead of Debezium · Spec lead of Bean Validation 2.0 · Creator of JfrUnit, kcctl and MapStruct · Java Champion · 🚴
Yi Tay @YiTayML
46K Followers 81 Following research scientist @googledeepmind ✨♊, model co-lead/captain of gemini deepthink imo gold medal 🥇, opinions are my own.
Jason Wei @_jasonwei
98K Followers 635 Following ai researcher @meta superintelligence labs, past: openai, google 🧠
Gradle @gradle
38K Followers 235 Following Release announcements, tips, and events focused on boosting developer productivity with Gradle Build Tool and Develocity.
Chris Albon @chrisalbon
89K Followers 3K Following Director of ML and Data Eng @Wikimedia Foundation. We host Wikipedia.
AK @_akhaliq
425K Followers 3K Following AI research paper tweets, ML @Gradio (acq. by @HuggingFace 🤗) dm for promo ,submit papers here: https://t.co/UzmYN5YmrQ
Yann LeCun @ylecun
949K Followers 764 Following Professor at NYU. Chief AI Scientist at Meta. Researcher in AI, Machine Learning, Robotics, etc. ACM Turing Award Laureate.
Andrej Karpathy @karpathy
1.4M Followers 1K Following Building @EurekaLabsAI. Previously Director of AI @ Tesla, founding team @ OpenAI, CS231n/PhD @ Stanford. I like to train large deep neural nets.
Richard Socher @RichardSocher
112K Followers 1K Following CEO @youdotcom MP @aixventuresHQ Before: Stanford Adj Prof in AI/NLP, Chief Scientist at Salesforce, MetaMind
elvis @omarsar0
263K Followers 670 Following Building with AI agents @dair_ai • Prev: Meta AI, Galactica LLM, Elastic, PaperswithCode, PhD • I share insights on how to build with AI Agents ↓
(((ل()(ل() 'yoav)))... @yoavgo
65K Followers 2K Following
Delip Rao e/σ @deliprao
61K Followers 5K Following Busy inventing the shipwreck. @Penn. Past: @johnshopkins, @UCSC, @Amazon, @Twitter ||Art: #NLProc, Vision, Speech, #DeepLearning || Life: 道元, improv, running 🌈
Radim Řehůřek @RadimRehurek
6K Followers 526 Following Founder and CTO @PII_tools. Applied #ML, #NLP, #IR. 💘 history and beginnings. PhD in AI. Creator @gensim_py. Life & travel in East Asia.
Jiří Materna @JiriMaterna
1K Followers 414 Following Machine Learning Freelance, organizer of @MLPrague and @mlcollegecom, former head of research at https://t.co/BJV0z1s7im. Interested in ML, NLP, IR and DL. GA pilot.
Csaba Kincses 👨... @kincsescsaba
5K Followers 1K Following Autodidact Scala programmer, former dev @MorganStanley Follow me for 𝝺 everything #Scala 💻 #softwaredevelopment & tech commentary 💡 future of programmers
Keith Gaputis @keithgaputis
124 Followers 692 Following Data Engineering Field CTO at @SnowflakeDB. Generally interested in new tech, distributed systems, and software/data architecture.
Robin Moffatt 🍻... @rmoff
10K Followers 651 Following Doing fun stuff with data and open source. 🌐 https://t.co/WparjfnauD / 🦋 @rmoff.net #dataBS
Martin Kirschner 🔎 @martin_kirschne
606 Followers 181 Following I like big data, machine learning, and computational linguistics. At https://t.co/7EjyhYvvaW I am responsible for web search, search in maps and businesses, and large language models.
hubert dulay @hkdulay
803 Followers 2K Following “Streaming Data Mesh” & “Streaming Databases” OReilly Author @startreedata ex-@confluentinc ex-@cloudera #datamesh #clusterheadaches 🇵🇭🎸🥁🎹🤘
Decodable @Decodableco
3K Followers 2K Following Decodable is a serverless real-time data platform built on #ApacheFlink. No clusters to set up. No code to write. No PhD required.
Immerok @immerokcom
1K Followers 1 Following 🤝We’re now part of the Confluent family 🐿Follow @ConfluentInc for updates on how we’re bringing a cloud-native fully managed @apacheflink service to Confluent
hachyderm.io/austince @austin_space_ce
249 Followers 899 Following Engineer @confluentinc. Prev. @immerokcom. Takes large bites. Please invite me to your fediverse. @austince.bsky.social
Capensis @capensis_sas
5K Followers 2K Following 🐧 Expert in #Linux systems and #OpenSource infrastructure solutions 🚨 Publisher of @canopsis: an open source #hypervision / #observabilité solution
Benjamin Wootton @BenjaminWootton
1K Followers 2K Following Freelance Consultant - Real Time Analytics With ClickHouse