Jiri Simsa @jsimsa
Working on data processing and analysis infrastructure for ML @ Google. California, USA Joined September 2015-
Tweets145
-
Followers164
-
Following0
-
Likes143
Today at @MLSysConf, @MichaelKuchnik will present Plumber, our tool for diagnosing and removing performance bottlenecks in ML input data pipelines. Joint work with @jsimsa, @GeorgeAmvrosia2 and Virginia Smith. Paper: proceedings.mlsys.org/paper/2022/fil…
If you are interested in advancing infrastructure that provides large scale data analysis and processing for ML workloads across Google, my team is hiring: linkedin.com/jobs/view/2905…
Our VLDB’21 talk about tf.data, a ML data processing framework, is now online: youtu.be/VsOvy3eGK8Y More details in our paper: vldb.org/pvldb/vol14/p2… It has been great to collaborate on this work with @mrry @jsimsa & Ihor Indyk!
Five years ago, we open sourced @TensorFlow, our machine learning framework that's now the most popular machine learning library in the world. 🌎 To celebrate, we’re sharing few interactive demos and tutorials you can try, no experience required → goo.gle/3nz22Xh
In 2016, when I was working on machine translation, it took me more than a week on a multi-GPU machine to train a competitive system on WMT English-German. Today, JAX on a TPU v3 supercomputer can train a better model on the same data in 16 seconds! cloud.google.com/blog/products/…
👉 tf.data supports *any* machine learning framework (JAX, @TensorFlow, PyTorch, more!), and is a great way to speed up your data input pipelines. Be sure to try out our new features for tf.data, available in TF 2.3: github.com/tensorflow/ten…
👉 tf.data supports *any* machine learning framework (JAX, @TensorFlow, PyTorch, more!), and is a great way to speed up your data input pipelines. Be sure to try out our new features for tf.data, available in TF 2.3: github.com/tensorflow/ten…
🔍Inside TensorFlow: tf.data + tf.distribute In this presentation, Jiri Simsa showcases best practices. You’ll learn about the input pipeline, parallel extraction, distributed training, and more. Watch here → goo.gle/2wYGEG7
If your dataset is small, use an in-memory cache: ds = ds.cache() If large, create an on-disk cache: ds = ds.cache("my_file") Afterwards, you can call ds.batch() and ds.shuffle() as always. Complete example: tensorflow.org/tutorials/load…
Speaker spotlight - @jsimsa, tech lead of the tf.data project & software engineer at Google, to present on tf.data the recommended API for creating #TensorFlow input pipelines @ #DataOrchestrationSummit. RSVP: lnkd.in/d-M6cRz #opensource
Presented tf.data and tf.distribute at #GoogleMLSummit in Tokyo! Stay tuned for a recording.
Google Developers ML Summit , @JeffDean の基調講演!#GoogleMLSummit
Thank you for the kind words!
Not only are TPUs fast for doing machine learning, but they are also more energy efficient than alternative platforms, so you can feel great as you train that language model on scientific articles about climate change.
Not only are TPUs fast for doing machine learning, but they are also more energy efficient than alternative platforms, so you can feel great as you train that language model on scientific articles about climate change.
Today in #CloudTPU announcements: (1) @TensorFlow 1.8 now available with a slew of perf improvements (2.7k to 3.2k images/sec on ResNet-50, aka 12.5 hours is now 9 hours to fully train), and (2) we have opened up a new zone (us-central1-b) for HA & load balancing.
Our latest DAWNBench results are live: 8h52m for @TensorFlow to train ResNet-50 on ImageNet on a single @GCPcloud TPU (<$60), and just 30 minutes on half a TPU pod! dawn.cs.stanford.edu/benchmark/
We just posted new DAWNBench results for ImageNet classification training time and cost using Google Cloud TPUs+AmoebaNet (architecture learned via evolutionary search). You can train a model to 93% top-5 accuracy in <7.5 hours for <$50. Results: dawn.cs.stanford.edu/benchmark/
Cloud TPUs (now in !!open!! beta) are a leap forward in price & performance for Machine Learning. (See dawn.cs.stanford.edu/benchmark/ for end-to-end benchmarks.) Spin one up at console.cloud.google.com/compute/tpus today!
If you want to find out more about tf.data performance after my talk at #TFDevSummit, check out this awesome guide by @jsimsa and @bsaeta!
If you want to find out more about tf.data performance after my talk at #TFDevSummit, check out this awesome guide by @jsimsa and @bsaeta!
I'll be speaking about tf.data at 10am PDT. Hope you can tune in to the livestream! tensorflow.org/dev-summit/
I'll be speaking about tf.data at 10am PDT. Hope you can tune in to the livestream! tensorflow.org/dev-summit/

Jeff Dean @JeffDean
365K Followers 6K Following Chief Scientist, Google DeepMind & Google Research. Gemini Lead. Opinions stated here are my own, not those of Google. TensorFlow, MapReduce, Bigtable, ...
👩💻 Paige Bai... @DynamicWebPaige
69K Followers 2K Following ✨ AI should be about empowering humans, building understanding, and making dreams realities. 👩💻 DevX Eng. Lead @GoogleDeepMind ex-@GitHub || views = my own!
rohan anil @_arohan_
25K Followers 2K Following
Mihai Maruseac @mihaimaruseac
2K Followers 2K Following Supply chain security @ Google OSS Security Team. Previously TensorFlow Security & OSS (@ Google); Haskell+differential privacy+ML @ LeapYear. Views my own
Moloch's righthand gu... @AdraHaeman
245 Followers 2K Following Enjoying god's beatiful creation, one woman at a time
Hormoz Zarnani @hzarnani
38 Followers 189 Following
Yu Liu @yuliu15727336
50 Followers 304 Following
Maximilian Böther @MaxiBoether
334 Followers 1K Following Ph.D. student @ETH_EN @SystemsGroupETH @anaklimovic, working on ML pipelines on growing datasets, previously student @HPI_DE and student researcher @google
Michael Kuchnik @MichaelKuchnik
30 Followers 228 Following
Pier Carlo Cadoppi @vaipier
321 Followers 953 Following Software Engineer @amazon, founded @UnivEMS_ association while in university in Parma, Italy. The world is beautiful: it’s worth fighting for!
Tomek @svg_pl
108 Followers 1K Following
AN @trailinga
52 Followers 5K Following shankaro shankarah sakshat | vyaso narayano hari | ubhayor madhya vivadhe | kim karothi kinkaramyaham
Engr Uka's student. @auther2000
155 Followers 453 Following sweat equity investor • Industry 4.0 enthusiast
Taylor Robie @rdxhmx
1 Followers 24 Following
Dr. Smarty Pants @DrSmartyPants44
151 Followers 5K Following Recovering astrophysicist, now a deep-learner
DrSmartyPants46 @DrSmartyPants46
102 Followers 5K Following Adventurer. Explorer. Seeker of ancient mysteries and hidden treasures. Join me on thrilling expeditions as we uncover the secrets of the world.
DrSmartyPants49 @DrSmartyPants49
62 Followers 5K Following
Mr. Money Bags @drsmartypants42
160 Followers 5K Following Harnessing the twitter finance community to make money ;)
Subhadip Mitra @bassrehab
44 Followers 218 Following Use fewer bytes. Optimize your code. #SaveEarth. Currently @Google
Guozhen She @sgzhazelnut
582 Followers 4K Following https://t.co/JYMH0vTvoL Plumber@Snowflake Cyclist@PNW
Hai Son @sonhai
10 Followers 60 Following
Alexlexlex @uctptep
202 Followers 4K Following
txz @txz32829812
115 Followers 6K Following
eiko yoneki @eikoy
188 Followers 388 Following
Alexey Tumanov @alsched
548 Followers 281 Following Assistant Professor of Computer Science @gatech_scs @gtcomputing | postdoc @Berkeley_EECS @ucbrise | ML Systems
Swapnil Pimpale @swapnilpimpale
380 Followers 646 Following Distributed Systems @Apple, @SCSatCMU grad, Opinions are my own
Marc Romeyn @MarcRomeyn
839 Followers 4K Following Senior ML engineer @nvidia. Building open-source LLM tooling (NeMo & NeMo-Run). Deep Learning, ML-infra & Recsys. ex @spotify, @efLDN alum. 🇳🇱
Planet of the Cyborgs @WwDAdA
1K Followers 2K Following PMcLeod: Lateralist Practical Biz Edge w Tech HybridData/AI/ML AR/XR Ontologies Affect IoT Bots DecisionMgt (📯Singer Tenor Lieder Wagner 🐲RPGs 👨👧👧Family)
Curious Mix @curious_mix
180 Followers 1K Following Pro hardcoder, deadlocked optimizing reality w/ weights, data structs & chaotic race conditions.
Yunhe (Jack) Feng @yunhefengit
190 Followers 572 Following Assistant Professor @UNT | Postdoctoral Fellow @UW | PhD in Computer Science @UTKnoxville
Christoph Moser @CM_Greil
12 Followers 623 Following
Anuj Dutt @anujdutt92
226 Followers 1K Following GenAI @Adobe | Previously Edge AI @Jabra_US | Program Advisor @UCIrvine | Ex ML Engineer @VideaHealth | Ex AI Researcher @Bose | Mentor @TFUGChandigarh
Lettnem @Lettnem
329 Followers 3K Following
Srikanta Prasad @Srikanta_prasad
209 Followers 4K Following Machine Learning Engineer |Data Scientist | Ghost writer
some|body @CoderHermit
110 Followers 4K Following Favorite movie: 'The Man from Earth'. A book I wish to be written:'How to git gud at Starcraft and life in general' by Oriol V.
Tushar Jain @unilarity1
239 Followers 2K Following ML Researcher. previously @Amazon, @Verisk, @NYU. Founder @IronLabsAI
Albert Villanova @avillanovamoral
2K Followers 5K Following ML Engineer @huggingface. Data Scientist, PhD Theoretical Particle Physics, BSc Computer Science. Always learning. he/him
Nash Allen @allena5h
4 Followers 47 Following
CMD:\~ @_CarlosMD
378 Followers 2K Following Mobile developer, passionate for new technologies environments AR,IoT,AI and educational vídeo games. =)
Tejas Mahajan @tjdevWorks
143 Followers 598 Following Data at @MerQube | Prev: MS @nyuniversity Courant | @nyudatascience |
@cruzzarate @cruzzarate
459 Followers 5K Following