I write about data engineering | SQL | Python | Distributed systems. Get my free data engineering course at https://t.co/sZTEcV0Q9W (startdataengineering.com)
New York
Joined April 2020
It can be overwhelming to start learning data engineering. I'd recommend starting with the basics of Python, SQL, and UNIX commands, then building a simple data project and updating your GitHub and LinkedIn. Landing a DE job is 60% learning and 40% marketing. See reply 👇🏽 for helpful links.
Starting as a DE? 90% of what you will need
is SQL (OLAP), Python, & distributed system basics.
Don't overcomplicate!
#data #dataengineering
#SQL #database
#Python
Backfilling is an inevitable part of data projects. When designing your data pipelines, take some time to answer the following questions:
1. Do multiple backfill runs cause duplicate data?
2. Can multiple backfills be parallelized?
#data #dataEngineering #datapipeline #datasets
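One way to make the answer to question 1 a "no" is to make each backfill run idempotent: delete the slice being backfilled, then insert it again, in a single transaction. This is a minimal sketch, not the only approach. It uses Python's built-in sqlite3 as a stand-in for a warehouse, and the `events` table, its columns, and the `backfill_day` helper are hypothetical names for illustration.

```python
import sqlite3

def backfill_day(conn, day, rows):
    """Replace all rows for one day so that reruns do not duplicate data."""
    with conn:  # one transaction: commits on success, rolls back on error
        conn.execute("DELETE FROM events WHERE event_date = ?", (day,))
        conn.executemany(
            "INSERT INTO events (event_date, user_id, amount) VALUES (?, ?, ?)",
            [(day, user_id, amount) for user_id, amount in rows],
        )

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE events (event_date TEXT, user_id INTEGER, amount REAL)")

rows = [(1, 9.5), (2, 20.0)]  # hypothetical (user_id, amount) pairs
backfill_day(conn, "2024-01-01", rows)
backfill_day(conn, "2024-01-01", rows)  # rerun the same day

row_count = conn.execute("SELECT COUNT(*) FROM events").fetchone()[0]
print(row_count)  # 2, not 4
```

Because each run only touches its own day, runs for different days do not step on each other's rows, which is usually a precondition for parallelizing backfills (question 2).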
Learning data engineering? Build a pipeline locally.
1. Python to pull data from an API (e.g. Coincap)
2. Load data into a local Postgres container
3. Automate it with cron/task scheduler
Start small, build, improve, & repeat.
#data #dataengineering #Pythonlearning #Python
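Steps 1 and 2 above can be sketched roughly as follows. Several things here are assumptions: the API is assumed to return JSON shaped like {"data": [...]} with `id`, `symbol`, and `priceUsd` fields, the `assets` table is a made-up name, and sqlite3 stands in for Postgres so the sketch runs without a container (with a Postgres driver such as psycopg the placeholders would be %s instead of ?).

```python
import json
import sqlite3
import urllib.request

def fetch_assets(url):
    """Pull JSON from an API. Assumes the payload looks like {"data": [...]}."""
    with urllib.request.urlopen(url, timeout=10) as resp:
        return json.load(resp)["data"]

def load_assets(conn, assets):
    """Insert one row per asset; sqlite3 stands in for Postgres here."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS assets (id TEXT, symbol TEXT, price_usd REAL)"
    )
    with conn:  # commit on success, roll back on error
        conn.executemany(
            "INSERT INTO assets (id, symbol, price_usd) VALUES (?, ?, ?)",
            [(a["id"], a["symbol"], float(a["priceUsd"])) for a in assets],
        )

# Hypothetical sample shaped like the assumed payload, so no network is needed.
# In a real run this would be: sample = fetch_assets(<your API URL>)
sample = [{"id": "bitcoin", "symbol": "BTC", "priceUsd": "42000.5"}]

conn = sqlite3.connect(":memory:")
load_assets(conn, sample)
loaded = conn.execute("SELECT symbol, price_usd FROM assets").fetchall()
print(loaded)  # [('BTC', 42000.5)]
```

For step 3, the same script can be saved to a file and run on a schedule by cron or the task scheduler.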
Left anti-join is cool!
It returns every row from the left table that has no matching row in the right table:
select t1.* from t1 left join t2 on t1.id=t2.id where t2.id is null;
#data #dataengineering
#SQL #database
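To see the anti-join on real rows, here is a small sketch that runs the same query through Python's built-in sqlite3. The sample rows and the `name` column are made up for illustration; `t1` and `t2` match the query above.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE t1 (id INTEGER, name TEXT);
    CREATE TABLE t2 (id INTEGER);
    INSERT INTO t1 VALUES (1, 'a'), (2, 'b'), (3, 'c');
    INSERT INTO t2 VALUES (2);
""")

# Rows from t1 with no match in t2 get NULLs for t2's columns after the
# LEFT JOIN, so filtering on t2.id IS NULL keeps only the unmatched rows.
anti_join = """
    SELECT t1.* FROM t1
    LEFT JOIN t2 ON t1.id = t2.id
    WHERE t2.id IS NULL
    ORDER BY t1.id
"""
unmatched = conn.execute(anti_join).fetchall()
print(unmatched)  # [(1, 'a'), (3, 'c')]
```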
When the data to process is larger than memory, try streaming it with Python generators before jumping to distributed systems!
#data #dataengineering #Python #Pythonlearning #Generator
E.g., stream a file (note the () and not []) and get the difference between two date columns.
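The original example is not included here, so this is a stand-in sketch of the same idea rather than the author's code. The CSV contents and column names (`ordered_on`, `shipped_on`) are hypothetical, and io.StringIO takes the place of an open file so the snippet runs on its own.

```python
import csv
import io
from datetime import date

# Hypothetical CSV with two date columns; swap in open("your_file.csv").
raw = io.StringIO(
    "order_id,ordered_on,shipped_on\n"
    "1,2024-01-01,2024-01-04\n"
    "2,2024-01-10,2024-01-11\n"
)

rows = csv.DictReader(raw)  # reads lazily, one line at a time

# Parentheses build a generator: nothing is computed until it is iterated.
# Square brackets would build the whole list in memory at once.
days_to_ship = (
    (date.fromisoformat(r["shipped_on"]) - date.fromisoformat(r["ordered_on"])).days
    for r in rows
)

total = 0
count = 0
for d in days_to_ship:  # only one row is held in memory at a time
    total += d
    count += 1

print(total / count)  # average days between the two dates: 2.0
```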
uv by @astral_sh is truly one of the best tools you can have in your toolkit as a DE.
TIL: You can quickly start a jupyter notebook with it
doc: docs.astral.sh/uv/guides/inte…
Data engineers often write overly complex code to upsert into tables.
Here's THE command you need to know:
MERGE INTO / INSERT ... ON CONFLICT
#data #dataengineering
#SQL
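A minimal sketch of the INSERT ... ON CONFLICT form, run through Python's built-in sqlite3 (this needs SQLite 3.24 or newer, which is an assumption about your install). The `users` table and its columns are hypothetical. SQLite does not support MERGE INTO, so that variant is not shown; Postgres accepts the same ON CONFLICT syntax and added MERGE in version 15.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE users (id INTEGER PRIMARY KEY, email TEXT)")

# "excluded" refers to the row that was proposed for insertion.
upsert = """
    INSERT INTO users (id, email) VALUES (?, ?)
    ON CONFLICT (id) DO UPDATE SET email = excluded.email
"""
with conn:
    conn.execute(upsert, (1, "old@example.com"))  # no conflict: inserts
    conn.execute(upsert, (1, "new@example.com"))  # conflict on id: updates

users = conn.execute("SELECT id, email FROM users").fetchall()
print(users)  # [(1, 'new@example.com')]
```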