They're Right: You Have to Learn SQL!


Federico Trotta - from Medium

August the 5th

Discussing the need for Data Scientists to know SQL

Hey friend,

I know what's happening out there: you're reading everywhere on the Internet that you have to learn SQL if you want to become a Data Scientist.

Your LinkedIn feed is full of posts from gurus telling you that you must learn it!

And you are thinking something like: "Oh, man! Isn't Python enough?!".

or: "Oh, c'mon! Why do I have to learn it?! I can open tabular data! You just want to sell me your SQL course!!".

Well, I have good and bad news for you.

The bad news is that they're right: you need to know SQL if you want to become a Data Scientist.

The good news is that you mainly need the basics. And you can also learn it for free.

Let me explain.

The beautiful world where you're living now is structured like that:

  • You're getting some (very clean!) data in the CSV format somewhere. Maybe on Kaggle.
  • You open it and start analyzing the file.
  • You may end up making some predictions with ML.

Did I guess it right?!

Well, here's the problem: when you work with companies you have two options:

  1. They have a database.
  2. They don't have a database.

If the company you're working for has a database, then you have no option: you have to retrieve the data from it and you need to know how to query a relational database with SQL!

If they don't have a database, you know what happens? They'll throw you a plethora of files (not just one, like you're doing right now!) in CSV, Excel, and other imaginative formats (even in PDF, sometimes!!). You'll have to retrieve the data from them - spending a lot of time! -, order them somehow, and start your analyses.

It can even happen that you'll have a mixed situation: a part of the data you have to analyze is in a database, and the other part is in a lot of files in different formats.

But, please, don't get me wrong.

I don't want to scare you.

I just want to tell you that, yes: you need to know some SQL.

But just "some" of it because:

  1. Your main work should be analyzing data and making predictions (and if it's not like that, you may start thinking of changing company...).
  2. Querying a relational database will shorten your data manipulation tasks with Pandas. This is very good because, with some little SQL queries, you will gain incredible manipulation results in a matter of seconds, saving a lot of time with "too advanced" Pandas manipulation techniques.

"Ok, Federico: you convinced me! Where do I start?"

You can start for free with SQL BOLT here.

With SQL BOLT you'll learn the theory and make the right practice to learn the SQL you need (and more of it, if you want to deepen it).

But, believe me, you'll mainly need "SELECT FROM WHERE GROUPBY".

Don't know what does it mean? Learn SQL! ;)


RESOURCES FOR YOU

  • Learn Pandas the right way, with a hands-on approach. Download my FREE ebook here.

That's all for this week.

Have a lovely weekend,

Federico


Unsubscribe · Preferences


Copyright © 2023 Federico Trotta, All rights reserved.

Federico Trotta: p.zza A. De Gasperi, 13. 52044 Cortona, Italy.

You are receiving this email because you opted in via our website.

Federico Trotta

Read more from Federico Trotta

Federico Trotta August the 26th Let's discuss what you can do with Python in terms of a future job. Spoiler: you don't need to become a Data Scientist or a Software Engineer. Hey friend, yesterday I saw that Microsoft is starting to integrate Python into Excel. And you know what does it mean? Well, two things for me: If you are a Python professional, the last environment where you want to use Python is Excel. AI prompts will become more important to use Excel, rather than Googling. Especially...

Federico Trotta August the 19th I'm telling you why you don't need another Python course (and what to do, instead, step by step) Hey friend, I'm sure you are what I have been for quite some time: a certificate collector. You took your first Python course. Maybe on Udemy. Then, you took another one. And then another again. You feel you're not learning that much and you're missing a learning path, but you need to show you learned something to your next employer. Also, you really want to learn...

Federico Trotta - from Medium August the 12th I'm telling you why you want to learn Python Hey friend, I am an honest person. Because honesty pays. Always. There are three main reasons why you want to learn Python: 1) You want to get a high-paying job. 2) You want a remote job. 3) You want a job that lasts, no matter the crises. Let's be honest: you want at least one of the three. Right?! We all have experienced the pandemic and the lockdowns. Another pandemic returns? Who cares? I can work...