Preparing Raw Data for Big Data and Data Science

English | MP4 | AVC 1920×1080 | AAC 48KHz 2ch | 2 Hours | 1.10 GB

Preparing data for use in AI, machine learning, and other applications is often 80% of the work. In this third installment of our landmark “Mastering Data Science with SQL” series, learn how to use SQL to prepare massive datasets both locally and in the cloud, and see these principles put to work using a PostgreSQL database hosted in Microsoft Azure.

Table of Contents

00:00:00 Introduction
00:01:51 Preparing Data
00:16:13 Finding Data
00:29:31 Downloading Airline Data (Demo)
00:35:31 Processing Raw Data
00:42:56 Processing CSV Data with Python and Pandas (Demo)
01:04:37 Importing Data
01:10:51 Importing Data with the PostgreSQL COPY Command (Demo)
01:20:26 Exporting Data
01:27:02 Exporting Data with PostgreSQL (Demo)
01:31:07 PostgreSQL on Microsoft Azure
01:37:44 Importing a Large Dataset into PostgreSQL on Azure (Demo)
02:00:56 Conclusion