• Search...

categories: Technology & Innovation

Productivity
SDG 8: Decent Work and Economic Growth
SDG 9: Industry, Innovation and Infrastructure

Data Acquisition & Modeling

Duration: 2 h 50 m / 31 lessons

Level: General

Course Language: Arabic

By the end of this course, you will be able to

  • By the end of this level, you will be able to: Define different types of data attributes and different types of data sets, and understand the differences between different data quality problems including noise, missing values, and duplicates.

  • Describe various data preparation techniques including sampling, feature selection, estimation, and transformation of variables.

  • Get a concrete idea about the evolution of big data and the structure of the Hadoop ecosystem, and implement projects using Python that cover different aspects of the level.

Course details

  • 2 h 50 m/31 lessons
  • Last updated: 27/10/2022
  • Course completion certificate

Course Content

Free lessons

1.

Data Types and Quality Issues

4 Minutes
2.

Data 1

3 Minutes
3.

Data 2

3 Minutes
4.

Attributes

5 Minutes
1.

Data Types and Quality Issues

4 Minutes
2.

Data 1

3 Minutes
3.

Data 2

3 Minutes
4.

Attributes

5 Minutes
5.

Attribute Types (P.1)

7 Minutes
6.

Attribute Types (P.2)

4 Minutes
7.

Attribute Types (P.3)

5 Minutes
8.

Types of Datasets

7 Minutes
9.

Graph-Based Data

10 Minutes
10.

Data Quality

10 Minutes
11.

Missing Values

8 Minutes
12.

Duplicate Data

7 Minutes
13.

Summary

2 Minutes
14.

Data Preparation

7 Minutes
15.

Sampling

8 Minutes
16.

Feature Selection

5 Minutes
17.

Feature Selection Methods

6 Minutes
18.

Discretization ​​1

7 Minutes
19.

Discretization ​​2

2 Minutes
20.

Variable Transformation

6 Minutes
21.

Example of Variable Transformation

2 Minutes
22.

The Required Project

4 Minutes
23.

Big Data Deep Dive

4 Minutes
24.

Distributed Systems

6 Minutes
25.

Big Data Evolution (P.1)

5 Minutes
26.

Big Data Evolution (P.2)

4 Minutes
27.

Big Data Challenges

3 Minutes
28.

Hadoop Ecosystem (P.1)

4 Minutes
29.

Hadoop Ecosystem (P.2)

2 Minutes
30.

Data Acquisition and Pipelining

1 Minutes
31.

Questions

3 Minutes

About this course

This level comprehensively describes the various data attributes and types of data sets that a data scientist would usually encounter. The level also describes various data quality issues and how to deal with them. Various data preparation techniques are also covered. Finally, an introduction to big data and the Hadoop ecosystem is given.

Course requirements and prerequisites

- Graduate of any university (Engineering is not mandatory)

- Previous programming experience of any language is a big plus

- Knowledge of Linear algebra is a big plus

Mentor

Looking for help?