Post on 07-Jun-2020
NAYA College
71 HaNadiv St., Herzliya | Phone: 0732865544 | Fax: 0732865417 | www.naya-college.co.il
Python for Data Analysis
40 hours
Course Overview:
Python is a well-designed, intuitive, and powerful general-purpose programming language that is friendly to newcomers. Being general-purpose, it has many features that will not be relevant to everyone. In recent years Python has become an increasingly popular tool for data analysis, and you do not need to become a proficient Python programmer to analyze data with it. Still, you need to invest time in learning the fundamentals of the language before jumping into applications.
This course is designed to build a strong foundation in Python and its related packages for working with data. It covers the main tools for data analysts and other users with little or no serious programming experience who just want to get things done in data analysis.
The course covers basic and intermediate Python programming and the most important libraries for data analysis:
NumPy is the fundamental package for scientific computing with Python. A good understanding of NumPy will help you use tools like pandas effectively.
Matplotlib is a widely used package for scientific graphics. This part introduces the matplotlib objects and their basic functionality, and surveys the basic plot types.
Pandas is a package for data manipulation that offers a set of easy-to-use capabilities resembling SQL (on the data processing side) and Excel (on the visualization side).
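To give a feel for how these libraries fit together, here is a minimal sketch. The store names and sales figures are made up for illustration and are not course material; the sketch assumes NumPy and pandas are installed.

```python
import numpy as np
import pandas as pd

# Hypothetical sample data: three days of sales for two made-up stores.
sales = np.array([[120.0, 135.0, 150.0],
                  [80.0, 95.0, 110.0]])

# NumPy broadcasting: subtract each store's own mean from its row.
# keepdims=True keeps the means as a (2, 1) column so the shapes line up.
centered = sales - sales.mean(axis=1, keepdims=True)

# pandas: the same numbers as a labelled table, summarised SQL-style.
df = pd.DataFrame({
    "store": ["north"] * 3 + ["south"] * 3,
    "day": [1, 2, 3] * 2,
    "amount": sales.ravel(),
})
totals = df.groupby("store")["amount"].sum()

print(centered)
print(totals)
```

The `groupby` call plays roughly the role of `GROUP BY` in SQL, which is the resemblance the pandas description above refers to.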
In addition, the course introduces working with Python in a Big Data environment, including an introduction to Hadoop and Spark and to PySpark, the main library for this purpose.
Course Objectives:
To reach an intermediate level of Python programming for performing data analysis.
To use the NumPy library to create and manipulate arrays.
To create data visualizations with matplotlib in Python.
To use pandas to create and structure data.
To work with various data formats in Python, including JSON, HTML, and MS Excel worksheets.
Who Should Attend:
This course is intended for data analysts, BI experts, developers, and anyone who wants to perform data manipulation with Python.
Required Skills:
Basic math skills
Basic to intermediate Python skills are an advantage
Course Contents:
Basic Python
Fundamentals
o Intro
o Python essentials
o The working environment
Data types
o Numbers
o Strings
o Booleans
o None
Collections
o Lists
o Tuples
o Dictionaries
o Sets
Control flow
o if…else
o for…in
o list comprehension
o while
o continue & break
Textual interface
o input
o format
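As an illustration of the Basic Python topics listed above, the following short sketch combines a dictionary, `None`, a list comprehension, `for…in`, `if…else`, `continue`, and string formatting. The names and scores are hypothetical.

```python
# Hypothetical data: exam scores keyed by student name.
# None marks a student who has not taken the exam yet.
scores = {"Dana": 91, "Omer": 78, "Gil": None, "Noa": 85}

# List comprehension: names of students who scored 80 or more.
passed = [name for name, score in scores.items()
          if score is not None and score >= 80]

# for...in with continue, a conditional expression, and str.format.
lines = []
for name, score in scores.items():
    if score is None:
        continue  # skip students without a score
    label = "pass" if score >= 80 else "retry"
    lines.append("{:<5} {:>3} {}".format(name, score, label))

print(passed)
print("\n".join(lines))
```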
Intermediate Python
Functions
o User-defined functions
o *args and **kwargs
o Built-in functions
o Lambda expressions
Debugging and Error Handling
Text files
Object-Oriented Programming (OOP) Basics
The standard library
o import
o datetime
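The Intermediate Python topics above can be sketched in a few lines. The helper names `average` and `parse_date` and the sample records are hypothetical, chosen only to show `*args`/`**kwargs`, error handling, a lambda expression, and `datetime` working together.

```python
from datetime import datetime


def average(*values, **options):
    """Return the mean of values, rounded to options['digits'] (default 2)."""
    if not values:
        raise ValueError("average() needs at least one value")
    digits = options.get("digits", 2)
    return round(sum(values) / len(values), digits)


def parse_date(text):
    """Parse a YYYY-MM-DD string, returning None if it is malformed."""
    try:
        return datetime.strptime(text, "%Y-%m-%d")
    except ValueError:
        return None


# Hypothetical records: (date string, measurement).
records = [("2020-06-07", 3.5), ("not a date", 1.0), ("2020-01-15", 2.25)]

# Keep only the records whose date parses, then sort them by date.
valid = [(parse_date(d), v) for d, v in records if parse_date(d) is not None]
valid.sort(key=lambda pair: pair[0])  # lambda expression as the sort key

# Unpack the measurements into *values and pass digits as a keyword option.
mean_value = average(*[v for _, v in valid], digits=1)
print(valid[0][0].date(), mean_value)
```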
Python Tools for working with Data
The NumPy library
o Array
o Broadcasting
The matplotlib library
o matplotlib objects
o Plotting
o Seaborn
The pandas library
o Series and Index
o DataFrame
o GroupBy
o Visualizations
Use cases
General tools
o Regular expressions (re)
o JSON
o APIs and connecting with data resources
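For the general tools, a minimal sketch using only the standard library is shown below. The JSON payload is hypothetical and stands in for what a web API might return; the e-mail pattern is deliberately simple and is not a complete validator.

```python
import json
import re

# Hypothetical payload, shaped like a response from a web API.
payload = (
    '{"users": ['
    '{"name": "Dana", "email": "dana@example.com"}, '
    '{"name": "Omer", "email": "not-an-email"}'
    ']}'
)

# Parse the JSON text into ordinary Python dictionaries and lists.
data = json.loads(payload)

# A simplified pattern: something@something.something
email_pattern = re.compile(r"^[\w.+-]+@[\w-]+\.[\w.-]+$")

# Keep only the addresses that match the pattern.
valid_emails = [user["email"] for user in data["users"]
                if email_pattern.match(user["email"])]

# Serialize the result back to JSON text.
summary = json.dumps({"valid": valid_emails}, indent=2)
print(summary)
```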
Introduction to Python for big data – short overview
Introduction to big data
o Background
o Hadoop
o Spark
o PySpark
Working environment
o Databricks
PySpark and RDDs
PySpark SQL and Datasets