Data Analysis Using Python

This course introduces data analysis in Python, including how to clean, aggregate, describe, and visualize data.

Course Objectives

After completing this course, participants will be able to:

  • Analyze numerical, categorical, and time-series data in Python

  • Perform basic analysis in Python using pandas

  • Edit and create Excel worksheets in Python

  • Create functions to automate analysis

Module 1: Pandas Basics

  • Introduction to the Jupyter notebook

  • Python fundamentals

  • Introduction to pandas

  • Data sources and I/O

  • Interacting with Excel
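
As a rough illustration of the kind of exercise Module 1 covers, the sketch below reads a small CSV into a pandas DataFrame, cleans one column, and adds a derived column. The data, column names, and variable names are made up for this example and are not course material.

```python
import io

import pandas as pd

# Hypothetical sample data standing in for a CSV file on disk.
csv_text = """date,region,units,price
2024-01-05,North,10,2.50
2024-01-06,South,4,3.00
2024-01-07,North,,2.50
"""

# read_csv accepts a file path or any file-like object;
# parse_dates converts the named column to datetime values.
sales = pd.read_csv(io.StringIO(csv_text), parse_dates=["date"])

# Basic cleaning: treat the missing unit count as 0 and store as integers.
sales["units"] = sales["units"].fillna(0).astype(int)

# Derived column computed element-wise from two existing columns.
sales["revenue"] = sales["units"] * sales["price"]

print(sales.dtypes)
print(sales.head())
```

The same DataFrame could be exchanged with Excel through `pd.read_excel` and `DataFrame.to_excel`, which rely on an installed Excel engine such as openpyxl; that step is left out here so the sketch runs with pandas alone.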

Module 2: Aggregations, Visualizations, and Analysis

  • Data aggregation and partitions

  • Pandas for exploratory data analysis

  • Data visualization in Python

  • Introduction to regression analysis in Python
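
The Module 2 topics can be sketched in a few lines, assuming a small made-up dataset. The example groups rows by a categorical column, summarizes the numeric columns, and fits a one-variable linear regression with NumPy least squares. The column names are hypothetical, and the course may use a different regression library.

```python
import numpy as np
import pandas as pd

# Hypothetical data: advertising spend and resulting sales for two regions.
df = pd.DataFrame({
    "region": ["North", "North", "South", "South", "South"],
    "spend": [1.0, 2.0, 3.0, 4.0, 5.0],
    "sales": [3.0, 5.0, 7.0, 9.0, 11.0],
})

# Aggregation: partition rows by region, then compute per-group statistics.
summary = df.groupby("region")["sales"].agg(["count", "mean", "sum"])
print(summary)

# Exploratory summary of the numeric columns (count, mean, std, quartiles).
print(df[["spend", "sales"]].describe())

# Simple linear regression: sales = slope * spend + intercept,
# fitted by least squares with a degree-1 polynomial.
slope, intercept = np.polyfit(df["spend"].to_numpy(), df["sales"].to_numpy(), deg=1)
print(f"slope={slope:.2f}, intercept={intercept:.2f}")
```

For the visualization topic, `df.plot(x="spend", y="sales", kind="scatter")` would draw the same data, provided matplotlib is installed.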

Module 3: Case Study – Big Data

  • Analysis of a large dataset

  • Reading data into Python from TXT and CSV files, data manipulation, and basic descriptive statistics

  • Plots and graphs, for loops, and if statements

  • Identifying the purpose of specific lines of code

  • Practice applying material from Modules 1 and 2 using the “big data” set

  • Regression analysis using the “big data” set
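
One common way to handle a file too large to load at once is to read it in chunks. The sketch below combines that with a for loop and an if statement. It assumes a made-up in-memory dataset in place of the course's actual “big data” file, and the chunk size and column names are illustrative only.

```python
import io

import pandas as pd

# Hypothetical stand-in for a large CSV file: 1,000 rows built in memory.
rows = ["id,value"] + [f"{i},{i % 7}" for i in range(1, 1001)]
big_csv = io.StringIO("\n".join(rows))

total = 0
count = 0
n_chunks = 0

# With chunksize set, read_csv yields DataFrames of at most 250 rows each,
# so only one piece of the file is held in memory at a time.
for chunk in pd.read_csv(big_csv, chunksize=250):
    n_chunks += 1

    # Only drop rows if this chunk actually contains missing values.
    if chunk["value"].isna().any():
        chunk = chunk.dropna(subset=["value"])

    # Accumulate running totals instead of keeping every chunk.
    total += chunk["value"].sum()
    count += len(chunk)

mean_value = total / count
print(f"chunks={n_chunks}, rows={count}, mean={mean_value:.3f}")
```

Keeping only running totals means memory use depends on the chunk size rather than the file size, which is the main reason to prefer this pattern for larger datasets.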