subscriber_portfolio

Title: Codecademy Portfolio Project, Data Ingestion Pipeline.

To see the full codebase for this project: Link to my GitHub account

Description:

A project that automates the ingestion of subscriber cancellation data for an online learning company. The pipeline produces a tidy, analytics-ready CSV file and a SQLite database.
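As a rough illustration of the flow described above, the following is a minimal sketch and not the project's actual code. The column names (subscriber_id, cancel_date), the table name (cancellations), the output file names, and both function names are hypothetical, chosen only for the example. It assumes pandas is installed.

```python
import sqlite3
import tempfile
from contextlib import closing
from pathlib import Path

import pandas as pd


def clean_cancellations(raw: pd.DataFrame) -> pd.DataFrame:
    """Return a tidy copy of the raw data (hypothetical column names)."""
    # Drop exact duplicate rows and rows with no subscriber id.
    tidy = raw.drop_duplicates().dropna(subset=["subscriber_id"]).copy()
    # Ids arrive as floats when the column contained missing values.
    tidy["subscriber_id"] = tidy["subscriber_id"].astype(int)
    # Normalise dates to ISO strings so CSV and SQLite store the same text.
    tidy["cancel_date"] = pd.to_datetime(tidy["cancel_date"]).dt.strftime("%Y-%m-%d")
    return tidy.reset_index(drop=True)


def export(tidy: pd.DataFrame, csv_path: Path, db_path: Path,
           table: str = "cancellations") -> None:
    """Write the tidy data to a CSV file and to a SQLite table."""
    tidy.to_csv(csv_path, index=False)
    # closing() ensures the connection is closed, not just committed.
    with closing(sqlite3.connect(db_path)) as conn:
        tidy.to_sql(table, conn, if_exists="replace", index=False)


# Example run on a tiny, made-up data set.
raw = pd.DataFrame({
    "subscriber_id": [1, 1, 2, None],
    "cancel_date": ["2023-01-05", "2023-01-05", "2023-02-10", "2023-03-01"],
})
with tempfile.TemporaryDirectory() as out_dir:
    export(clean_cancellations(raw),
           Path(out_dir) / "cancellations.csv",
           Path(out_dir) / "cancellations.db")
```

Writing the same cleaned DataFrame to both targets keeps the CSV file and the database consistent with each other. Using if_exists="replace" makes reruns idempotent in this sketch. A real pipeline might append instead.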

Features:
Technologies:
  1. Python and various standard library modules.
  2. The pandas and NumPy third-party packages.
  3. SQLite databases.
  4. Data cleaning and tidying.
  5. Command line and Bash scripting.
Folder Structure:

Main Level: Includes the Python, testing, and Bash scripts, as well as the folders for the following:

Running the Bash Script:

The entire pipeline can be run by executing the following script from the command line:

bash_script_runall.sh

Collaborators:

Codecademy Portfolio Project: Lesson #13 of the Data Engineering Career Path.

License:

N/A.