CommonLounge Archive

Hands-on Assignment: Data Exploration with NumPy

June 21, 2018

In this hands-on assignment, we’ll apply the NumPy python library to explore a dataset. The dataset we’ll be using is a medical dataset with information about some patients on metrics like glucose, insulin levels, and other metrics related to diabetes. The assignment will serve two primary objectives - (a) practice NumPy on a realistic task, and (b) learn how to get a feel for a large dataset (also known as data cleaning and data exploration).

Dataset description

We’ll be using the following dataset: diabetes.csv. Open the file in your favorite text editor and have a look.

The following are the column names: Pregnancies, Glucose, BloodPressure, SkinThickness, Insulin, BMI, DiabetesPedigreeFunction, Age, Outcome.

Quiz time!

The quiz will guide you through the rest of the assignment. Make sure you finish each step before moving on to the next step. Let’s get started!

© 2016-2022. All rights reserved.