April 27, 2026

Inspecting Data in Pandas (head(), tail(), info(), describe())

Inspecting Data in Pandas for Pharmacy Students (head(), tail(), info(), describe())

Before analyzing any dataset, it is essential to inspect it. In pharmaceutical sciences, data inspection helps understand patient datasets, ADR reports, and pharmacokinetic (PK) data.


๐Ÿ”ท Why Inspect Data?

  • Understand dataset structure
  • Identify missing values
  • Get statistical insights
๐Ÿ’ก Key Insight: Data inspection is the first step before analysis.

๐Ÿ”ท head() Function

Displays first 5 rows of dataset.

import pandas as pd

df = pd.read_csv("adr_data.csv")
print(df.head())

๐Ÿ‘‰ Useful for quick preview of dataset.


๐Ÿ”ท tail() Function

Displays last 5 rows.

print(df.tail())

๐Ÿ”ท info() Function

Provides dataset summary.

df.info()
  • Number of rows
  • Column names
  • Data types
  • Missing values

๐Ÿ”ท describe() Function

Provides statistical summary.

df.describe()

Includes:

  • Mean
  • Standard deviation
  • Minimum and maximum values

๐Ÿ’Š Pharma Dataset Example

df = pd.read_csv("adr_data.csv")

print(df.head())
print(df.info())
print(df.describe())

๐Ÿ‘‰ Helps understand ADR patterns and dose statistics.


๐Ÿง  Memory Tricks

  • head() โ†’ Top rows
  • tail() โ†’ Bottom rows
  • info() โ†’ Structure
  • describe() โ†’ Statistics

๐Ÿงช Practice Exercise

Load dataset and:

  • Display first 3 rows
  • Check dataset info
  • Find average dose

๐Ÿงช Mini Project

Analyze PK dataset:

import pandas as pd

df = pd.read_csv("pk_data.csv")

print(df.head())
print(df.describe())

๐Ÿ“ MCQs

  1. head() shows:
    a) Last rows
    b) First rows
    c) Middle rows
    d) All rows
    Answer: b

  2. info() provides:
    a) Values
    b) Structure
    c) Graph
    d) Output
    Answer: b

  3. describe() shows:
    a) Names
    b) Statistics
    c) Rows
    d) Columns
    Answer: b

โ“ FAQs

Why inspect data before analysis?

To understand structure, errors, and missing values.

Which function shows statistics?

describe() is used for statistical summary.


๐Ÿ“ฅ Download Sample Datasets for Practice

Practice inspecting ADR and PK datasets.


โžก Next Topic: Data Cleaning & Handling Missing Values โ†’

Recommended readings

  1. Introduction to Pandas (Why it is used in Pharma Data Analysis)
  2. Pandas Series & DataFrame (with patient & PK datasets)
  3. Reading CSV & Excel Files (PK datasets, ADR reports)
  4. Inspecting Data (head(), tail(), info(), describe())
  5. Data Cleaning & Missing Values (real clinical dataset problems)
  6. Filtering & Selecting Data (high dose, ADR filtering)
  7. Grouping & Aggregation (mean dose, ADR frequency)

Question Bank Unit 4: Data Handling with Pandas

For detailed information: Basics of Python Programming for Pharmaceutical Sciences