Inspecting Data in Pandas for Pharmacy Students (head(), tail(), info(), describe())
Before analyzing any dataset, it is essential to inspect it. In pharmaceutical sciences, data inspection helps understand patient datasets, ADR reports, and pharmacokinetic (PK) data.
๐ท Why Inspect Data?
- Understand dataset structure
- Identify missing values
- Get statistical insights
๐ท head() Function
Displays first 5 rows of dataset.
import pandas as pd
df = pd.read_csv("adr_data.csv")
print(df.head())
๐ Useful for quick preview of dataset.
๐ท tail() Function
Displays last 5 rows.
print(df.tail())
๐ท info() Function
Provides dataset summary.
df.info()
- Number of rows
- Column names
- Data types
- Missing values
๐ท describe() Function
Provides statistical summary.
df.describe()
Includes:
- Mean
- Standard deviation
- Minimum and maximum values
๐ Pharma Dataset Example
df = pd.read_csv("adr_data.csv")
print(df.head())
print(df.info())
print(df.describe())
๐ Helps understand ADR patterns and dose statistics.
๐ง Memory Tricks
- head() โ Top rows
- tail() โ Bottom rows
- info() โ Structure
- describe() โ Statistics
๐งช Practice Exercise
Load dataset and:
- Display first 3 rows
- Check dataset info
- Find average dose
๐งช Mini Project
Analyze PK dataset:
import pandas as pd
df = pd.read_csv("pk_data.csv")
print(df.head())
print(df.describe())
๐ MCQs
- head() shows:
a) Last rows
b) First rows
c) Middle rows
d) All rows
Answer: b - info() provides:
a) Values
b) Structure
c) Graph
d) Output
Answer: b - describe() shows:
a) Names
b) Statistics
c) Rows
d) Columns
Answer: b
โ FAQs
Why inspect data before analysis?
To understand structure, errors, and missing values.
Which function shows statistics?
describe() is used for statistical summary.
๐ฅ Download Sample Datasets for Practice
Practice inspecting ADR and PK datasets.
โก Next Topic: Data Cleaning & Handling Missing Values โ
Recommended readings
- Introduction to Pandas (Why it is used in Pharma Data Analysis)
- Pandas Series & DataFrame (with patient & PK datasets)
- Reading CSV & Excel Files (PK datasets, ADR reports)
- Inspecting Data (head(), tail(), info(), describe())
- Data Cleaning & Missing Values (real clinical dataset problems)
- Filtering & Selecting Data (high dose, ADR filtering)
- Grouping & Aggregation (mean dose, ADR frequency)
Question Bank Unit 4: Data Handling with Pandas
For detailed information: Basics of Python Programming for Pharmaceutical Sciences