Filtering & Selecting Data in Pandas for Pharmacy Students (High-Risk Patients & ADR Analysis)
Filtering and selecting data are key operations in data analysis. In pharmaceutical sciences, these techniques help identify high-risk patients, severe ADR cases, and important clinical patterns.
๐ท What is Filtering?
Filtering means selecting rows based on specific conditions.
๐ก Key Insight: Filtering converts raw data into meaningful clinical insights.
๐ท Basic Filtering
import pandas as pd
df = pd.read_csv("clinical_data.csv")
# High dose patients
high_dose = df[df["Dose"] > 600]
print(high_dose)
๐ท Multiple Conditions
# High dose and elderly patients risk_patients = df[(df["Dose"] > 600) & (df["Age"] > 60)] print(risk_patients)
๐ ADR Filtering Example
# Severe ADR cases severe_cases = df[df["Reaction"] == "Severe"] print(severe_cases)
Used to identify serious adverse drug reactions.
๐ท Selecting Specific Columns
# Select only Drug and Dose print(df[["Drug", "Dose"]])
๐ท Using loc[] for Selection
# Select rows and columns print(df.loc[df["Dose"] > 600, ["Patient", "Dose"]])
๐ Clinical Risk Analysis
# Identify high-risk patients
risk = df[(df["Dose"] > 600) & (df["Reaction"] == "Severe")]
print("High-risk patients:")
print(risk)
๐ง Memory Tricks
- [] โ Filter rows
- & โ AND condition
- | โ OR condition
- loc[] โ Select data
๐งช Practice Exercise
Load dataset and:
- Find patients with dose > 500
- Filter severe ADR cases
- Select drug and reaction columns
๐งช Mini Project
Create a clinical filter system:
import pandas as pd
df = pd.read_csv("clinical_data.csv")
high_risk = df[(df["Age"] > 60) & (df["Dose"] > 600)]
print("High Risk Patients:")
print(high_risk)
๐ MCQs
- Filtering is used to:
a) Delete data
b) Select data
c) Print data
d) Store data
Answer: b - AND condition uses:
a) |
b) &
c) +
d) =
Answer: b - loc[] is used for:
a) File reading
b) Data selection
c) Looping
d) Printing
Answer: b
โ FAQs
Why is filtering important in pharmacy?
It helps identify high-risk patients and critical clinical conditions.
How to combine multiple conditions?
Use & (AND) or | (OR).
๐ฅ Download Clinical Dataset for Filtering Practice
Practice identifying high-risk patient groups.
โก Next Topic: Grouping & Aggregation โ
Recommended readings
- Introduction to Pandas (Why it is used in Pharma Data Analysis)
- Pandas Series & DataFrame (with patient & PK datasets)
- Reading CSV & Excel Files (PK datasets, ADR reports)
- Inspecting Data (head(), tail(), info(), describe())
- Data Cleaning & Missing Values (real clinical dataset problems)
- Filtering & Selecting Data (high dose, ADR filtering)
- Grouping & Aggregation (mean dose, ADR frequency)
Question Bank Unit 4: Data Handling with Pandas
For detailed information: Basics of Python Programming for Pharmaceutical Sciences