April 28, 2026

Filtering & Selecting Data in Pandas (High-Risk Patients & ADR Filtering)

Filtering & Selecting Data in Pandas for Pharmacy Students (High-Risk Patients & ADR Analysis)

Filtering and selecting data are key operations in data analysis. In pharmaceutical sciences, these techniques help identify high-risk patients, severe ADR cases, and important clinical patterns.


๐Ÿ”ท What is Filtering?

Filtering means selecting rows based on specific conditions.

๐Ÿ’ก Key Insight: Filtering converts raw data into meaningful clinical insights.

๐Ÿ”ท Basic Filtering

import pandas as pd

df = pd.read_csv("clinical_data.csv")

# High dose patients
high_dose = df[df["Dose"] > 600]
print(high_dose)

๐Ÿ”ท Multiple Conditions

# High dose and elderly patients
risk_patients = df[(df["Dose"] > 600) & (df["Age"] > 60)]
print(risk_patients)

๐Ÿ’Š ADR Filtering Example

# Severe ADR cases
severe_cases = df[df["Reaction"] == "Severe"]
print(severe_cases)

Used to identify serious adverse drug reactions.


๐Ÿ”ท Selecting Specific Columns

# Select only Drug and Dose
print(df[["Drug", "Dose"]])

๐Ÿ”ท Using loc[] for Selection

# Select rows and columns
print(df.loc[df["Dose"] > 600, ["Patient", "Dose"]])

๐Ÿ’Š Clinical Risk Analysis

# Identify high-risk patients
risk = df[(df["Dose"] > 600) & (df["Reaction"] == "Severe")]

print("High-risk patients:")
print(risk)

๐Ÿง  Memory Tricks

  • [] โ†’ Filter rows
  • & โ†’ AND condition
  • | โ†’ OR condition
  • loc[] โ†’ Select data

๐Ÿงช Practice Exercise

Load dataset and:

  • Find patients with dose > 500
  • Filter severe ADR cases
  • Select drug and reaction columns

๐Ÿงช Mini Project

Create a clinical filter system:

import pandas as pd

df = pd.read_csv("clinical_data.csv")

high_risk = df[(df["Age"] > 60) & (df["Dose"] > 600)]

print("High Risk Patients:")
print(high_risk)

๐Ÿ“ MCQs

  1. Filtering is used to:
    a) Delete data
    b) Select data
    c) Print data
    d) Store data
    Answer: b

  2. AND condition uses:
    a) |
    b) &
    c) +
    d) =
    Answer: b

  3. loc[] is used for:
    a) File reading
    b) Data selection
    c) Looping
    d) Printing
    Answer: b

โ“ FAQs

Why is filtering important in pharmacy?

It helps identify high-risk patients and critical clinical conditions.

How to combine multiple conditions?

Use & (AND) or | (OR).


๐Ÿ“ฅ Download Clinical Dataset for Filtering Practice

Practice identifying high-risk patient groups.


โžก Next Topic: Grouping & Aggregation โ†’

Recommended readings

  1. Introduction to Pandas (Why it is used in Pharma Data Analysis)
  2. Pandas Series & DataFrame (with patient & PK datasets)
  3. Reading CSV & Excel Files (PK datasets, ADR reports)
  4. Inspecting Data (head(), tail(), info(), describe())
  5. Data Cleaning & Missing Values (real clinical dataset problems)
  6. Filtering & Selecting Data (high dose, ADR filtering)
  7. Grouping & Aggregation (mean dose, ADR frequency)

Question Bank Unit 4: Data Handling with Pandas

For detailed information: Basics of Python Programming for Pharmaceutical Sciences