April 27, 2026

Question Bank Unit 4: Data Handling with Pandas

📘 Unit 4 Question Bank – Data Handling with Pandas

Includes MCQs, Short Answer Questions, Long Answer Questions, Case-based problems, and Mini Projects with pharmaceutical applications.


🔷 Section A: MCQs

  1. Pandas is used for:
    a) Gaming
    b) Data analysis
    c) Networking
    d) Designing
    Answer: b

  2. Series is:
    a) 2D
    b) 1D
    c) 3D
    d) None
    Answer: b

  3. DataFrame represents:
    a) List
    b) Table
    c) String
    d) Loop
    Answer: b

  4. Which function reads CSV?
    a) read()
    b) read_csv()
    c) open()
    d) load()
    Answer: b

  5. head() shows:
    a) Last rows
    b) First rows
    c) All rows
    d) None
    Answer: b

  6. describe() gives:
    a) Names
    b) Statistics
    c) Rows
    d) Columns
    Answer: b

  7. Which function removes missing values?
    a) fillna()
    b) dropna()
    c) replace()
    d) remove()
    Answer: b

  8. Which operator is used for AND condition?
    a) |
    b) &
    c) +
    d) =
    Answer: b

  9. groupby() is used for:
    a) Filtering
    b) Grouping data
    c) Sorting
    d) Printing
    Answer: b

  10. mean() calculates:
    a) Sum
    b) Average
    c) Max
    d) Min
    Answer: b

🔷 Section B: Short Answer Questions

  • Define Pandas.
  • What is a Series?
  • What is a DataFrame?
  • Write syntax for reading CSV file.
  • Explain head() and tail().
  • What is info() function?
  • Define data cleaning.
  • What are missing values?
  • Explain filtering in Pandas.
  • What is groupby()?

🔷 Section C: Long Answer Questions

  1. Explain Pandas and its importance in pharmaceutical data analysis.
  2. Explain Series and DataFrame with suitable examples.
  3. Explain how to read CSV and Excel files using Pandas.
  4. Explain data inspection functions: head(), tail(), info(), describe().
  5. Explain data cleaning and handling missing values.
  6. Explain filtering and selecting data with examples.
  7. Explain grouping and aggregation techniques in Pandas.

🔷 Section D: Case-Based Questions

💊 Case 1: ADR Dataset Analysis

Dataset contains Drug, Dose, Reaction columns.

  • Load dataset using Pandas
  • Filter severe reactions
  • Count ADR frequency
💊 Case 2: Missing Data Handling
  • Identify missing values
  • Fill missing dose with mean
  • Remove incomplete rows
💊 Case 3: PK Data Analysis
  • Load PK dataset
  • Display first rows
  • Calculate average concentration
💊 Case 4: High-Risk Patient Identification
  • Filter patients with dose > 600
  • Filter elderly patients
  • Display high-risk group

🔷 Section E: Mini Projects

  • Project 1: ADR Dataset Analyzer (filter + count reactions)
  • Project 2: Patient Data Cleaner (handle missing values)
  • Project 3: PK Data Analyzer (mean, max concentration)
  • Project 4: Clinical Risk Identifier (filter high-risk patients)

🧠 Quick Revision

ConceptKey Point
PandasData analysis tool
Series1D data
DataFrameTable data
head()Top rows
CleaningFix missing data
FilteringSelect data
groupby()Analyze groups

📥 Download Unit 4 Practice Datasets & Solutions

Includes ADR datasets, PK tables, and real-world pharma problems.

Recommended readings

  1. Introduction to Pandas (Why it is used in Pharma Data Analysis)
  2. Pandas Series & DataFrame (with patient & PK datasets)
  3. Reading CSV & Excel Files (PK datasets, ADR reports)
  4. Inspecting Data (head(), tail(), info(), describe())
  5. Data Cleaning & Missing Values (real clinical dataset problems)
  6. Filtering & Selecting Data (high dose, ADR filtering)
  7. Grouping & Aggregation (mean dose, ADR frequency)

For detailed information: Basics of Python Programming for Pharmaceutical Sciences