A.1.3: Case Study #3: Datasets & Risks

The Building Blocks & Guardrails

Note 1: This outline provides guidance specific to Case Study #3, complementing the general case study rubric [link].
Note 2: See lecture note 3.1 Datasets & Risks of AI [link] for some illustrative examples of the tasks.


The Brief

Understand how the training dataset guides the capability and boundary of a Generative AI model.

Investigate how specific training datasets can differentiate the capability of Generative AI (GenAI) models or LLMs.

You are NOT training customized Generative AI models.
You are exploring the boundaries of market-ready GenAI/LLMs in specific tasks (domain knowledge).

Examples:

• It can be asking a pre-trained LLM to generate “realistic” nighttime images based on given input daytime images (download daytime images from the [link])

• Or you can think about another specific task in your domain of study that you notice current LLMs on the market cannot handle well, but through very specific prompts or better datasets, they might improve their capabilities.

Your analysis should address the following questions.


1. What is the domain-specific task?

(E.g., creating realistic nighttime street view images from daytime ones instead of generating a fancy rendering or fantasy-like visions)

Problem / Function

    – What is the main objective of the task?
    – Why is this problem important?
    – Why LLM (or GenAI) might be useful?


2. What models do you use for the task?

(E.g., Do you generate the nighttime images with StableDiffusion, Nano Banana, Midjourney, etc.?)

Model Dependency

    – Describe clearly how it operates.
    – Explain how the overall task breaks down into input data preparation, prompt (or algorithmic steps), and output steps.

Your explanation should show

    – The overall workflow
    – The specific model’s mechanism you are leveraging (E.g., if you are using StableDiffusion, explain how the Diffusion architecture works)
    – The full Inputs → transformations → outputs link


3. Is the GenAI model (or LLM) powerful in fulfilling the domain-specific task?

(E.g., mimicking a realistic nighttime urban environment?)

Capabilities

    – What strengths or behaviors does the model enable?

Limitations

    – What problems or dysfunctionality do you spot/diagnose from the pre-trained model?


4. How do you improve it?

Causal reasoning

    – Explain how training dataset quality, and/or prompt engineering might (or might not) improve the model’s ability for the domain-specific task.

Comparison

    – Where helpful, compare the “initial results” to an “improved result” from an alternative model, or an improved prompt, or feeding in specific training data.

Your analysis should reveal one key insight about how the domain-specific training dataset/prompt/harness reshapes model performance.


What a Strong Submission Demonstrates

A strong submission will:

Your writing should demonstrate training dataset engineering thinking or prompt engineering thinking.


Deliverables

Your submission must include four parts.


i. Annotation (≈500 words)

Provide a short written annotation accompanying your case study.

Title
    – A clear title identifying the system or mechanism studied.

Caption / Description
    – A brief written explanation summarising the focus of your case study.


ii. The Artefact

Your artefact consists of two components:

Supporting Explanation — contextual material that frames the domain specific task you explore
Input → Model → Output Representation — the core explanation showing how the whole input-output flow operates.


iii. Vignette

The vignette is a short-form video presentation summarising your case study.

• Duration: approximately 60 seconds

Your video should clearly communicate:

• The GenAI or LLM model studied
• The domain-specific task you investigated
• Your key insight about how the training dataset or prompt reshapes AI model behaviour


iv. Evidences (Optional)

You may include supplementary materials in an appendix, such as:

• Code snippets
• Parameter sweeps
• Prompt comparisons
• Additional diagrams or tables
• Experimental results


Grading Criteria

In a nutshell

We are not grading:

We are grading:

Criterion What It Means
Understanding Do you correctly explain how the mechanism works?
Causal Reasoning / Comparison Do you link structural design choices to observable system behaviour, including how behaviour might change under alternative designs?
Limitations Do you identify meaningful trade-offs, constraints, or failure modes introduced by the mechanism?
Clarity Can you explain complex ideas in a precise and accessible manner?
Visual Explanation Do your diagrams or visual representations clearly communicate the mechanism and information flow?

Rubric

Criterion Excellent Adequate Insufficient
Mechanism Accuracy Structurally correct and clearly articulated explanation of how the mechanism operates Mostly correct explanation with minor gaps or simplifications Superficial, vague, or incorrect description of the mechanism
Causal Analysis Clear linkage between structural design and system behaviour, with thoughtful comparison or reasoning Some connection between structure and behaviour, but underdeveloped Describes outputs or observations without structural reasoning
Failure Awareness Identifies meaningful trade-offs, limitations, or structural constraints Mentions limitations but without depth or explanation No meaningful discussion of limitations
Clarity of Communication Complex mechanism explained clearly and accessibly to non-specialists Explanation understandable but dense or uneven Obscure, overly technical, or difficult to follow
Visual Explanation Diagrams clearly reveal the system workflow, mechanism, and information flow Visuals present but only partially clarify the mechanism Visuals absent, confusing, or decorative rather than explanatory

Submission Deadline

Original: 2026.04.29 00:00


The Standard

The goal is not to describe AI.
The goal is to understand how it works.
Do not remain at the surface of outputs.
Demonstrate that you can think in mechanisms, modules, and pipelines.
Don’t just use AI.
Explain it.