A.1.3: Case Study #3: Datasets & Risks

The Building Blocks & Guardrails

Note 1: This outline provides guidance specific to Case Study #3, complementing the general case study rubric [link].
Note 2: See lecture note 3.1 Datasets & Risks of AI [link] for some illustrative examples of the tasks.

The Brief

Understand how the training dataset shapes the capabilities and boundaries of a Generative AI model.

Investigate how specific training datasets can differentiate the capability of Generative AI (GenAI) models or LLMs.

You are NOT training customized Generative AI models.
You are exploring the boundaries of market-ready GenAI/LLMs in specific tasks (domain knowledge).

Examples:

• For example, you could ask a pre-trained GenAI model to generate “realistic” nighttime images from given daytime images (download the daytime images from [link]).

• Or you can choose another specific task in your domain of study that current LLMs on the market handle poorly, but where very specific prompts or better datasets might improve their performance.

Your analysis should address the following questions.

1. What is the domain-specific task?

(E.g., creating realistic nighttime street view images from daytime ones instead of generating a fancy rendering or fantasy-like visions)

• Problem / Function

    – What is the main objective of the task?
    – Why is this problem important?
    – Why might an LLM (or GenAI) be useful?

2. What models do you use for the task?

(E.g., do you generate the nighttime images with Stable Diffusion, Nano Banana, Midjourney, etc.?)

• Model Dependency

– Describe clearly how the model operates.
– Explain how the overall task breaks down into input data preparation, prompt (or algorithmic steps), and output steps.

• Your explanation should show

    – The overall workflow
    – The specific model’s mechanism you are leveraging (e.g., if you are using Stable Diffusion, explain how the diffusion architecture works)
    – The full inputs → transformations → outputs chain
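
As an illustration of the input preparation → prompt → output breakdown, here is a minimal Python sketch. Everything in it is hypothetical: the function names, the file name, the constraint wording, and especially `fake_model`, which only stands in for whichever real GenAI service you use and does not generate anything. The point is to show each stage as a separate, inspectable step that you can later draw in your diagram.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class RunRecord:
    """One pass through the workflow, kept so every stage can be inspected."""
    input_ref: str   # reference to the prepared input (e.g. an image path)
    prompt: str      # the exact text sent to the model
    output_ref: str  # whatever the model returned (here: a file name)


def prepare_input(image_path: str) -> str:
    """Input data preparation. Placeholder: a real step might resize or crop."""
    return image_path.strip()


def build_prompt(task: str, constraints: List[str]) -> str:
    """Prompt step: combine the task statement with explicit constraints."""
    return f"{task} Constraints: {'; '.join(constraints)}."


def run_workflow(image_path: str, task: str, constraints: List[str],
                 model: Callable[[str, str], str]) -> RunRecord:
    """Chain the stages: inputs -> transformations -> outputs."""
    prepared = prepare_input(image_path)
    prompt = build_prompt(task, constraints)
    output = model(prepared, prompt)
    return RunRecord(prepared, prompt, output)


def fake_model(image_path: str, prompt: str) -> str:
    """Hypothetical stand-in for a real GenAI call; it only renames the file."""
    return image_path.replace(".jpg", "_night.jpg")


record = run_workflow(
    " street_01.jpg ",
    "Convert this daytime street view to night.",
    ["keep building geometry", "no fantasy lighting"],
    fake_model,
)
print(record)
```

Keeping the model behind a plain function argument is a design choice for the sketch: it lets you swap one model for another while the input preparation and prompt steps stay fixed, which makes comparisons easier to attribute.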

3. Is the GenAI model (or LLM) effective at fulfilling the domain-specific task?

(E.g., mimicking a realistic nighttime urban environment?)

• Capabilities

– What strengths or behaviors does the model enable?

• Limitations

– What problems or failure modes do you diagnose in the pre-trained model?

4. How do you improve it?

• Causal reasoning

– Explain how training dataset quality and/or prompt engineering might (or might not) improve the model’s ability to perform the domain-specific task.

• Comparison

– Where helpful, compare the “initial result” to an “improved result” obtained from an alternative model, an improved prompt, or more specific training data.
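
One possible way to make such a comparison explicit is a small checklist that you fill in by inspecting each output yourself. The Python sketch below assumes three example criteria for the nighttime-image task; the criteria and the True/False values are placeholders chosen for illustration, not real results, and you would replace them with your own observations.

```python
from typing import Dict, List

# Hypothetical criteria for the nighttime-image example; choose your own.
CRITERIA = [
    "geometry preserved",
    "plausible light sources",
    "no fantasy elements",
]


def score(observations: Dict[str, bool]) -> float:
    """Fraction of criteria that the output satisfies."""
    return sum(observations[c] for c in CRITERIA) / len(CRITERIA)


def changed(initial: Dict[str, bool], improved: Dict[str, bool]) -> List[str]:
    """Criteria whose outcome differs between the two runs."""
    return [c for c in CRITERIA if initial[c] != improved[c]]


# Placeholder observations, filled in by hand after looking at each output.
initial_run = {
    "geometry preserved": True,
    "plausible light sources": False,
    "no fantasy elements": False,
}
improved_run = {
    "geometry preserved": True,
    "plausible light sources": True,
    "no fantasy elements": True,
}

print(f"initial:  {score(initial_run):.2f}")
print(f"improved: {score(improved_run):.2f}")
print("changed:", changed(initial_run, improved_run))
```

The list of changed criteria is the useful part for causal reasoning: it tells you which behaviours moved when you altered the prompt, model, or data, and which did not.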

Your analysis should reveal one key insight about how the domain-specific training dataset/prompt/harness reshapes model performance.

What a Strong Submission Demonstrates

A strong submission will:

• Clearly isolate the role of training dataset quality from the model mechanism
• Diagram the overall workflow of the model
• Zoom into the selected module (training dataset, prompt, etc.)
• Identify inputs → transformations → outputs
• Explain how the training dataset affects behavioral outcomes

Your writing should demonstrate dataset-engineering or prompt-engineering thinking.

Deliverables

Your submission has four parts; the fourth is optional.

i. Annotation (≈500 words)

Provide a short written annotation accompanying your case study.

• Title
– A clear title identifying the system or mechanism studied.

• Caption / Description
– A brief written explanation summarising the focus of your case study.

ii. The Artefact

Your artefact consists of two components:

• Supporting Explanation: contextual material that frames the domain-specific task you explore
• Input → Model → Output Representation: the core explanation showing how the whole input-output flow operates

iii. Vignette

The vignette is a short-form video presentation summarising your case study.

• Duration: approximately 60 seconds

Your video should clearly communicate:

• The GenAI or LLM model studied
• The domain-specific task you investigated
• Your key insight about how the training dataset or prompt reshapes AI model behaviour

iv. Evidence (Optional)

You may include supplementary materials in an appendix, such as:

• Code snippets
• Parameter sweeps
• Prompt comparisons
• Additional diagrams or tables
• Experimental results
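
If you include a parameter sweep or prompt comparison, it may help to enumerate the runs up front so none is skipped. The following Python sketch builds such a grid using only the standard library. The prompts and the parameter names (`guidance_scale`, `seed`) are examples assumed for an image-generation setting; use whatever settings your chosen model actually exposes.

```python
import itertools
from typing import Any, Dict, List


def sweep_grid(prompts: List[str],
               parameters: Dict[str, List[Any]]) -> List[Dict[str, Any]]:
    """Return one dict per combination of prompt and parameter values."""
    names = sorted(parameters)  # fixed order so the grid is reproducible
    rows = []
    for prompt in prompts:
        for values in itertools.product(*(parameters[n] for n in names)):
            row = {"prompt": prompt}
            row.update(zip(names, values))
            rows.append(row)
    return rows


# Example values only; they are assumptions, not recommended settings.
grid = sweep_grid(
    prompts=["night street view", "night street view, realistic lighting"],
    parameters={"guidance_scale": [5.0, 7.5], "seed": [0, 1, 2]},
)

for run_id, row in enumerate(grid):
    print(run_id, row)
```

Each row can then be paired with the output it produced, giving a table for the appendix in which every result is traceable to its exact prompt and settings.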

Grading Criteria

In a nutshell

We are not grading:

• Advanced mathematics
• Sophisticated coding
• Exhaustive technical coverage

We are grading:

Criterion	What It Means
Understanding	Do you correctly explain how the mechanism works?
Causal Reasoning / Comparison	Do you link structural design choices to observable system behaviour, including how behaviour might change under alternative designs?
Limitations	Do you identify meaningful trade-offs, constraints, or failure modes introduced by the mechanism?
Clarity	Can you explain complex ideas in a precise and accessible manner?
Visual Explanation	Do your diagrams or visual representations clearly communicate the mechanism and information flow?

Rubric

Criterion	Excellent	Adequate	Insufficient
Mechanism Accuracy	Structurally correct and clearly articulated explanation of how the mechanism operates	Mostly correct explanation with minor gaps or simplifications	Superficial, vague, or incorrect description of the mechanism
Causal Analysis	Clear linkage between structural design and system behaviour, with thoughtful comparison or reasoning	Some connection between structure and behaviour, but underdeveloped	Describes outputs or observations without structural reasoning
Failure Awareness	Identifies meaningful trade-offs, limitations, or structural constraints	Mentions limitations but without depth or explanation	No meaningful discussion of limitations
Clarity of Communication	Complex mechanism explained clearly and accessibly to non-specialists	Explanation understandable but dense or uneven	Obscure, overly technical, or difficult to follow
Visual Explanation	Diagrams clearly reveal the system workflow, mechanism, and information flow	Visuals present but only partially clarify the mechanism	Visuals absent, confusing, or decorative rather than explanatory

Submission Deadline

Original: 2026.04.29 00:00

The Standard

The goal is not to describe AI.
The goal is to understand how it works.
Do not remain at the surface of outputs.
Demonstrate that you can think in mechanisms, modules, and pipelines.
Don’t just use AI.
Explain it.