A.1.3: Case Study #3: Datasets & Risks
The Building Blocks & Guardrails
Note 1: This outline provides guidance specific to Case Study #3, complementing the general case study rubric [link].
Note 2: See lecture note 3.1 Datasets & Risks of AI [link] for some illustrative examples of the tasks.
The Brief
Understand how the training dataset guides the capability and boundary of a Generative AI model.
Investigate how specific training datasets can differentiate the capability of Generative AI (GenAI) models or LLMs.
You are NOT training customized Generative AI models.
You are exploring the boundaries of market-ready GenAI/LLMs in specific tasks (domain knowledge).
Examples:
• It can be asking a pre-trained LLM to generate “realistic” nighttime images based on given input daytime images (download daytime images from the [link])
• Or you can think about another specific task in your domain of study that you notice current LLMs on the market cannot handle well, but through very specific prompts or better datasets, they might improve their capabilities.
Your analysis should address the following questions.
1. What is the domain-specific task?
(E.g., creating realistic nighttime street view images from daytime ones instead of generating a fancy rendering or fantasy-like visions)
• Problem / Function
– What is the main objective of the task?
– Why is this problem important?
– Why LLM (or GenAI) might be useful?
2. What models do you use for the task?
(E.g., Do you generate the nighttime images with StableDiffusion, Nano Banana, Midjourney, etc.?)
• Model Dependency
– Describe clearly how it operates.
– Explain how the overall task breaks down into input data preparation, prompt (or algorithmic steps), and output steps.
• Your explanation should show
– The overall workflow
– The specific model’s mechanism you are leveraging (E.g., if you are using StableDiffusion, explain how the Diffusion architecture works)
– The full Inputs → transformations → outputs link
3. Is the GenAI model (or LLM) powerful in fulfilling the domain-specific task?
(E.g., mimicking a realistic nighttime urban environment?)
• Capabilities
– What strengths or behaviors does the model enable?
• Limitations
– What problems or dysfunctionality do you spot/diagnose from the pre-trained model?
4. How do you improve it?
• Causal reasoning
– Explain how training dataset quality, and/or prompt engineering might (or might not) improve the model’s ability for the domain-specific task.
• Comparison
– Where helpful, compare the “initial results” to an “improved result” from an alternative model, or an improved prompt, or feeding in specific training data.
Your analysis should reveal one key insight about how the domain-specific training dataset/prompt/harness reshapes model performance.
What a Strong Submission Demonstrates
A strong submission will:
- Isolate the role of the training dataset quality from the mechanism clearly
- Diagram the overall workflow of the model
- Zoom into the selected module (training dataset/prompt/etc)
- Identify inputs → transformations → outputs
- Explain how the training dataset matters to behavioral outcomes
Your writing should demonstrate training dataset engineering thinking or prompt engineering thinking.
Deliverables
Your submission must include four parts.
i. Annotation (≈500 words)
Provide a short written annotation accompanying your case study.
• Title
– A clear title identifying the system or mechanism studied.
• Caption / Description
– A brief written explanation summarising the focus of your case study.
ii. The Artefact
Your artefact consists of two components:
• Supporting Explanation — contextual material that frames the domain specific task you explore
• Input → Model → Output Representation — the core explanation showing how the whole input-output flow operates.
iii. Vignette
The vignette is a short-form video presentation summarising your case study.
• Duration: approximately 60 seconds
Your video should clearly communicate:
• The GenAI or LLM model studied
• The domain-specific task you investigated
• Your key insight about how the training dataset or prompt reshapes AI model behaviour
iv. Evidences (Optional)
You may include supplementary materials in an appendix, such as:
• Code snippets
• Parameter sweeps
• Prompt comparisons
• Additional diagrams or tables
• Experimental results
Grading Criteria
In a nutshell
We are not grading:
- Advanced mathematics
- Sophisticated coding
- Exhaustive technical coverage
We are grading:
| Criterion | What It Means |
|---|---|
| Understanding | Do you correctly explain how the mechanism works? |
| Causal Reasoning / Comparison | Do you link structural design choices to observable system behaviour, including how behaviour might change under alternative designs? |
| Limitations | Do you identify meaningful trade-offs, constraints, or failure modes introduced by the mechanism? |
| Clarity | Can you explain complex ideas in a precise and accessible manner? |
| Visual Explanation | Do your diagrams or visual representations clearly communicate the mechanism and information flow? |
Rubric
| Criterion | Excellent | Adequate | Insufficient |
|---|---|---|---|
| Mechanism Accuracy | Structurally correct and clearly articulated explanation of how the mechanism operates | Mostly correct explanation with minor gaps or simplifications | Superficial, vague, or incorrect description of the mechanism |
| Causal Analysis | Clear linkage between structural design and system behaviour, with thoughtful comparison or reasoning | Some connection between structure and behaviour, but underdeveloped | Describes outputs or observations without structural reasoning |
| Failure Awareness | Identifies meaningful trade-offs, limitations, or structural constraints | Mentions limitations but without depth or explanation | No meaningful discussion of limitations |
| Clarity of Communication | Complex mechanism explained clearly and accessibly to non-specialists | Explanation understandable but dense or uneven | Obscure, overly technical, or difficult to follow |
| Visual Explanation | Diagrams clearly reveal the system workflow, mechanism, and information flow | Visuals present but only partially clarify the mechanism | Visuals absent, confusing, or decorative rather than explanatory |
Submission Deadline
Original: 2026.04.29 00:00
The Standard
The goal is not to describe AI.
The goal is to understand how it works.
Do not remain at the surface of outputs.
Demonstrate that you can think in mechanisms, modules, and pipelines.
Don’t just use AI.
Explain it.