Exam Writing Format & Strategy

Exam Rules Recap

Detail	2025 Format	2024 Format
Duration	60 min (5 reading + 55 writing)	60 min (5 reading + 55 writing)
Total marks	20	30
Questions	6	7
Allowed	Double-sided handwritten A4	Double-sided page of notes
Devices	NO calculators, NO phones	Same

Time management:

2025: ~20 marks in 55 min = ~2.75 min per mark
2024: ~30 marks in 55 min = ~1.83 min per mark
Rule of thumb: 1 mark ≈ 2-3 minutes. If a question is worth 2 marks, spend ~5 minutes max.

The 5-Minute Reading Period Strategy

During the 5-minute reading time (no writing allowed):

Scan ALL questions — count marks, identify topics
Identify the calculation question (CNN dimensions) — mentally plan the steps
Identify the diagnosis question (bias/variance from curves) — start forming your answer
Plan your time allocation — more time on high-mark questions
Identify what you need from your cheat sheet — locate formulas you'll need

Answer Formatting Rules

Rule 1: Lead with the Answer

💡 中文思维习惯："先铺垫再给结论"。英文学术写作相反：先给结论，再解释原因。这是中国学生最需要改变的习惯。

❌ "There are many factors to consider. First, we need to think about..."
✅ "No, training for 2000 epochs will not help because the model is already overfitting."

Rule 2: Be Concise — Quality Over Quantity

💡 不要写"废话"凑字数。老师明确说了 "quality over quantity"。用3句精确的话比写半页模糊的要好。

❌ (half page of vague general knowledge)
✅ "This is an overfitting scenario (train 95%, val 60%). L2 regularisation will help 
    because it penalises large weights, promoting a simpler model that generalises better."

Rule 3: Link to the Specific Scenario

💡 不要只写"正则化能防止过拟合"这种教科书式的回答。必须引用题目给的具体数字（如 train=95%, val=60%）来支撑你的判断。

❌ "Regularisation helps prevent overfitting." (too generic)
✅ "Since the training accuracy (95%) is much higher than validation (60%), 
    indicating overfitting, L2 regularisation is likely to help by constraining 
    model complexity." (linked to given numbers)

Rule 4: Show Calculation Steps

💡 计算题一定要写出公式和代入过程，不能只写最终答案。即使算错了，步骤正确也能拿部分分。

❌ "The output is 16x16x10"
✅ "Conv output: ((50 + 2×0 - 5) / 3) + 1 = (45/3) + 1 = 15 + 1 = 16
    Output: [16, 16, 10] (10 from number of filters)"

Rule 5: For "What Do You Think?" — Go Beyond Numbers

💡 "你怎么看"这类题不能只算数字，要解释数字背后的含义。模型到底在做什么？为什么会这样？

❌ "Accuracy is 60% and recall is 100%."
✅ "The accuracy is 60% and recall is 100%. This means the model predicts almost 
    everything as positive — it catches all actual positives (perfect recall) but 
    at the cost of many false positives (precision only 56%). The model appears 
    to be performing well at detecting positives, but is actually just classifying 
    nearly everything as positive."

Answer Templates by Question Type

Type: "Evaluate This Suggestion" (2 marks each)

[YES/NO — 1 mark]
[Reasoning connected to scenario — 1 mark]

Template:

[Yes/No], [suggestion] is [likely/unlikely] to improve the validation accuracy. 
The model is currently [overfitting/underfitting] (training accuracy [X]%, 
validation accuracy [Y]%). [Suggestion] [mechanism: e.g., "penalises large weights" / 
"adds more training data"] which [helps/does not help] with [overfitting/underfitting] 
because [specific reason].

Type: "Calculate + Interpret" (Confusion Matrix)

Step 1: State formulas
Step 2: Plug in numbers
Step 3: Give result
Step 4: Interpret (what does the model actually DO?)
Step 5: Explain WHY (class imbalance? threshold too low?)

Type: "Explain Concept" (2-4 marks)

Paragraph 1: WHAT it is (definition)
Paragraph 2: HOW it works (mechanism)
Paragraph 3: WHY it matters (benefit/purpose)

Type: CNN Calculation (show workings)

For each layer:
  Input:   [H, W, C]
  Formula: ((H + 2p - f) / s) + 1 = ...
  Output:  [H', W', C']

Final: Flatten = H' × W' × C' = [number]

Time Allocation Guide

For a 20-mark exam (2025 format):

Question	Marks	Time	Topic
Q1	2	5 min	Data cleaning
Q2	3	8 min	Bias-variance + curves
Q3	3	8 min	Activation functions
Q4	4	10 min	Learning rate curves
Q5	4	12 min	Transformers
Q6	4	12 min	CNN calculation

For a 30-mark exam (2024 format):

Question	Marks	Time	Topic
Q1	4	7 min	Data preprocessing
Q2	6	11 min	Design choices
Q3	4	7 min	Confusion matrix
Q4	4	7 min	LR + optimisers
Q5	4	8 min	RNN vs Transformer
Q6	4	8 min	CNN calculation
Q7	4	7 min	DNN training

Last-Minute Reminders

Read the scenario carefully — the numbers matter (train acc, val acc, missing values count)
Diagnose BEFORE prescribing — always state overfitting/underfitting first
Show your work on calculations — partial marks are possible
Cross out wrong work — the exam says "cross out work you don't want assessed"
Write clearly — illegible answers get 0 marks
Use overflow pages if needed, but note which question on the original page

中国学生考试写作常见问题（Common Issues for Chinese Students）

问题 1：先写"背景"再给"结论"

❌ 中文习惯："首先，正则化是一种技术……它的作用是……所以我认为……"
✅ 英文习惯："Yes, L2 regularisation will help. This is because..."

→ 英文考试要求：第一句话就给结论，然后解释原因。

问题 2：过度使用 "can"

❌ "Regularisation can help to improve the model."
✅ "Regularisation is likely to improve validation accuracy by constraining model complexity."

→ "can" 太弱。用 "is likely to" 或 "will" 更果断、更学术。

问题 3：缺少因果连接

❌ "The model is overfitting. We should use dropout."（两句话之间没有逻辑连接）
✅ "The model is overfitting, as evidenced by the gap between training and validation accuracy. Therefore, applying dropout is likely to help by reducing co-adaptation."

→ 用 "as evidenced by"、"therefore"、"because" 把句子连起来。

问题 4：用中文直译

❌ "The model learned too good on the training data"
✅ "The model fits the training data too closely"

❌ "The performance is not good enough"
✅ "The model fails to generalise to unseen data"

❌ "We can use a more big model"  
✅ "Increasing the model size would help"

问题 5：不敢下判断

❌ "Maybe this suggestion could possibly help..."
✅ "Yes, this suggestion is likely to improve validation accuracy."

→ 考试要明确 YES/NO。模糊的回答拿不到分。