Challenge
Subject: Users want live weather in chat
Beta users ask "What's the weather in Seattle?" and "Do I need an umbrella today?" Copilot gives generic, outdated answers. We need live weather this week.
Your role: AI engineer on Copilot v0.1 · Ships: This week
Product will not accept a solution that breaks any row below. Use these when you compare approaches.
- 1. Speed — under 3 seconds end-to-end From the user sending “What’s the weather in Seattle?” to Copilot’s reply must take less than 3 seconds. Slow pipelines fail this release.
- 2. Any city — not a fixed list Users can name Seattle, Detroit, Tokyo, or any city. The solution must work for cities you did not hard-code.
- 3. Live conditions — not training data Answers must reflect right now (temperature, rain, etc.). Generic facts from model memory (“Seattle is often rainy”) do not count — you must prove the data is current.
- 4. Small scope — ship weather in chat this week Copilot v0.1 is chat-only today. Your deliverable is live weather inside that chat — not a separate project (agent platform, memory system, doc pipeline). You should compare approaches like prompt, tool/API, and RAG; pick what best meets constraints 1–3 for this release.
Done when: "Weather in Seattle?" → current answer + evidence
Compare
- Speed < 3s Question → answer in under 3 seconds
- Any city Not hard-coded to Seattle only
- Live conditions Prove right-now weather — not model memory
- Small scope Ship weather in chat this week — compare all options, don’t expand product surface
Select all that apply for this approach.
Select all that apply for this approach.
Optional third approach — select if you considered it.
Initial pick:
Experiment
Run your initial pick first, then try at least one other approach before deciding.
Decide
Pick your final architecture, then say what the experiments proved.
1. Your final decision
Or fill a “you tested this” rejection section below instead of this block.
3. Rejections — complete any sections that appear after step 1.
Prove — Transfer
You already picked an architecture. Now test it on a new city — not Seattle.
Click send, read the reply and evidence panel, then check what you see.
Complete Step 1 first. Then send the follow-up and check whether the answer uses current Detroit weather.
Record
Decision log (auto-draft):
M1 decision recorded
You compared approaches, ran an experiment, and proved transfer.