AI Python for Beginners [Complete this before mid term - optional]
Book: Python For Everybody (selected topics and chapters e.g. file handlingrelated chapter)
Working with Hugging Face [Complete this certificate before mid term]
Introduction to LLMs in Python [We will also try to Complete this certificate before mid term inshaAllah]
Implementing the CS50 Duck with OpenAI's APIs - Rongxin Liu & David J. Malan
Hugging Face: Agents Course
Hugging Face: MCP Course
VisionTracker Lab: Structured Vibe Coding (SVC) Framework
Skill Track: OpenAI Fundamentals (must)
Course 1 (mid): Working with the OpenAI API
Course 2 (chapters 1 & 2 in mid and ch 3, 4 in final): ChatGPT Prompt Engineering for Developers
Course 3 (will be covered in 03 lectures maximum inshaAllah): Developing AI Systems with the OpenAI API
Course 4 (will be covered in 02 lectures maximum inshaAllah):: Introduction to Embeddings with the OpenAI API
Tutorial : Fine-Tuning GPT-3 Using the OpenAI API and Python
Project (Its your assignment! do it yourself): Topic Analysis of Clothing Reviews with Embeddings
Skill Track: Developing AI Applications (selected courses)
Project (Its your assignment! do it yourself): Planning a Trip to Paris with the OpenAI API
Course (Will be covered in 01 lecture inshaAllah): AI Ethics
Course (Will be covered in 01 lecture inshaAllah): Working with Hugging Face
Course (Will be covered in 01 lecture inshaAllah): Introduction to Data Privacy
Course (Will be covered in 02 lecture inshaAllah): Developing LLM Applications with LangChain
Career Track: Associate AI Engineer for Developers (selected courses)
Project (Its your assignment! do it yourself): Organizing Medical Transcriptions with the OpenAI API
Course (If time permits, it will be covered in 02 lecture inshaAllah): LLMOps Concepts
Course (We will not cover this course in this semester. It is suggested to read this with the course introduction to data science or tools & techniques in data science: Software Engineering Principles in Python
Week # 1: Introduction to Prompt Engineering, LLMs, GenAI, Deepseek, prompt optimization, types of prompts (zero shot prompting, few or multiple shot prompting, Chain of Thoughts (COT) prompting, ...)
Discuss interface of Google AI Studio (if multi-media available).
Week # 2: Finding a best fit line to data points using genetic algorithm. Linear regression: Simple and Multiple. How to search parameters in linear regression. Equation of a straight line in slope intercept form. Line drawing- slope intercept form, two pint form. Scatter plot. Regression line. Slope and angle of line with x-axis, m=tan( theta), theta = tan^-1(m).
Types of Prompts with examples- zero shot prompting, few or multiple shot prompting, Chain of Thoughts (COT) prompting. Sentiment Analysis or Sentiment Classification. Introduction (input, output). Applications of sentiment analysis. Zero shot and few shot based sentiment classification.
Week # 3: Python Basics
Python: Integer variable, List, List of List, Tuples, Accessing elements of a list/tuple/numpy array, converting lists to numpy array, calling builin functions ( print, len, type), basics of numpy Pandas: 1D, 2D arrays, converting array to pandas dataframe, adding new column to a dataframe, shape of numpy array using .shape attribute.
Week # 4:
Python: Strings, formatted strings in Python, List, For Loop, Writing prompt as a formatted string, Using formatted string in a loop (Example: Greetings related prompt) . Introduction to Python constructs related to API based LLMs prompting.
Week # 5:
Introduction to Google AI Studio and getting familiarity with various terminologies related with LLMs and GenAI. How to generate Gemini API key, How to choose a model, Tools (Function calling, Structured output, Code Execution, Grounding With Google Search, ). LLMs Hallucinate? LLMs are non-deterministic. LLMs parameters (Temperature). System Instructions. Advanced settings: Output length, Top P, Safety Settings- Harmful, Add stop sequence. Get code (of recent interaction with Google AI Studio).
What is a Token in LLM context?
What is Agentic AI?
Week # 6-8:
Functions in Python: Built in, user defined. Function call, function header, function arguments. Define a power function. Power( ), return statement. Default argument. Call Power(2,4), Power(3,2), Power(3) etc. Interactive program (Take input from the user). What is a user prompt. Data type of input values using input().
What is a spreadsheet software? Names of two spread sheet softwares - MS Excel, Google sheet. What is a work book, work sheet(s)? What is a cell, explain what is C8 cell? How to enter a formula into a spread sheet- user defined and using built-in functions. Write a formula to copy a cell value from another cell. Write a formula to count how many students present on a particular day? Apply a formula on a range of cell values. Explain the syntax of COUNTIF() function.
Classes: data members, member functions, constructor, object, dot operator, encapsulation, inheritance.
Announcement: Don't forget to complete the 5 day course you are already registered into it as discussed in the class multiple times.
5-Day Gen AI Intensive Course with Google (Monday, March 31 - Friday, April 4 )
How the 5-Day Gen AI Intensive Course with Google Course Works:
Everyday, you’ll receive an email with assignments including the latest versions of all whitepapers, codelabs and podcast episodes.
While completing the assignments, you’ll be able to ask questions and discuss with other participants on our Discord channel.
Everyday, Paige Bailey will host a YouTube livestream at 11 AM PT / 7 PM CET / 11:30 PM IST starting on Monday, March 31st.
Throughout the event week, please make sure to complete all course content, including the codelabs, whitepapers, and podcast episodes.
On the final day of the event, we will launch the capstone project, which will allow you to apply the knowledge you've gained throughout the course.
By participating in the capstone project, you will earn a badge and a certificate on Kaggle.
Setup Instructions
Sign up for a Kaggle account and learn how Notebooks work. Make sure to phone verify your account, it’s necessary for the course’s codelabs.
Sign up for an AI Studio account and ensure you can generate an API key.
We also have a troubleshooting guide for the codelabs. Be sure to check there for solutions to common problems.
Sign up for a Discord account and join us on the Kaggle Discord server. We have the following channels dedicated to this event:
#5dgai-announcements: find official course announcements and livestream recordings.
#5dgai-introductions: introduce yourself and meet other participants from around the world.
#5dgai-question-forum: Discord forum-style channel for asking questions and discussions about the assignments.
#5dgai-general-chat: a general channel to discuss course materials and network with other participants.
Please note that if you would like to post on other channels on the Kaggle discord you will need to link your Kaggle account to discord here: https://kaggle.com/discord/confirmation.
Once you’ve got everything set up, please introduce yourself in the #5dgai-introductions channel on Discord. We are looking forward to meeting you!
Datacamp skill track: OpenAI Fundamentals
Interactive Course: Working with the OpenAI API
Interactive Course: ChatGPT Prompt Engineering for Developers
Interactive Course: Developing AI Systems with the OpenAI API
Interactive Course: ...
Chat with Gemini
Comprehensive guide to participate in Google DeepMind hackathon called 'Vibe Code with Gemini 3 Pro'.
Subject: URGENT & EXCITING OPPORTUNITY: Google DeepMind "Vibe Coding" Sprint
It’s a massive, time-sensitive opportunity from Google DeepMind. It is a hackathon called 'Vibe Code with Gemini 3 Pro'.
Here is the 30-second summary you need to know:
The Goal: Build a working AI app using Google AI Studio that solves a real-world problem. You will use Gemini 3 Pro (the latest model) to turn natural language prompts into complex apps. This is called 'Vibe Coding'.
The Deadline: Friday, December 12, 2025. You have exactly 4 days from today (Dec 8). It is a sprint!
The Prize: Top 50 teams get $10,000 in API credits each. That is a total prize pool of $500,000.
How to Win: You need three things:
Public App Link: A working app built in Google AI Studio.
Video Demo (2 mins): Show it working and explain the impact.
Writeup (250 words): Short and punchy explanation of the 'Why'.
Why You Should Care: This is the future of software development. 'Vibe Coding' is replacing syntax writing. Participating here creates a portfolio piece that proves you are ahead of the curve."
"Many of you are thinking, 'Teacher, we only have 4 days. How can we build something winning?' The answer is Vibe Coding. We aren't writing boilerplate code; we are architecting logic. Here is how you should think to win."
Evaluation Criteria (The "Secret Sauce" for Winning): Google judges on four pillars. We will reverse-engineer our ideas from these:
· Impact (40% - The Big One):
Don't build: A To-Do list or a simple calculator.
Do build: Something that solves a "Daily Frustration" or a "Global Challenge."
Ask yourself:
Does this help a blind person navigate?
Does it help a researcher understand a paper faster?
Does it automate a boring business task?
· Technical Depth (30% - The "Wow" Factor):
You must use Gemini 3's Superpowers: Reasoning and Multimodality.
Idea Spark:
Can your app take a video input and reason about it?
Can it read a complex diagram and write code for it?
If it's just text-in/text-out, it's not enough.
· Creativity (20%):
Use the model in a way that wasn't possible yesterday. Mix different inputs (audio + image).
· Presentation (10%):
Your 2-minute video is your sales pitch. It must tell a story.
Brainstorming Session (Interactive):
"Turn to your neighbor. I want you to pick one of these tracks: Education, Health, or Accessibility.
Find a problem that requires 'seeing' (images/video) and 'thinking' (reasoning).
Example: Take a photo of a messy pantry (Multimodality) -> App reasons about ingredients and dietary needs -> Generates a 3-day meal plan to minimize waste (Reasoning + Impact)."
The Philosophy: "Vibe Coding"
Vibe coding is Intent-Based Programming. You are no longer writing the syntax (if/else); you are writing the requirements and the vibe (behavior/style), and the AI writes the syntax.
The Trap: Students think this means "no work."
The Reality: It requires High Domain Knowledge. You cannot ask the AI to build a 'financial auditing tool' if you don't know how auditing works, because you won't be able to verify if the AI is lying to you.
Essential SE Practices for Vibe Coding:
1. Requirements Engineering (Prompt Engineering):
In Vibe Coding, your Prompt is your Specification.
Bad Prompt: "Make a quiz app."
SE Prompt: "Act as a Pedagogical Expert. Create a React-based web app that accepts a PDF lecture. Parse the PDF for key concepts (Requirement A). Generate 5 multiple-choice questions with 'distractor' answers that are plausible but incorrect (Requirement B). Display results with a 'Why you were wrong' explanation (Requirement C)."
2. Iterative Refinement (Agile):
You will not get it right on the first prompt. You must iterate.
Cycle: Prompt -> Generate -> Verify Code (Human Review) -> Refine Prompt.
3. Verification & Validation (V&V):
Since the AI writes the code, your job shifts 100% to Testing.
Does the app actually run? Does the logic hold up? You are now the Lead QA Engineer.
Example Project Plan for the Class:
Let's build a "Visual Learning Companion" (Education Track).
Concept: A student uploads a photo of a textbook page or a whiteboard diagram.
Feature 1: The app transcribes the diagram into text.
Feature 2: It creates an interactive flashcard set based on that diagram.
Feature 3: It acts as a "Tutor", allowing the user to ask questions about specific parts of the image.
We will use the Google AI Studio to build our "Visual Learning Companion". Here is the step-by-step execution plan using the Build feature.
Step 1: Introduction to the Interface
Go to aistudio.google.com.
Click on the "Build" tab (this is where the Vibe Coding magic happens, distinct from the pure "Prompt" playground).
Select Model: Ensure Gemini 3 Pro is selected for maximum reasoning capability.
Step 2: Setting the "Vibe" (System Instructions)
Locate the System Instructions block. This sets the global behavior.
System Instruction: "You are an expert Educational Technologist and Senior Full-Stack Developer. Your goal is to create intuitive, accessible learning tools. You prefer clean, modern UI (Tailwind CSS) and robust error handling. When analyzing images, look for conceptual relationships, not just text."
Step 3: The "Vibe Code" Process (The Build Bar)
In the chat/build input, we type our high-level intent (Instruction):
Instruction Prompt: "Create a web application with a split-screen interface. On the left, allow the user to drag and drop an image (diagram or notes). On the right, display an interactive chat. When an image is uploaded, use vision capabilities to analyze it and automatically generate a 3-sentence summary of the core concept."
Action: Click Run/Build.
Observation: AI Studio will generate the HTML/JS/CSS code preview immediately.
Step 4: Iteration & Temperature
Temperature Parameter: If the code is buggy or too random, lower the temperature (e.g., to 0.2) to make it more deterministic and precise. If the "Flashcard ideas" are boring, raise the temperature (e.g., to 0.9) for creativity.
Refining prompt (example): "The summary is good, but the UI looks dated. Update the UI to use a dark mode theme with neon blue accents, and add a button to 'Generate Quiz' from the analysis."
Step 5: Deployment
Once satisfied with the preview, click "Publish" or "Share" in the top right.
Ensure the link is Public (for the competition submission).
Here is a list of 7 High-Potential Project Ideas tailored for your students. These are designed to win by hitting the competition's specific sweet spot: Multimodality + Advanced Reasoning. They move beyond simple "text generation" to solving problems by "seeing" and "thinking."
1. The "Paper-to-Code" Tutor
The Problem: Students struggle to translate hand-drawn logic (flowcharts, UI sketches, math equations) into actual code.
The Solution: An app that takes a photo of a whiteboard or notebook and instantly converts it into a working code prototype or interactive explanation.
Gemini 3 Superpower: Visual Reasoning (Understanding the relationship between boxes/arrows) + Code Generation.
Spec Sketch:
o Input: Image of a handwritten flowchart or UI sketch.
o System Instruction: "You are a Senior Frontend Engineer. Analyze the sketch for layout and logic. Convert it into a functional React component using Tailwind CSS. If logic is ambiguous, add comments asking for clarification."
o Vibe: Encouraging, precise, and educational.
2. The "Lab Partner" (Science)
The Problem: Students in chemistry/physics labs often set up equipment incorrectly, leading to failed experiments or safety hazards.
The Solution: A "safety check" app. The user points their camera at their physical lab setup, and the AI verifies if it matches the experiment protocol.
Gemini 3 Superpower: Spatial Understanding (Recognizing object placement) + Safety Reasoning.
Spec Sketch:
o Input: Video scan or photo of the lab bench.
o System Instruction: "Act as a Lab Safety Officer. Compare the visual input against standard safety protocols for [Titration/Circuitry]. Identify loose wires, improper glassware, or lack of PPE. Output a 'Pass/Fail' checklist."
o Vibe: Strict but helpful, safety-first.
3. The "Appliance Whisperer"
The Problem: Modern appliances (washing machines, thermostats) use flat touchscreens or complex dials that are impossible for visually impaired people to use.
The Solution: An audio-guide app. The user points their phone at a washing machine interface. The app reads the current state and guides their finger to the correct button: "Move your hand slightly right. That is the 'Delicates' cycle."
Gemini 3 Superpower: Spatial Pointing (Pixel-precise coordinates) + Multimodality.
Spec Sketch:
o Input: Live video feed or photo of a machine interface.
o System Instruction: "You are a visual assistant for the blind. Identify the control panel elements. When the user asks 'How do I set it to cold wash?', provide directional instructions relative to the center of the image."
o Vibe: Calm, clear, and highly descriptive.
4. The "Pantry Medic"
The Problem: People with strict dietary restrictions (diabetes, allergies) struggle to quickly assess if a pantry full of random items is safe for a meal.
The Solution: Take a photo of a pantry shelf or a pile of ingredients. The app identifies every item, cross-references it with a user's health profile (e.g., "Low Sodium"), and suggests a safe recipe using only visible items.
Gemini 3 Superpower: OCR (Reading labels) + Deductive Reasoning (Filtering ingredients vs. constraints).
Spec Sketch:
o Input: Image of open fridge/pantry + Text Profile ("I have Celiac disease").
o System Instruction: "Identify all food items. Filter out anything containing gluten. Create a 3-step recipe using the remaining safe ingredients. Warn specifically if an item is 'risky'."
o Vibe: Protective and creative chef.
5. The "Lease Sentinel" (Legal/Housing)
The Problem: Students and young renters sign leases without understanding the fine print or predatory clauses.
The Solution: Upload a PDF of a rental contract. The AI acts as a "Tenant Lawyer," flagging specific clauses that are illegal or unusual for that city/region.
Gemini 3 Superpower: Long Context Window (Reading 50+ pages) + Logical Reasoning (Detecting disadvantageous terms).
Spec Sketch:
o Input: PDF Document (Lease Agreement).
o System Instruction: "You are a Housing Advocate. Review this lease for 'red flags' such as: illegal deposit deductions, vague entry clauses, or excessive late fees. Quote the exact text and explain why it is risky in simple English."
o Vibe: Professional, skeptical, and empowering.
6. The "UI Time Traveler"
The Problem: upgrading legacy software is painful. Developers hate rewriting old interfaces.
The Solution: Upload a screenshot of software from the 1990s (Windows 95 style). The app instantly "modernizes" it into a sleek 2025 Dark Mode web app, keeping the exact same functionality but updating the UX.
Gemini 3 Superpower: Visual Understanding + Design Intuition + Code Generation.
Spec Sketch:
o Input: Screenshot of an old interface.
o System Instruction: "You are a UI/UX expert. Analyze the functional elements of this legacy screen (buttons, inputs). Re-implement them using a modern 'Glassmorphism' aesthetic in CSS Grid. Keep the layout logic but fix the aesthetics."
o Vibe: Trendy and efficient.
7. The "Video debugger"
The Problem: Reproducing bugs is hard. Users send vague emails saying "it broke."
The Solution: A user records a screen capture video of them clicking around and the bug happening. The AI watches the video, correlates the clicks with the error, and writes a "Bug Report" with a hypothesis of the code failure.
Gemini 3 Superpower: Video Understanding (Temporal reasoning—cause and effect).
Spec Sketch:
o Input: Screen recording video (mp4).
o System Instruction: "Watch the user's cursor. Identify the exact moment the error occurs. Describe the sequence of actions leading up to it (Step-to-Reproduce). Hypothesize which backend service might be failing."
o Vibe: Analytical and Sherlock Holmes-esque.
If you have to pick one for the whole class to model, go with #3 (Appliance Whisperer) or #4 (Pantry Medic). They are:
Easy to Demo: Everyone has a fridge or a washing machine.
High Impact: They solve immediate human needs.
Visual: They look great in the submission video.
Chat with ChatGPT
Summary of the call:
What — This is a global hackathon / sprint called Vibe Code with Gemini 3 Pro, hosted on Google DeepMind via Kaggle / Google AI Studio. Participants build working apps or AI-powered solutions using the newest AI model Gemini 3 Pro. (Kaggle)
Why — The idea is to exploit the power of cutting-edge AI (multimodal reasoning, code generation, long-context, “agentic” workflows) to turn natural language ideas / prompts into real, functioning applications — very quickly. It’s a chance to demonstrate what “vibe coding” means in practice. (blog.google)
When & What’s at stake — The sprint runs for 1 week (initially announced as Dec 5–12, 2025). Prize pool: US$500,000 in Gemini API credits. (Kaggle)
What kinds of projects — Entries should solve “genuinely hard problems” or deliver useful experiences — in one of the allowed categories: Science, Education, Accessibility, Health, Business, or Technology. Simple scripts or basic chatbots are unlikely to succeed. (techAU)
What judges look for (evaluation criteria / scoring emphasis) — According to the call:
· Real-world impact / usefulness (major weight)
· Clever and effective use of Gemini 3 Pro’s advanced capabilities (reasoning, multimodal, long context, agentic coding)
· Creativity and originality
· Polished presentation / usability (UI/UX, polish) (techAU)
In short: This is an opportunity for students to build — in just a week — meaningful prototype applications using state-of-the-art AI. It’s ideal for learning, experimenting, building portfolios, and possibly gaining visibility/credits.
When you meet the students next class, you can give them a talk along these lines — to make them excited, spark ideas, and set expectations.
Suggested talk to students:
“Dear students — we’re now witnessing a turning-point in software engineering. With models like Gemini 3 Pro, you don’t need months of teams and coding to build powerful applications. You can turn ideas — in normal language — into real working apps.
Google has launched a 1-week global “vibe coding” sprint. They give a huge reward pool (500 000 USD in credits), but they also expect ambition: impactful, real-world apps in fields like Education, Health, Science, Accessibility, Business or general Technology.
This is not for trivial chatbots / toy projects. Judges will reward you for usefulness, originality, clever use of Gemini 3 Pro abilities, and a polished prototype.
I want you to start thinking — as your homework before next class — about project ideas: things that reflect real problems or opportunities in our region (or globally) and can benefit from AI’s power.
Next class we will discuss all ideas, select a few promising ones, form small teams, and — if you like — we will participate officially.
This is your chance to build something meaningful, learn modern AI-based software engineering, and put your names out there — all within one week.”
Key motivational points to emphasize:
· This is a global competition — you can join alongside anyone worldwide.
· Even if you don’t win, the experience will give you exposure to state-of-the-art AI, build your portfolio, and strengthen your practical skills.
· Because of “vibe coding”, the barrier to entry is much lower than traditional development; but success depends on good ideas + clear problem focus + ability to specify requirements.
· With your skills (me teaching, you learning), you have a real shot — it’s about being smart and creative, not just coding speed.
· It’s also a chance to experience modern software engineering / product management / project management in compressed, real-world form.
Because you emphasised that you want to teach sound software engineering + product management — not just code — this is the right time to combine both with “vibe coding.” Here’s how I propose to guide your students conceptually and practically.
· “Vibe coding” means moving from manual line-by-line coding to high-level intent + specification → letting AI agents produce structure, code, logic, UI, wiring. As human developers/architects you focus on what the app should do, why, how it should behave; AI handles boilerplate, implementation detail, integration, rapid prototyping.
· Human value shifts: from writing code to designing, planning, specifying, verifying, evaluating usability, edge cases, ethical implications.
· With advanced models like Gemini 3 Pro — capable of reasoning, multimodal understanding, long-context, tool use — “vibe coding” becomes realistic: prototypes that previously required teams can now be built by small teams or even individuals. (blog.google)
Even if AI generates code, we must not abandon sound SE discipline. For good projects, I recommend:
1. Clear requirement elicitation / specification — as humans, define functional requirements, non-functional requirements (performance, usability, security, privacy), user stories / use cases.
2. Domain understanding / context analysis — if project targets e.g. education, accessibility, health — understand the domain deeply: who are users, what needs exist, what constraints (language, infrastructure, ethics) matter.
3. Modular design & architecture thinking — before letting AI code, plan overall architecture: modules, data flow, UI/UX flow, error handling, edge cases.
4. Prompt engineering & instruction design — carefully craft prompts to the AI (with system instructions, constraints, edge case guidelines), so generated code aligns with your specification.
5. Testing / validation / quality control — treat AI output as first draft: review code, test functionality, check for security/privacy, validate edge cases, bug-fix.
6. Iterative design & feedback loops — treat prototype as MVP (minimum viable product), get feedback (peers / users), refine requirements, iterate.
7. Project & product management mindset — timeline, roles (who writes prompts, who verifies, who tests), task tracking, version control, documentation.
Given the short time window (1 week sprint), we should teach them lean, agile, minimal-scope but high-value projects. Process:
· Brainstorm domain problems (in class / homework).
· For each idea, write a mini project brief: problem statement, target users, proposed solution, key features, success criteria (what makes it useful), rough architecture / modules.
· Evaluate feasibility under vibe-coding constraints (can AI implement needed parts quickly? Are domain constraints manageable? Will AI reasoning suffice for logic?).
· Choose 1–2 most promising ideas, form small teams (3–5 students).
· Draft prompts / spec documents before sprint start.
· During sprint: prompt → generate → review → test → refine → polish UI/UX → final submission.
Since you teach software engineering in Bahawalpur (so many local students), maybe an Education / Local needs app — e.g.:
· An interactive tutorial-generator for local curriculums (students type topic in local language/English, AI builds lesson plan, slides, interactive quizzes).
· An accessibility tool for students with limited internet: offline-capable quiz generator + practice app.
· A small health awareness chatbot + resource aggregator for your region: educational modules for hygiene, vaccination, mental health, with multilingual support.
These fit the categories, have potential real-world impact, and can be prototyped quickly with AI.
Here’s a plan (that you can follow with students) to actually start building using Gemini 3 Pro, once we pick a project idea.
Step-by-step plan:
1. Familiarize with the tools: Introduce students to Gemini 3 Pro (its capabilities, strengths: reasoning, multimodal, long context, tool use) and to Google AI Studio (or if you prefer, Google Antigravity — an agentic-development IDE built to work with Gemini 3 Pro). (blog.google)
2. Define project requirements: As described above — write functional & non-functional requirements, user stories, expected features, UI/UX flow, data flow, modules.
3. Write prompts / system instructions: Based on requirements, craft prompts to feed into Gemini 3 Pro. Include constraints, edge-case instructions, output format, module boundaries. (For example: “Generate a Python FastAPI backend that handles user authentication, quiz storage, accepts user answers, returns quiz results — use SQLite, provide code + README.”)
4. Use AI Studio / Antigravity to generate code & iterate: Let Gemini 3 Pro produce initial code (backend, frontend, UI, integrations as needed). Then, as humans, review the code — test functionality locally, fix bugs, refine prompts to improve output.
5. Polish & finalize prototype: Ensure UI/UX is usable, error handling works, basic documentation (README, usage guide) is present. If relevant, prepare video/demo or README to explain project to judges.
6. Prepare submission according to competition requirements: Submit via Kaggle as per the call. Ensure project meets guidelines (category, impact, completeness, quality).
· Do not aim too big — given the 1-week timeframe, choose a small but meaningful project: something that solves a clear pain point, that can be mostly implemented by AI, and whose domain logic is manageable — ideally around web apps, data-driven tools, education, accessibility.
· Focus on clarity of idea + real-world impact — judges value usefulness and creative use of AI over “just fancy code.”
· Use proper SE discipline throughout — treat the AI as a powerful assistant, but you remain responsible for design, quality, testing, ethics.
· Use this as a learning opportunity — even if you don’t win, the sprint is a live lab in modern software engineering + product thinking + AI-augmented development.
Category: Education
Problem: Teachers waste hours preparing lessons, quizzes, activities.
Solution: A system where a teacher enters a chapter/topic → the AI generates full course material: lesson plans, slides, worksheets, quizzes, summaries, Bloom’s-taxonomy questions, and even video scripts.
Core Features:
· Multimodal: Accepts PDFs/images of curriculum and extracts key topics
· Lesson generator
· Quiz/assessment generator
· Classroom activities generator
· Simple web UI
Why this is a strong competition project:
High real-world impact + excellent use of Gemini’s reasoning, multimodal extraction, long-context abilities.
Category: Accessibility + Education
Problem: Students learn at different speeds but schools cannot personalize.
Solution: A smart tutor that evaluates student level and then generates lessons adapted to their pace.
Core Features:
· Student profile analysis
· Difficulty adjustment
· Personalized quizzes
· Voice-based interaction for accessibility
Why strong:
Unique impact + personalization is a high-value education challenge.
Category: Health
Problem: Many people lack access to quick medical guidance or don’t know which doctor to visit.
Solution: A safe, educational system that explains symptoms, provides general information, and suggests which specialist to visit — not medical diagnosis.
Core Features:
· Symptom explanation (non-diagnostic)
· Safety filters
· First-aid education modules
· “When should I see a doctor?” logic
· Links to local available services (non-sensitive, public info)
Why strong:
Important real-world need + easy to implement safely with AI + high impact for judges.
Category: Accessibility
Problem: Students with low vision cannot easily access printed or digital content.
Solution: A tool that converts any uploaded image/PDF into audio explanations + summaries, optimized for clarity.
Core Features:
· OCR + multimodal understanding
· Summaries, step-by-step explanations
· Math diagram → verbal walkthrough
· Simple mobile-friendly UI
Why strong:
Multimodal processing is exactly what Gemini 3 Pro excels at.
Category: Business + Technology
Problem: Young founders struggle to refine startup ideas and build MVPs.
Solution: An interactive coach that helps users:
· Validate problem
· Identify market
· Propose features
· Generate MVP architecture
· Create pitch deck + branding
Why strong:
Gemini 3’s reasoning + document generation shine here.
Category: Science + Accessibility
Problem: Farmers don’t have expert guidance on crops, watering schedules, fertilizers.
Solution: Farmers upload a picture of crop/soil → AI analyses health, suggests practices, gives weather-aware advice.
Core Features:
· Crop image analysis
· Soil description analysis
· Water/fertilizer schedule generator
· Local weather integration
Why strong:
Multimodal + high real-world relevance + innovation in the agriculture domain.
Category: Health + Education
Problem: Students face stress, anxiety, and have no safe place to talk.
Solution: An AI companion with strict safety rules that provides emotional support, guidance, healthy routines, stress-management plans.
Core Features:
· Mood journaling
· Cognitive reframing guidance
· Study–stress balance planner
· Crisis-safe guardrails
Why strong:
Huge impact + relevant to judges — but must be safe and careful (education + well-being only).
Category: Technology
Problem: Students write code but don’t receive deep feedback.
Solution: A multimodal tool where students paste code → AI reviews logic, security, structure, explains bugs with examples, and generates improved versions.
Core Features:
· Code quality scoring
· Security suggestions
· Debug walk-through
· Learning explanations
Why strong:
Straightforward to build + strong utility for developers + excellent fit for Gemini’s reasoning abilities.
Category: Education / Science
Problem: Students struggle to understand research papers.
Solution: Upload a research paper → AI extracts key ideas, contributions, methodology, results → then generates:
· Lecture slides
· Diagram explanations
· Concept summary maps
Why strong:
Uses long-context + multimodal + reasoning. Very polished and easy to execute.
Category: Education / Accessibility
Problem: No structured local information about historic sites, climate, crops, education, culture.
Solution: A system where users type or speak questions → AI gives accurate summaries based on curated local content (non-sensitive, public data).
Core Features:
· Region-specific FAQ generator
· Visual map explanations (if allowed)
· “Tour guide” mode
· “Student homework help for local history” mode
Why strong:
Creative, region-focused, unique — judges value originality + relevance.
Considering your SE teaching goals, student skill levels, and Gemini 3 features, the best choices are:
1. SmartCourse Builder 360
2. Research Paper Analyzer + Slide Generator
3. AI Code Reviewer + Debugging Mentor
4. Inclusive Learning Assistant for Visually Impaired
5. Smart Agriculture Assistant
6. Adaptive Learning Tutor
7. AI Startup Ideation Coach
8. Local Knowledge Assistant for Bahawalpur
9. Student Mental Wellness Companion (Safe-Mode)
✔ Detailed teaching script