ZeroHour

When the towers fall, ZeroHour still works.

Offline-first disaster response powered by two tiers of Gemma 4 — Gemma 4 E2B on-device triages in any language with no internet, relayed over BLE GATT through drones to the hub where Gemma 4 26B assigns the nearest responder in real time.

A victim speaks Telugu. A drone overhead picks up the SOS over Bluetooth. A responder already on the ground — who may not speak Malayalam or Telugu — gets the full situation in English. All of it works when the towers are down.

Built for the Gemma 4 Impact Challenge · Global Resilience + LiteRT tracks · May 2026.

The Problem — Kerala 2018

In August 2018, Kerala experienced its worst floods in nearly a century. Over 5.4 million people were displaced across all 14 districts. More than 483 lives were lost. The Indian Army, Navy, Air Force, Coast Guard, and NDRF teams from across the country poured in — over 1,200 rescue boats deployed at peak, including more than 600 fishing boats mobilised by the local community.

And yet, coordination was chaos.

The networks went down first. Cell towers submerged one by one as water levels rose. Entire villages lost all communication. Families stranded on rooftops had no way to signal their location beyond waving sarees at passing helicopters. The state government resorted to civilian WhatsApp groups — screenshots of GPS coordinates shared by desperate victims going viral — as the de facto dispatch system. There was no triage. Every SOS was equal. Critical cases were missed because no one could tell them apart from non-critical ones [1].

The language barrier made it worse. Responders flew in from Tamil Nadu, Andhra Pradesh, Karnataka, Gujarat, and Maharashtra. Many spoke no Malayalam. Victims — elderly, panicked, speaking their village dialect — could not communicate their needs to the people trying to save them. Interpreters were improvised. Critical details were lost in translation [2].

There was no intelligent routing. A boat would travel to a location only to find the family had already moved, or that someone closer and more critical had been waiting longer. Responders duplicated effort. Commanders dispatched blind. The best tool available was a phone call to a WhatsApp group [3].

This is not a Kerala problem. It repeated in Uttarakhand 2013, Chennai 2015, Cyclone Fani 2019, and in every major disaster before and since. The infrastructure that emergency response depends on is exactly what disasters destroy.

ZeroHour was built so that the next time the towers fall, the response does not.

What ZeroHour addresses

Challenge seen in Kerala 2018	ZeroHour's answer
Cell towers submerged, no connectivity	Gemma 4 E2B runs fully on-device — no internet needed
Victims stranded with no way to signal	BLE GATT relay — drone picks up SOS over Bluetooth
Responders from other states, no shared language	Gemma converts any spoken language to plain English automatically
No triage — all calls treated equally	On-device severity + emergency type classification in seconds
Coordination chaos, no intelligent routing	Gemma 4 26B at hub assigns nearest best-fit responder with ETA
GPS coordinates shared via screenshot	Structured SOS packet with precise lat/lng, type, severity, victim statement

References

[1] Scroll.in (2018). As Kerala battles flood, social media helps connect anxious relatives, coordinate relief efforts. scroll.in

[2] SDMA Kerala / Institute of Sustainable Development and Governance. Kerala Floods 2018 — Post-Disaster Report. sdma.kerala.gov.in

[3] Google AI Edge (2025). LiteRT — on-device ML inference runtime. ai.google.dev/edge/litert

Why Gemma 4 — and Not Something Else

This is the question that matters. There are larger, more capable models out there. There are cloud APIs with better benchmark numbers. So why Gemma 4?

Because in a disaster zone, the cloud is gone.

GPT-4V and Claude all require an API call. In a collapsed building with no cell signal, they are as useful as a switched-off server. Llama 3.2 Vision has no native audio understanding and no production-grade Android runtime. Whisper + a separate vision LLM would require two models loaded simultaneously on a phone — more than 4 GB of RAM on a device already under stress.

Gemma 4 E2B is the only open model that is:

Requirement	Why it matters	Gemma 4 E2B
Runs fully on-device	Disaster = no internet	✅ 2B params, fits in phone RAM
Natively multimodal	Victims send voice + photos	✅ Text + Vision + Audio in one model
Audio understanding	Most victims speak, not type	✅ `Content.AudioFile()` via LiteRT-LM
Open weights	No API key, no privacy risk, works forever	✅ Apache 2.0
Android-optimized runtime	Needs to run in seconds on a mid-range phone	✅ LiteRT (`com.google.ai.edge.litertlm`)
Structured output	Triage needs `severity`, `emergency_type`, not prose	✅ Native function calling / JSON mode

No other model hits all six. Gemma 4 E2B via LiteRT is the only combination that makes an offline-first multimodal triage system on a commodity Android phone possible today.

On inference speed: LiteRT is Google's optimized edge inference runtime — 2–3× faster than loading a GGUF through a generic runtime on the same hardware. On a Samsung A16, a full multimodal triage call (text + voice note + 2 photos) completes in under 10 seconds. That matters when someone is trapped.

On the 2B parameter scale: Gemma 4 E2B is not a compromise — it is the right tool for the job. Triage is a structured classification and summarization task. It does not need 70B parameters. It needs fast, accurate, on-device inference with enough language understanding to handle Telugu audio, flood photos, and a typed note at the same time. E2B delivers that.

What ZeroHour Builds On Top of Gemma 4

1. On-device multimodal triage

When a victim hits send, Gemma 4 E2B runs entirely on their phone. It receives:

Their typed message (any language)
Their voice note (spoken in their native language)
Photos they captured

It outputs a structured triage packet in plain English — understood by any responder anywhere:

emergency_type  : fire
severity        : critical
people_count    : 2
quick_needs     : immediate evacuation
victim_statement: I am trapped in a burning building with one other person
image_analysis  : Flames and thick smoke visible near a ground floor entrance

The victim_statement field is the key innovation. Rather than attempting word-for-word translation of romanized audio (which small on-device models do poorly), Gemma is prompted to summarize what the victim communicated in plain English. It understands the meaning from audio context — the panic, the words, the background sounds — and renders it comprehensibly. The responder knows exactly what the victim said without needing a translator.

2. BLE GATT relay pipeline — the phone as a server

When there is no internet, the phone does not give up. It becomes a Bluetooth GATT server, advertising the SOS payload to any nearby relay device. A drone flying a search grid overhead connects as a GATT client, negotiates MTU 512, reads the full JSON triage packet, POSTs it to the hub over its own uplink, and writes a 6-byte ACK back to the phone's writable characteristic. The victim's screen updates: relay acknowledged.

This is novel. Prior disaster-response BLE systems use non-connectable advertising beacons with tiny fixed-size payloads. ZeroHour uses the full GATT client-server model to transmit a rich, Gemma-generated JSON packet of up to 490 bytes — containing triage results, victim statement, image analysis, and GPS — over a single Bluetooth connection. No pairing. No app on the drone. Just a Python script and a BLE adapter.

3. Language-agnostic by design

A victim in rural Andhra Pradesh who speaks only Telugu can record a voice note. Gemma hears them, understands the situation, and generates English output for responders. The system never asks what language the victim speaks. It does not matter. Gemma 4's multilingual audio understanding handles it.

This directly addresses the Digital Equity & Inclusivity dimension of the challenge: ZeroHour works for the Tamil fisherman, the Hindi farmer, the Bengali factory worker — not just the English speaker with a stable internet connection.

Architecture & Flow

System overview

graph TD
    subgraph Phone["Victim Phone (Android · Flutter)"]
        HS[Home Screen\ntext · photos · voice note]
        SS[Sending Screen]
        G4[Gemma 4 E2B\nLiteRT on-device]
        GATT[BLE GATT Server\nSOS_SERVICE_UUID]
        PS[Packet Summary Screen]
        HS --> SS
        SS --> G4
        G4 -->|triage JSON| SS
        SS -->|offline path| GATT
        SS -->|internet path| DIRECT[Direct HTTP POST]
    end

    subgraph Drone["Relay Drone (Python · Bleak)"]
        SCAN[BLE Scanner\nfilter by service UUID]
        CLIENT[GATT Client\nMTU 512 · read · ACK write]
        SCAN --> CLIENT
    end

    subgraph Hub["Hub (FastAPI · port 8001)"]
        API[POST /sos/]
        AI[AI Assignment Engine\nGemma 4 26B via Google AI Studio]
        PG[(PostgreSQL)]
        RD[(Redis pub/sub)]
        API --> AI --> PG
        AI --> RD
    end

    GATT -->|BLE connect| SCAN
    CLIENT -->|POST /sos/| API
    CLIENT -->|write ACK char| GATT
    DIRECT --> API
    GATT -->|ACK received| PS
    DIRECT -->|hub response| PS

    RD -->|WebSocket push| RD2[Responder App\nReact Dashboard]

BLE GATT relay — step by step

sequenceDiagram
    participant P as Victim Phone
    participant D as Relay Drone
    participant H as Hub (FastAPI)
    participant R as Responder

    P->>P: Gemma 4 E2B triage (fully offline)
    P->>P: Open GATT server + start connectable advertising
    D->>P: BleakClient.connect()
    D->>P: request_mtu(512)
    D->>P: read SOS_DATA_CHAR
    P-->>D: SOS JSON ≤ 490 bytes (victim_statement · image_analysis · GPS · severity)
    D->>D: repair truncated JSON if needed
    D->>H: POST /sos/
    H->>H: AI assignment (Gemma 4 26B)
    H-->>D: {responder, eta_minutes, reason}
    D->>P: write SOS_ACK_CHAR (6-byte victim code)
    P->>P: EventChannel fires → Packet Summary Screen
    H->>R: WebSocket push → new assignment

App screen flow

flowchart LR
    A([Open App]) --> B[Home Screen\nType message\nAttach photos\nRecord voice note]
    B --> C[Sending Screen\nStep-by-step progress]
    C --> D{Internet?}
    D -->|Yes| E[Direct POST\nto Hub]
    D -->|No| F[Gemma 4 E2B\non-device triage]
    F --> G[BLE GATT Server\nbroadcast SOS]
    G -->|Drone connects\n& reads JSON| H[Drone relays\nto Hub]
    H -->|Writes ACK\nto phone| I[Packet Summary Screen]
    E -->|Hub responds| I
    I --> J([Send Another SOS\nor Stay Put])

On-device AI pipeline (Gemma 4 E2B)

flowchart TD
    VN[Voice Note\nany language] --> GM
    PH[Photos] --> GM
    TX[Typed Text] --> GM
    GM["Gemma 4 E2B\nvia LiteRT\ncom.google.ai.edge.litertlm"]
    GM --> ET[emergency_type\nfire · flood · medical ...]
    GM --> SV[severity\ncritical · urgent · low]
    GM --> PC[people_count]
    GM --> QN[quick_needs\nshort English phrase]
    GM --> VS[victim_statement\nplain English — what they said]
    GM --> IA[image_analysis\nwhat AI saw in photos]
    ET & SV & PC & QN & VS & IA --> PKT[SOS Packet\nsent via BLE or HTTP]

Technical Stack

Layer	Technology
On-device AI	Gemma 4 E2B via `com.google.ai.edge.litertlm:0.11.0` (LiteRT)
Victim app	Flutter + Kotlin MethodChannel / EventChannel
BLE	Android BluetoothGattServer + Bleak (Python drone)
Hub API	FastAPI + uvicorn (async, port 8001)
Database	PostgreSQL 16 (SQLAlchemy 2 async)
Real-time	Redis 7 (pub/sub + live location TTL cache)
Hub AI	Gemma 4 26B (`gemma-4-26b-a4b-it`) via Google AI Studio
Dashboard	React 18 + Vite + Tailwind CSS
Infra	GCP Cloud Run + Supabase (PostgreSQL) + Upstash (Redis)

Live Deployment (GCP)

ZeroHour is fully deployed on Google Cloud Platform using a containerized architecture on Cloud Run.

Frontend Dashboard: https://zerohour-frontend-416804666735.us-central1.run.app

Deployment Architecture

The system uses Docker for all components, orchestrated in the cloud:

FastAPI Backend: Runs on Cloud Run, scaling from 0 to 3 instances.
React Frontend: Served via Nginx on Cloud Run.
Database: PostgreSQL on Supabase.
Cache/PubSub: Redis on Upstash.
AI Triage: Gemma 4 26B hosted via Google AI Studio.

Project Structure

ZeroHour/
├── victim_app/                     # Flutter Android app
│   ├── lib/
│   │   ├── main.dart               # Startup router (resumes pending SOS on restart)
│   │   ├── config.dart             # API URL, model path, SharedPreferences keys
│   │   ├── services/
│   │   │   ├── gemma_service.dart  # LiteRT MethodChannel — prompt + JSON extraction
│   │   │   ├── api_service.dart    # Hub HTTP client
│   │   │   └── ble_sos_service.dart# GATT server lifecycle + ACK EventChannel
│   │   └── screens/
│   │       ├── home_screen.dart    # SOS form — text input, camera, voice recorder
│   │       ├── sending_screen.dart # Step flow: triage → post / BLE relay
│   │       └── packet_summary_screen.dart  # Receipt: what Gemma understood + sent
│   └── android/app/src/main/kotlin/…/MainActivity.kt
│       # Gemma 4 E2B Engine init, multimodal triage, GATT server, ACK scan
│
├── drone/
│   ├── ble_relay.py                # BLE scanner → GATT client → hub POST → ACK write
│   └── requirements.txt           # bleak, httpx, winsdk
│
├── backend/
│   ├── main.py                     # FastAPI app (port 8001)
│   ├── schemas.py                  # SOSCreate (extra=ignore), SOSOut, AssignmentBrief
│   ├── db/
│   │   ├── database.py             # Async engine + aiosqlite / asyncpg
│   │   └── models.py               # SOSPacket · Responder · Assignment
│   ├── services/
│   │   ├── gemma.py                # Hub triage — Gemma 4 26B via Google AI Studio
│   │   ├── geo.py                  # Haversine + ETA
│   │   └── pubsub.py               # Redis channels
│   └── routers/
│       ├── sos.py                  # POST /sos/ · GET /sos/queue
│       ├── responders.py           # Register · heartbeat · live GPS
│       └── ws.py                   # /ws/supervisor · /ws/responder/{code}
│
└── frontend/                       # React responder + supervisor dashboard
    └── src/apps/
        ├── responder/              # Triage queue, packet detail, map, mesh radar
        └── supervisor/

Getting Started

Prerequisites

Android device with BLE (tested: Samsung A16 5G, Android 10+)
Python 3.11+ with BLE support (drone relay)
Docker Desktop (Postgres + Redis)
Node.js 20+ (dashboard)
Flutter 3 SDK (only needed if building the app yourself)

1. Backend

cd backend
python -m venv godseye && godseye\Scripts\activate   # Windows
pip install -r requirements.txt
docker compose up -d
uvicorn main:app --reload --port 8001 --host 0.0.0.0

2. Drone relay

cd drone
pip install -r requirements.txt
python ble_relay.py --hub http://localhost:8001
# --verbose to log all BLE advertisements

3. Victim app (Android)

Option A — Install the pre-built APK (recommended)

Download the latest APK from GitHub Releases and sideload it onto your Android device.

Option B — Build from source

cd victim_app
flutter build apk --release
# Output: build/app/outputs/flutter-apk/app-release.apk
adb install build/app/outputs/flutter-apk/app-release.apk

Load the Gemma 4 E2B model weights onto the device

The app requires the Gemma 4 E2B .litertlm weights file (~2.5 GB). Download gemma-4-e2b-it.litertlm from Google AI Edge and push it to the device:

# 1. Launch the app first (so Android creates the data directory with correct ownership)
# 2. Then push the model — takes ~3 minutes
adb push gemma-4-e2b-it.litertlm /sdcard/Android/data/com.zerohour.zerohour_victim/files/gemma.litertlm

The app will copy the file to internal storage on first launch and show "Gemma 4 E2B ready" when done. The internal copy persists across reinstalls of the same signed APK — you only need to push once.

Android 14+ note: The app must be launched at least once before pushing so the system creates the data directory with the correct app ownership. Pushing before first launch will result in a "Source not found" error.

The weights file is not included in this repo.

4. Responder dashboard

cd frontend
npm install && npm run dev

Or use the live deployment: zerohour-frontend-416804666735.us-central1.run.app

API Reference

Method	Endpoint	Description
`POST`	`/sos/`	Submit SOS — triggers AI triage + assignment
`GET`	`/sos/queue`	List SOS packets (`?status=pending`)
`PATCH`	`/sos/{id}/resolve`	Resolve an SOS
`POST`	`/responders/`	Register a responder
`POST`	`/responders/{code}/location`	Heartbeat — GPS + battery + vitals
`GET`	`/responders/live/locations`	All responders active in last 30 s
`PATCH`	`/responders/{code}/status`	available / en_route / busy
`WS`	`/ws/supervisor`	Real-time all-events feed
`WS`	`/ws/responder/{code}`	Real-time assignments for one responder

Full SOS payload (GATT relay example)

{
  "victim_code": "V-BIGZ",
  "lat": 17.38530,
  "lng": 78.48667,
  "severity": "critical",
  "emergency_type": "fire",
  "message": "Victim is trapped in a fire and needs immediate rescue",
  "has_audio": true,
  "has_image": true,
  "hops": 1,
  "device_triage": {
    "emergency_type": "fire",
    "severity": "critical",
    "people_count": 2,
    "quick_needs": "immediate evacuation",
    "message": "Victim is trapped in a fire and needs immediate rescue",
    "victim_statement": "I am stuck in a burning building with one other person",
    "image_analysis": "Flames and thick smoke visible near a ground floor entrance"
  }
}

Unknown fields are silently ignored (model_config = {"extra": "ignore"} on SOSCreate) so extended GATT packets never 422.

AI Assignment Pipeline

SOS saved to Postgres → broadcast to supervisor via WebSocket
All available responders queried; Haversine distance computed, filtered ≤ 5 km
Top 5 candidates + triage context sent to Gemma 4 26B at hub
Model returns { assign, reason, eta_minutes, confidence }
Assignment persisted; responder marked en_route
Redis pub/sub pushes to responder WebSocket in real time

Falls back to nearest role-matched responder if hub AI is unavailable.

Environment Variables

Variable	Default	Description
`DATABASE_URL`	`postgresql+psycopg://...`	Postgres (Supabase)
`REDIS_URL`	`rediss://...`	Redis (Upstash)
`GEMINI_API_KEY`	required	Google AI Studio API key for Gemma 4 26B hub model
`GEMMA_MODEL`	`gemma-4-26b-a4b-it`	Gemma 4 26B model used at the hub for assignment and triage

Hackathon Context

Competition: Gemma 4 Impact Challenge (Kaggle)
Tracks entered: Global Resilience ($10,000) · LiteRT Special Technology ($10,000)
Deadline: May 18 2026
On-device model: Gemma 4 E2B (gemma.litertlm) via com.google.ai.edge.litertlm:0.11.0
Runtime: Google AI Edge LiteRT — CPU backend, multimodal (vision + audio + text)
Hub model: Gemma 4 26B (gemma-4-26b-a4b-it) via Google AI Studio

Name		Name	Last commit message	Last commit date
Latest commit History 41 Commits
backend		backend
deploy		deploy
drone		drone
frontend		frontend
victim_app		victim_app
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
docker-compose.yml		docker-compose.yml
package-lock.json		package-lock.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ZeroHour

The Problem — Kerala 2018

What ZeroHour addresses

References

Why Gemma 4 — and Not Something Else

What ZeroHour Builds On Top of Gemma 4

1. On-device multimodal triage

2. BLE GATT relay pipeline — the phone as a server

3. Language-agnostic by design

Architecture & Flow

System overview

BLE GATT relay — step by step

App screen flow

On-device AI pipeline (Gemma 4 E2B)

Technical Stack

Live Deployment (GCP)

Deployment Architecture

Project Structure

Getting Started

Prerequisites

1. Backend

2. Drone relay

3. Victim app (Android)

4. Responder dashboard

API Reference

Full SOS payload (GATT relay example)

AI Assignment Pipeline

Environment Variables

Hackathon Context

About

Uh oh!

Releases 2

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

ZeroHour

The Problem — Kerala 2018

What ZeroHour addresses

References

Why Gemma 4 — and Not Something Else

What ZeroHour Builds On Top of Gemma 4

1. On-device multimodal triage

2. BLE GATT relay pipeline — the phone as a server

3. Language-agnostic by design

Architecture & Flow

System overview

BLE GATT relay — step by step

App screen flow

On-device AI pipeline (Gemma 4 E2B)

Technical Stack

Live Deployment (GCP)

Deployment Architecture

Project Structure

Getting Started

Prerequisites

1. Backend

2. Drone relay

3. Victim app (Android)

4. Responder dashboard

API Reference

Full SOS payload (GATT relay example)

AI Assignment Pipeline

Environment Variables

Hackathon Context

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 2

Contributors

Uh oh!

Languages