ReactiveGWM: Steering NPC in Reactive Game World Models

TL;DR

A game world model where the NPC follows high-level strategies, not just appears as background pixels — and the strategy module transfers zero-shot to a new game.

ReactiveGWM decouples player control from NPC behavior: player buttons enter the diffusion backbone as a lightweight additive bias, while NPC intents (Offense / Defense / Control) are grounded through cross-attention. Trained on one game, the cross-attention modules plug directly into an unannotated world model of a different game — unlocking steerable NPCs without retraining.

Method

Decoupling player control and NPC strategy

Two non-interfering pathways inside the DiT block: an additive bias for fine-grained player buttons, and cross-attention for high-level NPC strategy. Self-attention and FFN keep modeling the game's native dynamics.

1

Strategy-aligned data

Each clip is paired with an NPC-only structured prompt — a strategy (Offense / Defense / Control) plus active & passive behaviors — separated from player actions and scene captions.

Construction of strategy-aligned data: every clip is annotated with player actions and an NPC-only structured prompt.
2

Two pathways inside the DiT block

Player buttons are pooled to the latent frame rate and added as a residual bias. NPC strategy is encoded as text and injected via cross-attention. Self-attention and FFN are left intact.

DiT block with action module (additive bias) and strategy cross-attention.
3

Train once, transfer zero-shot

ReactiveGWM_base: full fine-tuning on a source game with strategy annotations. ReactiveGWM_transfer: reuse the target game's vanilla backbone and plug in our trained cross-attention — steerable NPCs without any retraining on the new game.

Overview of ReactiveGWM training and training-free transfer to a different game.

Configurations

Control interface

The player is controlled via low-level buttons; the NPC is steered via a high-level strategy.

Player — controlled by buttons NPC — steered by strategy

Frame from Street Fighter showing both characters with the player highlighted in blue and the NPC highlighted in red

Demo · Street Fighter 2

Same buttons, different strategies

Compare the vanilla backbone, ReactiveGWM_base, and ReactiveGWM_transfer under the same player input but different NPC strategies (Offense / Defense / Control).

SF2 Button Mapping

XLight Punch (LP)

YMedium Punch (MP)

ZHeavy Punch (HP)

ALight Kick (LK)

BMedium Kick (MK)

CHeavy Kick (HK)

Vanilla

ReactiveGWM
base

ReactiveGWM
transfer

NPC Strategy: Offense

Vanilla

ReactiveGWM
base

ReactiveGWM
transfer

NPC Strategy: Defense

Vanilla

ReactiveGWM
base

ReactiveGWM
transfer

NPC Strategy: Control

Prompt Detail

💡 Click the corresponding video to view its prompt details.

No video selected yet.

Demo · Street Fighter 3

Cross-game strategy transfer

ReactiveGWM_transfer reuses the strategy modules trained on SF2 on top of an unannotated SF3 backbone — steerable NPCs emerge without any retraining on this game.

SF3 Button Mapping

XHeavy Punch (HP)

YMedium Punch (MP)

ZHeavy Kick (HK)

ALight Punch (LP)

BLight Kick (LK)

CMedium Kick (MK)

Vanilla

ReactiveGWM
base

ReactiveGWM
transfer

NPC Strategy: Offense

Vanilla

ReactiveGWM
base

ReactiveGWM
transfer

NPC Strategy: Defense

Vanilla

ReactiveGWM
base

ReactiveGWM
transfer

NPC Strategy: Control

Prompt Detail

💡 Click the corresponding video to view its prompt details.

No video selected yet.

ReactiveGWM: Steering NPC in Reactive Game World Models

Decoupling player control and NPC strategy

Strategy-aligned data

Two pathways inside the DiT block

Train once, transfer zero-shot

Control interface

Same buttons, different strategies

Vanilla

ReactiveGWMbase

ReactiveGWMtransfer

NPC Strategy: Offense

Vanilla

ReactiveGWMbase

ReactiveGWMtransfer

NPC Strategy: Defense

Vanilla

ReactiveGWMbase

ReactiveGWMtransfer

NPC Strategy: Control

Prompt Detail

Cross-game strategy transfer

Vanilla

ReactiveGWMbase

ReactiveGWMtransfer

NPC Strategy: Offense

Vanilla

ReactiveGWMbase

ReactiveGWMtransfer

NPC Strategy: Defense

Vanilla

ReactiveGWMbase

ReactiveGWMtransfer

NPC Strategy: Control

Prompt Detail

ReactiveGWM
base

ReactiveGWM
transfer

ReactiveGWM
base

ReactiveGWM
transfer

ReactiveGWM
base

ReactiveGWM
transfer

ReactiveGWM
base

ReactiveGWM
transfer

ReactiveGWM
base

ReactiveGWM
transfer

ReactiveGWM
base

ReactiveGWM
transfer