Skip to content

[graph_trainer] Add MANIFESTO.md#3014

Merged
SherlockNoMad merged 2 commits intomainfrom
bahuang/graph-trainer-docs
Apr 20, 2026
Merged

[graph_trainer] Add MANIFESTO.md#3014
SherlockNoMad merged 2 commits intomainfrom
bahuang/graph-trainer-docs

Conversation

@SherlockNoMad
Copy link
Copy Markdown
Contributor

@SherlockNoMad SherlockNoMad commented Apr 17, 2026

Summary

  • Add MANIFESTO.md articulating why GraphTrainer exists: eager PyTorch cannot keep up with accelerator scaling, and distributed training at scale will require a compiler

Test plan

  • Documentation only, no code changes

MANIFESTO.md articulates why GraphTrainer exists: eager PyTorch cannot
keep up with accelerator scaling, and distributed training at scale will
require a compiler. Documents the eager challenges (composability,
CUDAGraph complexity, coarse scheduling) and GraphTrainer's bets (single
unified graph, everything as graph passes, SimpleFSDP, debuggability).

ARCHITECTURE.md is a skeleton pointing to the manifesto, README, and
developer guide — content to follow.
@meta-cla meta-cla Bot added the CLA Signed This label is managed by the Meta Open Source bot. label Apr 17, 2026
Will be added in a follow-up once content is ready.
@SherlockNoMad SherlockNoMad changed the title [graph_trainer] Add MANIFESTO.md and ARCHITECTURE.md [graph_trainer] Add MANIFESTO.md Apr 17, 2026
@SherlockNoMad SherlockNoMad requested a review from ezyang April 17, 2026 17:55
Copy link
Copy Markdown
Contributor

@tianyu-l tianyu-l left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

nice!

@SherlockNoMad SherlockNoMad merged commit a91f57b into main Apr 20, 2026
13 of 15 checks passed
@tianyu-l tianyu-l deleted the bahuang/graph-trainer-docs branch April 20, 2026 16:33
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ciflow/8gpu CLA Signed This label is managed by the Meta Open Source bot.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants