Comprehensive open-source library of AI research and engineering skills for any AI model. Package the skills and your claude code/codex/gemini agent will be an AI research agent with full horsepower. Maintained by Orchestra Research.
We enable AI agents to autonomously conduct AI research β from literature survey and idea generation through experiment execution to paper writing. The library provides both the research orchestration layer (autoresearch, ideation, paper writing) and the engineering skills (training, evaluation, deployment) needed at each stage.
System diagram of an AI research agent
Path Towards AI Research Agent
Modern AI research requires mastering dozens of specialized tools and frameworks.
AI Researchers spend more time debugging infrastructure than testing hypotheses β slowing the pace of scientific discovery.
We provide a comprehensive skills library that enables AI agents to autonomously conduct the full research lifecycle β from brainstorming ideas to writing the paper.
Autonomous Research - The autoresearch skill orchestrates the entire research workflow using a two-loop architecture, routing to domain skills as needed
Specialized Expertise - Each domain skill provides deep, production-ready knowledge of a specific framework (Megatron-LM, vLLM, TRL, etc.)
End-to-End Coverage - 98 skills spanning the full AI research lifecycle, from ideation and literature survey to experiments and paper writing
Research-Grade Quality - Documentation sourced from official repos, real GitHub issues, and battle-tested production workflows
Available AI Research Engineering Skills
Quality over quantity: Each skill provides comprehensive, expert-level guidance with real code examples, troubleshooting guides, and production-ready workflows.
π¦ Quick Install (Recommended)
For humans β interactive installer with one command:
npx @orchestra-research/ai-research-skills
For AI agents β point your agent to the welcome doc and it handles the rest:
Read https://www.orchestra-research.com/ai-research-skills/welcome.md and follow the instructions to install and use AI Research Skills.
This installs all 98 skills, loads the autoresearch orchestration layer, and starts autonomous research.
MoE, Model Merging, Long Context, Speculative Decoding, Distillation, Pruning
Agent-Native Research Artifact
3
ARA Compiler, Research Manager, Rigor Reviewer
View All 98 Skills in Details
π¬ Autoresearch (1 skill) β Central Orchestration Layer
Autoresearch - Autonomous research orchestration using a two-loop architecture (inner optimization + outer synthesis). Manages the full lifecycle from literature survey to paper writing, routing to all domain-specific skills. Supports Claude Code /loop and OpenClaw heartbeat for continuous operation (390 lines + 3 refs)
ποΈ Model Architecture (5 skills)
LitGPT - Lightning AI's 20+ clean LLM implementations with production training recipes (462 lines + 4 refs)
Mamba - State-space models with O(n) complexity, 5Γ faster than Transformers (253 lines + 3 refs)
RWKV - RNN+Transformer hybrid, infinite context, Linux Foundation project (253 lines + 3 refs)
NanoGPT - Educational GPT in ~300 lines by Karpathy (283 lines + 3 refs)
TorchTitan - PyTorch-native distributed training for Llama 3.1 with 4D parallelism
Knowledge Distillation - Compress models 70Bβ7B with MiniLLM, temperature scaling (424 lines)
Model Pruning - 50% sparsity with Wanda, SparseGPT, <1% accuracy loss (417 lines)
π ML Paper Writing (2 skills)
ML Paper Writing - Write publication-ready papers for NeurIPS, ICML, ICLR, ACL, AAAI, COLM with LaTeX templates, citation verification, and writing best practices (532 lines + 5 refs)
Academic Plotting - Generate publication-quality figures for ML papers: architecture diagrams via Gemini AI and data-driven charts via matplotlib/seaborn with venue-specific styling (479 lines + 3 refs)
π‘ Ideation (2 skills)
Research Brainstorming - Structured ideation frameworks for discovering high-impact research directions with 10 complementary lenses (384 lines)
Creative Thinking - Cognitive science frameworks (bisociation, structure-mapping, constraint manipulation) for genuinely novel research ideas (366 lines)
𧬠Agent-Native Research Artifact (3 skills)
ARA Compiler - Compiles any research input (PDF papers, repos, experiment logs, raw notes) into a complete Agent-Native Research Artifact with claims, exploration graph, evidence, and code stubs (245 lines + 3 refs)
ARA Research Manager - Post-task research recorder that runs at session end to extract decisions, experiments, dead ends, and pivots from conversation history into the ara/ directory with user-vs-AI provenance tags (324 lines + 3 refs)
ARA Rigor Reviewer - ARA Seal Level 2 semantic epistemic review scoring six dimensions of research rigor (evidence relevance, falsifiability, scope, coherence, exploration integrity, methodology) with severity-ranked findings (322 lines + 1 ref)
Demos
All 87 skills in this repo are automatically synced to Orchestra Research, where you can add them to your projects with one click and use them with AI research agents.
Generate publication-quality figures for the Andes QoE-aware LLM serving paper β Gemini AI architecture diagrams + matplotlib data charts (CDF, multi-panel grids, bar charts)
Featured Demos: Two papers produced entirely by AI agents using the autoresearch skill. The Norm Heterogeneity paper demonstrates autonomous research pivoting β the agent refuted its own hypothesis and discovered a stronger finding. The RL Brain Scan paper demonstrates multi-skill orchestration β the agent trained RL models, analyzed internals with interpretability tools, and synthesized the insight that "DPO is rank-1 alignment." Both papers written end-to-end by the agent.
Skill Structure
Each skill follows a battle-tested format for maximum usefulness:
skill-name/
βββ SKILL.md # Quick reference (50-150 lines)
β βββ Metadata (name, description, version)
β βββ When to use this skill
β βββ Quick patterns & examples
β βββ Links to references
β
βββ references/ # Deep documentation (300KB+)
β βββ README.md # From GitHub/official docs
β βββ api.md # API reference
β βββ tutorials.md # Step-by-step guides
β βββ issues.md # Real GitHub issues & solutions
β βββ releases.md # Version history & breaking changes
β βββ file_structure.md # Codebase navigation
β
βββ scripts/ # Helper scripts (optional)
βββ assets/ # Templates & examples (optional)
Quality Standards
300KB+ documentation from official sources
Real GitHub issues & solutions (when available)
Code examples with language detection
Version history & breaking changes
Links to official docs
Roadmap
We're building towards 80 comprehensive skills across the full AI research lifecycle. See our detailed roadmap for the complete development plan.
Architecture, Tokenization, Fine-Tuning, Mechanistic Interpretability, Data Processing, Post-Training, Safety, Distributed, Optimization, Evaluation, Infrastructure, Inference, Agents, RAG, Multimodal, Prompt Engineering, MLOps, Observability, ML Paper Writing, Ideation, Autoresearch
Full Lifecycle β
Recent Progress: npm package @orchestra-research/ai-research-skills for one-command installation across all coding agents
Philosophy: Quality > Quantity. Following Anthropic official best practices - each skill provides 200-500 lines of focused, actionable guidance with progressive disclosure.
Open Source AI Community - For amazing tools and docs
Special thanks to:
EleutherAI, HuggingFace, NVIDIA, Lightning AI, Meta AI, Anthropic
All researchers who maintain excellent documentation
Contributors
Thanks to all the people who have contributed to the AI Research Skills Library:
We welcome contributions from the AI research community! See CONTRIBUTING.md for detailed guidelines on:
Adding new skills
Improving existing skills
Quality standards and best practices
Submission process
Recent Updates
April 2026 - v1.6.0 𧬠Agent-Native Research Artifact (ARA) β 23rd Category, 98 Skills
𧬠NEW CATEGORY: 22-agent-native-research-artifact/ (the 23rd category) β three skills that turn research outputs into a falsifiable, agent-traversable artifact:
π οΈ ARA Compiler β compiles any input (PDF papers, GitHub repos, experiment logs, raw notes) into a structured ARA with cognitive layer (claims, concepts, heuristics), physical layer (configs, code stubs), exploration graph (research DAG), and grounded evidence
π ARA Research Manager β post-task epilogue that scans conversation history at session end and writes decisions, experiments, dead ends, claims, heuristics, and pivots into the ara/ directory with user / ai-suggested / ai-executed / user-revised provenance tags
π ARA Rigor Reviewer β Seal Level 2 semantic epistemic review scoring six dimensions of research rigor (evidence relevance, falsifiability, scope calibration, argument coherence, exploration integrity, methodological rigor) and emitting a severity-ranked report with a Strong Accept-to-Reject recommendation
π Sourced from the Agent-Native-Research-Artifact-Init reference repo, restructured to AI-research-SKILLs standards (kebab-case names, third-person descriptions, Title-Case tags, one-level-deep references)
π Auto-syncs to Orchestra marketplace via sync-skills.yml on push; npm package republished as @orchestra-research/[email protected] via publish-npm.yml on version bump
π 98 total skills across 23 categories β full lifecycle from idea β paper β falsifiable, auditable artifact
March 2026 - v1.4.0 π¬ Autoresearch & 86 Skills β Full Research Lifecycle
π¬ NEW SKILL: Autoresearch β autonomous research orchestration using a two-loop architecture (inner optimization loop + outer synthesis loop)
π§ Manages the full research lifecycle: literature survey β ideation β experiments β synthesis β paper writing
π Routes to all 86 domain skills automatically β agents don't need to know which skill to use
β° Mandatory /loop (Claude Code) and cron job (OpenClaw) for continuous autonomous operation
π Generates research presentations (HTML/PDF) with optimization trajectory plots for human review
π Findings.md as persistent project memory across sessions with "Lessons and Constraints" tracking
π 9,617 new lines of documentation across 30 files
32 total skills (45% towards 70-skill target)
November 6, 2025 - v0.2.0
Added 10 skills from GitHub (Megatron-Core, Lightning, Ray Train, etc.)
Improved skill structure with comprehensive references
Created strategic roadmap to 70 skills
Added contribution guidelines
November 3, 2025 - v0.1.0
π Initial release with 5 fine-tuning skills
Community
Join our community to stay updated, ask questions, and connect with other AI researchers:
SkillEvolve Meta-Skill - Connect your agent to the collective intelligence of the community. Captures techniques discovered during sessions and shares them back as curated skills.