semantic-scholar
Semantic Scholar1 events on 2026-04-25role: thematic30d historyaccess: keyless
Keeps: paper title, abstract snippet
Archive source — full history has value. Use pagination to browse older records.
- 2026-04-25Research paper: Scaling Multi-Node Mixture-of-Experts Inference Using Expert Activation Patterns
Query: mixture of experts inference serving Authors: A. Bambhaniya, Geonhwa Jeong, Jason Park, Jiecao Yu, Jaewon Lee Citations: 0 Most recent state-of-the-art (SOTA) large language models (LLMs) use Mixture-of-Experts (MoE) architectures to scale model capacity without proportion