semantic-scholar

Semantic Scholar

1 event on 2026-05-20 | role: thematic | 30d history | access: keyless

Keeps: paper title, abstract snippet

Archive source — full history has value. Use pagination to browse older records.

2026-05-20 Research paper: PALS: Power-Aware LLM Serving for Mixture-of-Experts Models
Query: mixture of experts inference serving
Authors: Can Hankendi, Rana Shahout, Minlan Yu, A. Coskun
Citations: 1
Large language model (LLM) inference has become a dominant workload in modern data centers, driving significant GPU utilization and energy consumption. While prior s