Show simple item record

dc.contributor.authorLechowicz, Adam
dc.contributor.authorShenoy, Rohan
dc.contributor.authorBashir, Noman
dc.contributor.authorHajiesmaili, Mohammad
dc.contributor.authorWierman, Adam
dc.contributor.authorDelimitrou, Christina
dc.date.accessioned2025-09-10T19:38:55Z
dc.date.available2025-09-10T19:38:55Z
dc.date.issued2025-08-27
dc.identifier.isbn979-8-4007-1524-2
dc.identifier.urihttps://hdl.handle.net/1721.1/162640
dc.descriptionSIGCOMM ’25, Coimbra, Portugalen_US
dc.description.abstractAs large-scale data processing workloads continue to grow, their carbon footprint raises concerns. Prior research on carbon-aware schedulers has focused on shifting computation to align with the availability of low-carbon energy, but these approaches assume that each task can be executed independently. In contrast, data processing jobs have precedence constraints that complicate decisions, since delaying an upstream "bottleneck" task to a low-carbon period also blocks downstream tasks, impacting makespan. In this paper, we show that carbon-aware scheduling for data processing benefits from knowledge of both time-varying carbon and precedence constraints. Our main contribution is PCAPS, a carbon-aware scheduler that builds on state-of-the-art scoring or probability-based techniques - in doing so, it explicitly relates the structural importance of each task against the time-varying characteristics of carbon intensity. To illustrate gains due to fine-grained task-level scheduling, we also study CAP, a wrapper for any carbon-agnostic scheduler that generalizes the provisioning ideas of PCAPS. Both techniques allow a user-configurable priority between carbon and makespan, and we give basic analytic results to relate the trade-off between these objectives. Our prototype on a 100-node Kubernetes cluster shows that a moderate configuration of PCAPS reduces carbon footprint by up to 32.9% without significantly impacting total efficiency.en_US
dc.publisherACM|ACM SIGCOMM 2025 Conferenceen_US
dc.relation.isversionofhttps://doi.org/10.1145/3718958.3750478en_US
dc.rightsCreative Commons Attributionen_US
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/en_US
dc.sourceAssociation for Computing Machineryen_US
dc.titleCarbon- and Precedence-Aware Scheduling for Data Processing Clustersen_US
dc.typeArticleen_US
dc.identifier.citationAdam Lechowicz, Rohan Shenoy, Noman Bashir, Mohammad Hajiesmaili, Adam Wierman, and Christina Delimitrou. 2025. Carbon- and Precedence-Aware Scheduling for Data Processing Clusters. In Proceedings of the ACM SIGCOMM 2025 Conference (SIGCOMM '25). Association for Computing Machinery, New York, NY, USA, 1241–1244.en_US
dc.contributor.departmentMassachusetts Institute of Technology. School of Engineeringen_US
dc.identifier.mitlicensePUBLISHER_POLICY
dc.eprint.versionFinal published versionen_US
dc.type.urihttp://purl.org/eprint/type/ConferencePaperen_US
eprint.statushttp://purl.org/eprint/status/NonPeerRevieweden_US
dc.date.updated2025-09-01T07:53:58Z
dc.language.rfc3066en
dc.rights.holderThe author(s)
dspace.date.submission2025-09-01T07:53:59Z
mit.licensePUBLISHER_CC
mit.metadata.statusAuthority Work and Publication Information Neededen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record