dc.contributor.author | Lechowicz, Adam | |
dc.contributor.author | Shenoy, Rohan | |
dc.contributor.author | Bashir, Noman | |
dc.contributor.author | Hajiesmaili, Mohammad | |
dc.contributor.author | Wierman, Adam | |
dc.contributor.author | Delimitrou, Christina | |
dc.date.accessioned | 2025-09-10T19:38:55Z | |
dc.date.available | 2025-09-10T19:38:55Z | |
dc.date.issued | 2025-08-27 | |
dc.identifier.isbn | 979-8-4007-1524-2 | |
dc.identifier.uri | https://hdl.handle.net/1721.1/162640 | |
dc.description | SIGCOMM ’25, Coimbra, Portugal | en_US |
dc.description.abstract | As large-scale data processing workloads continue to grow, their carbon footprint raises concerns. Prior research on carbon-aware schedulers has focused on shifting computation to align with the availability of low-carbon energy, but these approaches assume that each task can be executed independently. In contrast, data processing jobs have precedence constraints that complicate decisions, since delaying an upstream "bottleneck" task to a low-carbon period also blocks downstream tasks, impacting makespan. In this paper, we show that carbon-aware scheduling for data processing benefits from knowledge of both time-varying carbon and precedence constraints. Our main contribution is PCAPS, a carbon-aware scheduler that builds on state-of-the-art scoring or probability-based techniques - in doing so, it explicitly relates the structural importance of each task against the time-varying characteristics of carbon intensity. To illustrate gains due to fine-grained task-level scheduling, we also study CAP, a wrapper for any carbon-agnostic scheduler that generalizes the provisioning ideas of PCAPS. Both techniques allow a user-configurable priority between carbon and makespan, and we give basic analytic results to relate the trade-off between these objectives. Our prototype on a 100-node Kubernetes cluster shows that a moderate configuration of PCAPS reduces carbon footprint by up to 32.9% without significantly impacting total efficiency. | en_US |
dc.publisher | ACM|ACM SIGCOMM 2025 Conference | en_US |
dc.relation.isversionof | https://doi.org/10.1145/3718958.3750478 | en_US |
dc.rights | Creative Commons Attribution | en_US |
dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ | en_US |
dc.source | Association for Computing Machinery | en_US |
dc.title | Carbon- and Precedence-Aware Scheduling for Data Processing Clusters | en_US |
dc.type | Article | en_US |
dc.identifier.citation | Adam Lechowicz, Rohan Shenoy, Noman Bashir, Mohammad Hajiesmaili, Adam Wierman, and Christina Delimitrou. 2025. Carbon- and Precedence-Aware Scheduling for Data Processing Clusters. In Proceedings of the ACM SIGCOMM 2025 Conference (SIGCOMM '25). Association for Computing Machinery, New York, NY, USA, 1241–1244. | en_US |
dc.contributor.department | Massachusetts Institute of Technology. School of Engineering | en_US |
dc.identifier.mitlicense | PUBLISHER_POLICY | |
dc.eprint.version | Final published version | en_US |
dc.type.uri | http://purl.org/eprint/type/ConferencePaper | en_US |
eprint.status | http://purl.org/eprint/status/NonPeerReviewed | en_US |
dc.date.updated | 2025-09-01T07:53:58Z | |
dc.language.rfc3066 | en | |
dc.rights.holder | The author(s) | |
dspace.date.submission | 2025-09-01T07:53:59Z | |
mit.license | PUBLISHER_CC | |
mit.metadata.status | Authority Work and Publication Information Needed | en_US |