dc.contributor.author | Gusak, Danil | |
dc.contributor.author | Mezentsev, Gleb | |
dc.contributor.author | Oseledets, Ivan | |
dc.contributor.author | Frolov, Evgeny | |
dc.date.accessioned | 2025-06-12T20:39:37Z | |
dc.date.available | 2025-06-12T20:39:37Z | |
dc.date.issued | 2024-10-21 | |
dc.identifier.isbn | 979-8-4007-0436-9 | |
dc.identifier.uri | https://hdl.handle.net/1721.1/159399 | |
dc.description.abstract | Scalability is a major challenge in modern recommender systems. In sequential recommendations, full Cross-Entropy (CE) loss achieves state-of-the-art recommendation quality but consumes excessive GPU memory with large item catalogs, limiting its practicality. Using a GPU-efficient locality-sensitive hashing-like algorithm for approximating large tensor of logits, this paper introduces a novel RECE (REduced Cross-Entropy) loss. RECE significantly reduces memory consumption while allowing one to enjoy the state-of-the-art performance of full CE loss. Experimental results on various datasets show that RECE cuts training peak memory usage by up to 12 times compared to existing methods while retaining or exceeding performance metrics of CE loss. The approach also opens up new possibilities for large-scale applications in other domains. | en_US |
dc.publisher | ACM|Proceedings of the 33rd ACM International Conference on Information and Knowledge Management | en_US |
dc.relation.isversionof | https://doi.org/10.1145/3627673.3679986 | en_US |
dc.rights | Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use. | en_US |
dc.source | Association for Computing Machinery | en_US |
dc.title | RECE: Reduced Cross-Entropy Loss for Large-Catalogue Sequential Recommenders | en_US |
dc.type | Article | en_US |
dc.identifier.citation | Gusak, Danil, Mezentsev, Gleb, Oseledets, Ivan and Frolov, Evgeny. 2024. "RECE: Reduced Cross-Entropy Loss for Large-Catalogue Sequential Recommenders." | |
dc.identifier.mitlicense | PUBLISHER_POLICY | |
dc.eprint.version | Final published version | en_US |
dc.type.uri | http://purl.org/eprint/type/ConferencePaper | en_US |
eprint.status | http://purl.org/eprint/status/NonPeerReviewed | en_US |
dc.date.updated | 2025-06-01T07:47:14Z | |
dc.language.rfc3066 | en | |
dc.rights.holder | The author(s) | |
dspace.date.submission | 2025-06-01T07:47:14Z | |
mit.license | PUBLISHER_POLICY | |
mit.metadata.status | Authority Work and Publication Information Needed | en_US |