Multitask learning for cognitive sciences triplet analysis
Autoři
Stambrouski, T.; da Silva Alves, R.
Rok
2025
Publikováno
Expert Systems with Applications. 2025, 267 1-10. ISSN 0957-4174.
Typ
Článek
Anotace
The triplet-based odd-one-out problem, which involves trials where human subjects are asked to select the most different concept among three, is a well-studied task in cognitive sciences. With the release of a large triplet-based dataset, THINGS, there has been a recent surge in the popularity of machine learning models aimed at learning mathematical representations of object concepts, such as SPoSE, VICE, and CARE. The first two models learn representations by maximizing the similarity between the two most similar objects, while the latter diverges by directly learning the odd-one-out, making its embedding more distant. No prior attempts have integrated both paradigms, which are important for understanding object representation in cognitive science. In this paper, we propose MASTER, a multitask learning method for the triplet problem that encapsulates both paradigms. Our results demonstrate that our method not only better predicts the odd-one-out object but also provides insightful representations for studying these concepts. Furthermore, we studied the conditions under which each model performs better, offering valuable insights for future research on how these paradigms affect human understanding of object concepts.
Generalization Analysis of Deep Non-linear Matrix Completion
Autoři
Ledent, A.; da Silva Alves, R.
Rok
2024
Publikováno
Proceedings of Machine Learning Research. Proceedings of Machine Learning Research, 2024. p. 26290-26360. vol. 235. ISSN 2640-3498.
Typ
Stať ve sborníku
Pracoviště
Anotace
We provide generalization bounds for matrix completion with Schatten 𝑝
quasi-norm constraints, which is equivalent to deep matrix factorization with Frobenius constraints. In the uniform sampling regime, the sample complexity scales like 𝑂˜(𝑟𝑛)
where 𝑛 is the size of the matrix and 𝑟 is a constraint of the same order as the ground truth rank in the isotropic case. In the distribution-free setting, the bounds scale as 𝑂˜(𝑟1−𝑝2𝑛1+𝑝2), which reduces to the familiar 𝑟√𝑛32 for 𝑝=1 . Furthermore, we provide an analogue of the weighted trace norm for this setting which brings the sample complexity down to 𝑂˜(𝑛𝑟) in all cases. We then present a non-linear model, Functionally Rescaled Matrix Completion (FRMC) which applies a single trainable function from ℝ→ℝ
to each entry of a latent matrix, and prove that this adds only negligible terms of the overall sample complexity, whilst experiments demonstrate that this simple model improvement already leads to significant gains on real data. We also provide extensions of our results to various neural architectures, thereby providing the first comprehensive uniform convergence PAC analysis of neural network matrix completion.
Recommendations with minimum exposure guarantees: A post-processing framework
Autoři
Lopes, R.; da Silva Alves, R.; Ledent, A.; Santos, R.L.T.; Kloft, M.
Rok
2024
Publikováno
Expert Systems with Applications. 2024, 236 1-9. ISSN 1873-6793.
Typ
Článek
Pracoviště
Anotace
Relevance-based ranking is a popular ingredient in recommenders, but it frequently struggles to meet fairness criteria because social and cultural norms may favor some item groups over others. For instance, some items might receive lower ratings due to some sort of bias (e.g. gender bias). A fair ranking should balance the exposure of items from advantaged and disadvantaged groups. To this end, we propose a novel post-processing framework to produce fair, exposure-aware recommendations. Our approach is based on an integer linear programming model maximizing the expected utility while satisfying a minimum exposure constraint. The model has fewer variables than previous work and thus can be deployed to larger datasets and allows the organization to define a minimum level of exposure for groups of items. We conduct an extensive empirical evaluation indicating that our new framework can increase the exposure of items from disadvantaged groups at a small cost of recommendation accuracy
Regionalization-Based Collaborative Filtering: Harnessing Geographical Information in Recommenders
Autoři
Rok
2024
Publikováno
ACM Transactions on Spatial Algorithms and Systems. 2024, 10(2), 1-23. ISSN 2374-0361.
Typ
Článek
DOI
Pracoviště
Anotace
Regionalization, also known as spatially constrained clustering, is an unsupervised machine learning technique used to identify and define spatially contiguous regions. In this work, we introduce a methodology to regionalize recommendation systems (RSs) based on a collaborative filtering approach. Two main challenges arise when performing regionalization based on users' preferences in RSs: (1) unstructured data, as interactions are often scarce and observed on a smaller scale; and (2) the difficulty of evaluation of the quality of the clustering results. To address these challenges, our methodology relies on inductive matrix completion (IMC), a fundamental approach to recover unknown entries of a rating matrix while utilizing region information to extract a region-based feature matrix. With this feature matrix, our method becomes adaptive and seamlessly integrates with various regionalization algorithms to create regionalization candidates. This enables us to derive more accurate recommendations that consider regionalized effects and discover interesting patterns in localized user behavior. We experimentally evaluate our model on synthetic datasets to demonstrate its efficacy in settings where our underlying assumptions are correct. Furthermore, we present a real-world case study illustrating the interpretable information the model can derive in terms of regionalized recommendation relevance.
Uncertainty-Adjusted Recommendation via Matrix Factorization With Weighted Losses
Autoři
da Silva Alves, R.; Ledent, A.; Kloft, M.
Rok
2024
Publikováno
IEEE Transactions on Neural Networks and Learning Systems. 2024, 35(11), 15624-15637. ISSN 2162-237X.
Typ
Článek
Pracoviště
Anotace
In a recommender systems (RSs) dataset, observed ratings are subject to unequal amounts of noise. Some users might be consistently more conscientious in choosing the ratings they provide for the content they consume. Some items may be very divisive and elicit highly noisy reviews. In this article, we perform a nuclear-norm-based matrix factorization method which relies on side information in the form of an estimate of the uncertainty of each rating. A rating with a higher uncertainty is considered more likely to be erroneous or subject to large amounts of noise, and therefore more likely to mislead the model. Our uncertainty estimate is used as a weighting factor in the loss we optimize. To maintain the favorable scaling and theoretical guarantees coming with nuclear norm regularization even in this weighted context, we introduce an adjusted version of the trace norm regularizer which takes the weights into account. This regularization strategy is inspired from the weighted trace norm which was introduced to tackle nonuniform sampling regimes in matrix completion. Our method exhibits state-of-the-art performance on both synthetic and real life datasets in terms of various performance measures, confirming that we have successfully used the auxiliary information extracted.
Unraveling the Dynamics of Stable and Curious Audiences in Web Systems
Autoři
da Silva Alves, R.; Ledent, A.; Assunção, R.; Vaz-De-Melo, P.; Kloft, M.
Rok
2024
Publikováno
Proceedings of the ACM Web Conference 2024. New York: ACM, 2024. p. 2464-2475. ISBN 979-8-4007-0171-9.
Typ
Stať ve sborníku
Pracoviště
Anotace
We propose the Burst-Induced Poisson Process (BPoP), a model designed to analyze time series data such as feeds or search queries. BPoP can distinguish between the slowly-varying regular activity of a stable audience and the bursty activity of a curious audience, often seen in viral threads. Our model consists of two hidden, interacting processes: a self-feeding process (SFP) that generates bursty behavior related to viral threads, and a non-homogeneous Poisson process (NHPP) with step function intensity that is influenced by the bursts from the SFP. The NHPP models the normal background behavior, driven solely by the overall popularity of the topic among the stable audience. Through extensive empirical work, we have demonstrated that our model fits and characterizes a large number of real datasets more effectively than state-of-the-art models. Most importantly, BPoP can quantify the stable audience of media channels over time, serving as a valuable indicator of their popularity.
Bridging Offline-Online Evaluation with a Time-dependent and Popularity Bias-free Offline Metric for Recommenders
Autoři
Rok
2023
Publikováno
Proceedings of EvalRS: A Rounded Evaluation Of Recommender Systems 2023. CEUR-WS.org, 2023. ISSN 1613-0073.
Typ
Stať ve sborníku
Pracoviště
Anotace
The evaluation of recommendation systems is a complex task. The offline and online evaluation metrics for recommender systems are ambiguous in their true objectives. The majority of recently published papers benchmark their methods using ill-posed offline evaluation methodology that often fails to predict true online performance. Because of this, the impact that academic research has on the industry is reduced. The aim of our research is to investigate and compare the online performance of offline evaluation metrics. We show that penalizing popular items and considering the time of transactions during the evaluation significantly improves our ability to choose the best recommendation model for a live recommender system. Our results, averaged over five large-size real-world live data procured from recommenders, aim to help the academic community to understand better offline evaluation and optimization criteria that are more relevant for real applications of recommender systems.
Generalization Bounds for Inductive Matrix Completion in Low-noise Settings
Autoři
Ledent, A.; da Silva Alves, R.; Lei, Y.; Guermeur, Y.; Kloft, M.
Rok
2023
Publikováno
Proceedings of the 37th AAAI Conference on Artificial Intelligence. Menlo Park: AAAI Press, 2023. p. 8447-8455. vol. 37. ISSN 2374-3468. ISBN 978-1-57735-880-0.
Typ
Stať ve sborníku
Pracoviště
Anotace
We study inductive matrix completion (matrix completion with side information) under an i.i.d. subgaussian noise assumption at a low noise regime, with uniform sampling of the entries. We obtain for the first time generalization bounds with the following three properties: (1) they scale like the standard deviation of the noise and in particular approach zero in the exact recovery case; (2) even in the presence of noise, they converge to zero when the sample size approaches infinity; and (3) for a fixed dimension of the side information, they only have a logarithmic dependence on the size of the matrix. Differently from many works in approximate recovery, we present results both for bounded Lipschitz losses and for the absolute loss, with the latter relying on Talagrand-type inequalities. The proofs create a bridge between two approaches to the theoretical analysis of matrix completion, since they consist in a combination of techniques from both the exact recovery literature and the approximate recovery literature.
Uncertainty-adjusted Inductive Matrix Completion with Graph Neural Networks
Autoři
Kasalický, P.; Ledent, A.; da Silva Alves, R.
Rok
2023
Publikováno
RecSys '23: Proceedings of the 17th ACM Conference on Recommender Systems. New York: Association for Computing Machinery, 2023. p. 1169-1174. ISBN 979-8-4007-0241-9.
Typ
Stať ve sborníku
Pracoviště
Anotace
We propose a robust recommender systems model which performs matrix completion and a ratings-wise uncertainty estimation jointly. Whilst the prediction module is purely based on an implicit low-rank assumption imposed via nuclear norm regularization, our loss function is augmented by an uncertainty estimation module which learns an anomaly score for each individual rating via a Graph Neural Network: data points deemed more anomalous by the GNN are downregulated in the loss function used to train the low-rank module. The whole model is trained in an end-to-end fashion, allowing the anomaly detection module to tap on the supervised information available in the form of ratings. Thus, our model’s predictors enjoy the favourable generalization properties that come with being chosen from small function space (i.e., low-rank matrices), whilst exhibiting the robustness to outliers and flexibility that comes with deep learning methods. Furthermore, the anomaly scores themselves contain valuable qualitative information. Experiments on various real-life datasets demonstrate that our model outperforms standard matrix completion and other baselines, confirming the usefulness of the anomaly detection module.
Scalable Linear Shallow Autoencoder for Collaborative Filtering
Autoři
Rok
2022
Publikováno
RecSys '22: Proceedings of the 16th ACM Conference on Recommender Systems. New York: Association for Computing Machinery, 2022. p. 604-609. ISBN 978-1-4503-9278-5.
Typ
Stať ve sborníku
Pracoviště
Anotace
Recently, the RS research community has witnessed a surge in popularity for shallow autoencoder-based CF methods. Due to its straightforward implementation and high accuracy on item retrieval metrics, EASE is potentially the most prominent of these models. Despite its accuracy and simplicity, EASE cannot be employed in some real-world recommender system applications due to its inability to scale to huge interaction matrices. In this paper, we proposed ELSA, a scalable shallow autoencoder method for implicit feedback recommenders. ELSA is a scalable autoencoder in which the hidden layer is factorizable into a low-rank plus sparse structure, thereby drastically lowering memory consumption and computation time. We conducted a comprehensive offline experimental section that combined synthetic and several real-world datasets. We also validated our strategy in an online setting by comparing ELSA to baselines in a live recommender system using an A/B test. Experiments demonstrate that ELSA is scalable and has competitive performance. Finally, we demonstrate the explainability of ELSA by illustrating the recovered latent space.