Unsupervised learning of sentence embeddings using compositional n-gram features M Pagliardini, P Gupta, M Jaggi NAACL-HLT, 2018, 2017 | 826 | 2017 |
Better word embeddings by disentangling contextual n-gram information P Gupta, M Pagliardini, M Jaggi NAACL-HLT, 2019, 2019 | 44 | 2019 |
Taming gans with lookahead T Chavdarova, M Pagliardini, SU Stich, M Jaggi, F Fleuret ICLR 2021, 2020 | 30* | 2020 |
Agree to disagree: Diversity through disagreement for better transferability M Pagliardini, M Jaggi, F Fleuret, SP Karimireddy ICLR 2023, 2022 | 29 | 2022 |
The peril of popular deep learning uncertainty estimation methods Y Liu, M Pagliardini, T Chavdarova, SU Stich Bayesian Deep Learning workshop, at NeurIPS 2021, 2021 | 12 | 2021 |
Unsupervised learning of sentence embeddings using compositional n-gram features (2017) M Pagliardini, P Gupta, M Jaggi arXiv preprint arXiv:1703.02507, 2017 | 12 | 2017 |
Faster Causal Attention Over Large Sequences Through Sparse Flash Attention M Pagliardini, D Paliotta, M Jaggi, F Fleuret arXiv preprint arXiv:2306.01160, 2023 | 3 | 2023 |
Improving generalization via uncertainty driven perturbations M Pagliardini, G Manunza, M Jaggi, MI Jordan, T Chavdarova arXiv preprint arXiv:2202.05737, 2022 | 2 | 2022 |
Fast causal attention with dynamic sparsity D Paliotta, M Pagliardini, M Jaggi, F Fleuret Workshop on Efficient Systems for Foundation Models@ ICML2023, 2023 | 1 | 2023 |
Revisiting the ACVI Method for Constrained Variational Inequalities T Chavdarova, M Pagliardini, T Yang, MI Jordan arXiv preprint arXiv:2210.15659, 2022 | 1 | 2022 |
Diversity through Disagreement for Better Transferability M Pagliardini, M Jaggi, F Fleuret, SP Karimireddy NeurIPS 2022 Workshop on Distribution Shifts: Connecting Methods and …, 2022 | 1 | 2022 |
Fast Attention Over Long Sequences With Dynamic Sparse Flash Attention M Pagliardini, D Paliotta, M Jaggi, F Fleuret Thirty-seventh Conference on Neural Information Processing Systems, 2023 | | 2023 |
DoGE: Domain Reweighting with Generalization Estimation S Fan, M Pagliardini, M Jaggi arXiv preprint arXiv:2310.15393, 2023 | | 2023 |
CoTFormer: More Tokens With Attention Make Up For Less Depth A Mohtashami, M Pagliardini, M Jaggi arXiv preprint arXiv:2310.10845, 2023 | | 2023 |
Improved Generalization-Robustness Trade-off via Uncertainty Targeted Attacks M Pagliardini, G Manunza, M Jaggi, T Chavdarova | | 2021 |
A Primal-dual Approach for Solving Variational Inequalities with General-form Constraints T Chavdarova, M Pagliardini, T Yang, MI Jordan | | |
MLO J Bachmann Ona, SA Bahreinian, LF Barba Flores, T Bossy, ... | | |