Follow
Matteo Pagliardini
Title
Cited by
Cited by
Year
Unsupervised learning of sentence embeddings using compositional n-gram features
M Pagliardini, P Gupta, M Jaggi
NAACL-HLT, 2018, 2017
8262017
Better word embeddings by disentangling contextual n-gram information
P Gupta, M Pagliardini, M Jaggi
NAACL-HLT, 2019, 2019
442019
Taming gans with lookahead
T Chavdarova, M Pagliardini, SU Stich, M Jaggi, F Fleuret
ICLR 2021, 2020
30*2020
Agree to disagree: Diversity through disagreement for better transferability
M Pagliardini, M Jaggi, F Fleuret, SP Karimireddy
ICLR 2023, 2022
292022
The peril of popular deep learning uncertainty estimation methods
Y Liu, M Pagliardini, T Chavdarova, SU Stich
Bayesian Deep Learning workshop, at NeurIPS 2021, 2021
122021
Unsupervised learning of sentence embeddings using compositional n-gram features (2017)
M Pagliardini, P Gupta, M Jaggi
arXiv preprint arXiv:1703.02507, 2017
122017
Faster Causal Attention Over Large Sequences Through Sparse Flash Attention
M Pagliardini, D Paliotta, M Jaggi, F Fleuret
arXiv preprint arXiv:2306.01160, 2023
32023
Improving generalization via uncertainty driven perturbations
M Pagliardini, G Manunza, M Jaggi, MI Jordan, T Chavdarova
arXiv preprint arXiv:2202.05737, 2022
22022
Fast causal attention with dynamic sparsity
D Paliotta, M Pagliardini, M Jaggi, F Fleuret
Workshop on Efficient Systems for Foundation Models@ ICML2023, 2023
12023
Revisiting the ACVI Method for Constrained Variational Inequalities
T Chavdarova, M Pagliardini, T Yang, MI Jordan
arXiv preprint arXiv:2210.15659, 2022
12022
Diversity through Disagreement for Better Transferability
M Pagliardini, M Jaggi, F Fleuret, SP Karimireddy
NeurIPS 2022 Workshop on Distribution Shifts: Connecting Methods and …, 2022
12022
Fast Attention Over Long Sequences With Dynamic Sparse Flash Attention
M Pagliardini, D Paliotta, M Jaggi, F Fleuret
Thirty-seventh Conference on Neural Information Processing Systems, 2023
2023
DoGE: Domain Reweighting with Generalization Estimation
S Fan, M Pagliardini, M Jaggi
arXiv preprint arXiv:2310.15393, 2023
2023
CoTFormer: More Tokens With Attention Make Up For Less Depth
A Mohtashami, M Pagliardini, M Jaggi
arXiv preprint arXiv:2310.10845, 2023
2023
Improved Generalization-Robustness Trade-off via Uncertainty Targeted Attacks
M Pagliardini, G Manunza, M Jaggi, T Chavdarova
2021
A Primal-dual Approach for Solving Variational Inequalities with General-form Constraints
T Chavdarova, M Pagliardini, T Yang, MI Jordan
MLO
J Bachmann Ona, SA Bahreinian, LF Barba Flores, T Bossy, ...
The system can't perform the operation now. Try again later.
Articles 1–17