Skywork: A more open bilingual foundation model T Wei, L Zhao, L Zhang, B Zhu, L Wang, H Yang, B Li, C Cheng, W Lü, ... arXiv preprint arXiv:2310.19341, 2023 | 87 | 2023 |
FastICA algorithm: five criteria for the optimal choice of the nonlinearity function A Dermoune, T Wei IEEE transactions on signal processing 61 (8), 2078-2087, 2013 | 58 | 2013 |
A convergence and asymptotic analysis of the generalized symmetric FastICA algorithm T Wei IEEE transactions on signal processing 63 (24), 6445-6458, 2015 | 46 | 2015 |
Cmath: Can your language model pass chinese elementary school math test? T Wei, J Luan, W Liu, S Dong, B Wang arXiv preprint arXiv:2306.16636, 2023 | 38 | 2023 |
Masked conditional random fields for sequence labeling T Wei, J Qi, S He, S Sun arXiv preprint arXiv:2103.10682, 2021 | 24 | 2021 |
Skywork-moe: A deep dive into training techniques for mixture-of-experts language models T Wei, B Zhu, L Zhao, C Cheng, B Li, W Lü, P Cheng, J Zhang, X Zhang, ... arXiv preprint arXiv:2406.06563, 2024 | 21 | 2024 |
A study of the fixed points and spurious solutions of the deflation-based FastICA algorithm T Wei Neural Computing and Applications 28 (1), 13-24, 2017 | 12 | 2017 |
On the spurious solutions of the FastICA algorithm T Wei 2014 IEEE Workshop on Statistical Signal Processing (SSP), 161-164, 2014 | 11 | 2014 |
Sensing tensors with Gaussian filters S Chrétien, T Wei IEEE Transactions on Information Theory 63 (2), 843-852, 2016 | 10 | 2016 |
Longskywork: A training recipe for efficiently extending context length in large language models L Zhao, T Wei, L Zeng, C Cheng, L Yang, P Cheng, L Wang, C Li, X Wu, ... arXiv preprint arXiv:2406.00605, 2024 | 9 | 2024 |
Skywork-Math: Data Scaling Laws for Mathematical Reasoning in Large Language Models--The Story Goes On L Zeng, L Zhong, L Zhao, T Wei, L Yang, J He, C Cheng, R Hu, Y Liu, ... arXiv preprint arXiv:2407.08348, 2024 | 7 | 2024 |
Von Neumann's trace inequality for tensors S Chrétien, T Wei Linear Algebra and its Applications 482, 149-157, 2015 | 7 | 2015 |
General linear mixed model and signal extraction problem with constraint A Dermoune, N Rahmania, T Wei Journal of Multivariate Analysis 105 (1), 311-321, 2012 | 6 | 2012 |
Optimization hyper-parameter laws for large language models X Xie, K Ding, S Yan, KC Toh, T Wei arXiv preprint arXiv:2409.04777, 2024 | 5 | 2024 |
Asymptotic analysis of the generalized symmetric FastICA algorithm T Wei 2014 IEEE Workshop on Statistical Signal Processing (SSP), 460-463, 2014 | 5 | 2014 |
A flexible multi-task model for bert serving T Wei, J Qi, S He arXiv preprint arXiv:2107.05377, 2021 | 3 | 2021 |
Von Neumann's inequality for tensors S Chrétien, T Wei arXiv preprint arXiv:1502.01616, 2015 | 3 | 2015 |
On the subdifferential of symmetric convex functions of the spectrum for symmetric and orthogonally decomposable tensors S Chretien, T Wei Linear Algebra and its Applications 542, 84-100, 2018 | 2 | 2018 |
Joint estimation and model order selection for one dimensional ARMA models via convex optimization: a nuclear norm penalization approach S Chrétien, T Wei, BAH Al-sarray arXiv preprint arXiv:1508.01681, 2015 | 2 | 2015 |
Convex recovery of tensors using nuclear norm penalization S Chrétien, T Wei Latent Variable Analysis and Signal Separation: 12th International …, 2015 | 2 | 2015 |