Tianwen Wei
Unknown affiliation
Verified email at kunlun-inc.com
Title
Cited by
Year
Skywork: A more open bilingual foundation model
T Wei, L Zhao, L Zhang, B Zhu, L Wang, H Yang, B Li, C Cheng, W Lü, ...
arXiv preprint arXiv:2310.19341, 2023
Cited by: 87, Year: 2023
FastICA algorithm: five criteria for the optimal choice of the nonlinearity function
A Dermoune, T Wei
IEEE Transactions on Signal Processing 61 (8), 2078-2087, 2013
Cited by: 58, Year: 2013
A convergence and asymptotic analysis of the generalized symmetric FastICA algorithm
T Wei
IEEE Transactions on Signal Processing 63 (24), 6445-6458, 2015
Cited by: 46, Year: 2015
CMATH: Can your language model pass Chinese elementary school math test?
T Wei, J Luan, W Liu, S Dong, B Wang
arXiv preprint arXiv:2306.16636, 2023
Cited by: 38, Year: 2023
Masked conditional random fields for sequence labeling
T Wei, J Qi, S He, S Sun
arXiv preprint arXiv:2103.10682, 2021
Cited by: 24, Year: 2021
Skywork-MoE: A deep dive into training techniques for mixture-of-experts language models
T Wei, B Zhu, L Zhao, C Cheng, B Li, W Lü, P Cheng, J Zhang, X Zhang, ...
arXiv preprint arXiv:2406.06563, 2024
Cited by: 21, Year: 2024
A study of the fixed points and spurious solutions of the deflation-based FastICA algorithm
T Wei
Neural Computing and Applications 28 (1), 13-24, 2017
Cited by: 12, Year: 2017
On the spurious solutions of the FastICA algorithm
T Wei
2014 IEEE Workshop on Statistical Signal Processing (SSP), 161-164, 2014
Cited by: 11, Year: 2014
Sensing tensors with Gaussian filters
S Chrétien, T Wei
IEEE Transactions on Information Theory 63 (2), 843-852, 2016
Cited by: 10, Year: 2016
LongSkywork: A training recipe for efficiently extending context length in large language models
L Zhao, T Wei, L Zeng, C Cheng, L Yang, P Cheng, L Wang, C Li, X Wu, ...
arXiv preprint arXiv:2406.00605, 2024
Cited by: 9, Year: 2024
Skywork-Math: Data Scaling Laws for Mathematical Reasoning in Large Language Models--The Story Goes On
L Zeng, L Zhong, L Zhao, T Wei, L Yang, J He, C Cheng, R Hu, Y Liu, ...
arXiv preprint arXiv:2407.08348, 2024
Cited by: 7, Year: 2024
Von Neumann's trace inequality for tensors
S Chrétien, T Wei
Linear Algebra and its Applications 482, 149-157, 2015
Cited by: 7, Year: 2015
General linear mixed model and signal extraction problem with constraint
A Dermoune, N Rahmania, T Wei
Journal of Multivariate Analysis 105 (1), 311-321, 2012
Cited by: 6, Year: 2012
Optimization hyper-parameter laws for large language models
X Xie, K Ding, S Yan, KC Toh, T Wei
arXiv preprint arXiv:2409.04777, 2024
Cited by: 5, Year: 2024
Asymptotic analysis of the generalized symmetric FastICA algorithm
T Wei
2014 IEEE Workshop on Statistical Signal Processing (SSP), 460-463, 2014
Cited by: 5, Year: 2014
A flexible multi-task model for BERT serving
T Wei, J Qi, S He
arXiv preprint arXiv:2107.05377, 2021
Cited by: 3, Year: 2021
Von Neumann's inequality for tensors
S Chrétien, T Wei
arXiv preprint arXiv:1502.01616, 2015
Cited by: 3, Year: 2015
On the subdifferential of symmetric convex functions of the spectrum for symmetric and orthogonally decomposable tensors
S Chrétien, T Wei
Linear Algebra and its Applications 542, 84-100, 2018
Cited by: 2, Year: 2018
Joint estimation and model order selection for one dimensional ARMA models via convex optimization: a nuclear norm penalization approach
S Chrétien, T Wei, BAH Al-sarray
arXiv preprint arXiv:1508.01681, 2015
Cited by: 2, Year: 2015
Convex recovery of tensors using nuclear norm penalization
S Chrétien, T Wei
Latent Variable Analysis and Signal Separation: 12th International …, 2015
Cited by: 2, Year: 2015