Anton Lokhmotov
Anton Lokhmotov
Verified email at - Homepage
Cited by
Cited by
Automatically tuning sparse matrix-vector multiplication for GPU architectures
A Monakov, A Lokhmotov, A Avetisyan
International Conference on High-Performance Embedded Architectures and …, 2010
Mlperf inference benchmark
VJ Reddi, C Cheng, D Kanter, P Mattson, G Schmuelling, CJ Wu, ...
2020 ACM/IEEE 47th Annual International Symposium on Computer Architecture …, 2020
Pencil: A platform-neutral compute intermediate language for accelerator programming
R Baghdadi, U Beaugnon, A Cohen, T Grosser, M Kruse, C Reddy, ...
2015 International Conference on Parallel Architecture and Compilation (PACT …, 2015
Automatically generating and tuning GPU code for sparse matrix-vector multiplication from a high-level representation
D Grewe, A Lokhmotov
Proceedings of the Fourth Workshop on General Purpose Processing on Graphics …, 2011
Deriving efficient data movement from decoupled access/execute specifications
LW Howes, A Lokhmotov, AF Donaldson, PHJ Kelly
International Conference on High-Performance Embedded Architectures and …, 2009
Benchmarking TinyML systems: Challenges and direction
CR Banbury, VJ Reddi, M Lam, W Fu, A Fazel, J Holleman, X Huang, ...
arXiv preprint arXiv:2003.04821, 2020
Collective mind: Towards practical and collaborative auto-tuning
G Fursin, R Miceli, A Lokhmotov, M Gerndt, M Baboulin, AD Malony, ...
Scientific Programming 22 (4), 309-329, 2014
Collective Knowledge: towards R&D sustainability
G Fursin, A Lokhmotov, E Plowman
2016 Design, Automation & Test in Europe Conference & Exhibition (DATE), 864-869, 2016
VOBLA: A vehicle for optimized basic linear algebra
U Beaugnon, A Kravets, S Van Haastregt, R Baghdadi, D Tweed, J Absar, ...
Proceedings of the 2014 SIGPLAN/SIGBED conference on Languages, compilers …, 2014
PENCIL: Towards a platform-neutral compute intermediate language for DSLs
R Baghdadi, A Cohen, S Guelton, S Verdoolaege, J Inoue, T Grosser, ...
arXiv preprint arXiv:1302.5586, 2013
PENCIL language specification
R Baghdadi, A Cohen, T Grosser, S Verdoolaege, A Lokhmotov, J Absar, ...
INRIA, 2015
Auto-parallelisation of Sieve C++ programs
A Donaldson, C Riley, A Lokhmotov, A Cook
European Conference on Parallel Processing, 18-27, 2007
Delayed side-effects ease multi-core programming
A Lokhmotov, A Mycroft, A Richards
European Conference on Parallel Processing, 641-650, 2007
A Collective Knowledge workflow for collaborative research into multi-objective autotuning and machine learning techniques
G Fursin, A Lokhmotov, D Savenko, E Upton
arXiv preprint arXiv:1801.08024, 2018
Collective Mind, Part II: Towards performance-and cost-aware software engineering as a natural science
G Fursin, A Memon, C Guillon, A Lokhmotov
arXiv preprint arXiv:1506.06256, 2015
Optimizing OpenCL kernels for the ARM Mali-T600 GPUs
J Gronqvist, A Lokhmotov
GPU Pro 360 Guide to Mobile Devices, 167-198, 2018
Generating GPU code from a high-level representation for image processing kernels
R Membarth, A Lokhmotov, J Teich
European Conference on Parallel Processing, 270-280, 2011
Towards metaprogramming for parallel systems on a chip
L Howes, A Lokhmotov, AF Donaldson, PHJ Kelly
European Conference on Parallel Processing, 36-45, 2009
Optimal bit-reversal using vector permutations
A Lokhmotov, A Mycroft
Proceedings of the nineteenth annual ACM symposium on Parallel algorithms …, 2007
Multi-objective autotuning of mobilenets across the full software/hardware stack
A Lokhmotov, N Chunosov, F Vella, G Fursin
Proceedings of the 1st on Reproducible Quality-Efficient Systems Tournament …, 2018
The system can't perform the operation now. Try again later.
Articles 1–20