Senior GPU Compiler Software Development Engineer
Project description
Work on GPU support for OpenAI/Triton, a language and compiler for writing highly efficient custom deep-learning primitives. Collaborate with the open-source community to analyze, develop, test, and deploy performance improvements for neural networks implemented with Triton on GPUs with ROCm.
Responsibilities
Development of new features, support, and optimization of the OpenAI/Triton project for GPUs. Communication with other developers, customers, and project managers. Test implementation, project documentation, and system verification with unit, component, and functional tests.
Must have skills
- Strong C/C++ programming skills
- Experience with compiler internals (LLVM, GCC, or any other)
- Basic Python programming skills
- Experience in performance analysis
Nice to have
- Basic understanding of ML technologies
- Experience with GPGPU (general-purpose GPU) computing (HIP, CUDA, OpenCL, etc.)
- Experience with PyTorch
- Experience with LLVM and MLIR compiler infrastructure, including implementing analyses or optimizations
- Knowledge of ROCm infrastructure
- Experience with CMake and make/Ninja build systems
- GEMM performance fundamentals
- Experience with Docker
Languages
English: B2 Upper Intermediate
What we offer
- Luxoft Training Center (more than 400 professional training programs, the High Performers Club)
- Self-Learning Library
- Internal Mobility (rotation between projects and accounts, new career opportunities)
- Global Relocation
- Mentoring Program (professional career development for leaders)
- Recognition and Evaluation (feedback culture, regular appraisals)
- Professional Communities (join one of our many internal communities: Agile Community, Tech Community, Business Analysis Community, etc.)
- Team Events (take part in the many fun social activities organized by the Luxoft team in Serbia)
Luxoft is committed to fostering a diverse and inclusive workplace.
We show fairness to all throughout our talent acquisition and management process.