Project description:
* working on gpu support for openai/triton — a language and compiler for writing highly efficient custom deep-learning primitives. Work with the open-source community to analyze, develop, test, and deploy performance improvements for neural networks implemented with triton on gpus with rocm.
responsibilities
* :new features development, support and optimization of openai/triton project for gpus. Communication with other developers, customers and project managers. Test implementation, project documentation and verification of system with unit/component/functional tests
.
mandatory skills descriptio
* n:• strong c/c++ programming skil
* ls• experience with compiler internals (llvm, gcc or any othe
* r)• basic python programming skil
* ls• experience in performance analys
is
nice-to-have skills descripti
* on:• basic understanding of ml technolog
* ies• experience with gpgpu (general purpose gpu) computing (hip, cuda, opencl, et
* c.)• experience with pyto
* rch• experience with llvm and mlir compiler infrastructure, analysis or optimizations implementat
* ion• knowledge of rocm infrastruct
* ure• experience in cmake, make/ninja build sys
* tem• gemm performance fundament
* als• experience with doc
ker