Senior Deep Learning Engineer
Toronto (preferred) / North America / Remote Software / Full-Time / Hybrid
US locations Seattle, Austin, SF Bay
We’re looking for best in class engineers to join our existing top-notch team. When you join Untether AI, you will be part of a team that designs, develops and verifies the software that interacts with our chip, collaborating with our hardware engineers and with fellow software engineers in the process. By creating software that fully realizes the capabilities of the hardware, you will help get AI inference to the general populace.
As part of this exceptional team, you are able to - and get excited by - identifying functional/performance bottlenecks and how to alleviate them in order to achieve scalable and reliable software. You excel in an environment with complex software and hardware designs.
We are looking for an experienced Senior Deep Learning Engineer to join our Neural Networks team. The main objective of our team is to make it as seamless as possible for researchers and developers to successfully deploy neural networks to run on our hardware. This involves optimizing a variety of common neural networks to run efficiently on our architectures. The Senior Deep Learning Engineer will also be designing and building a model optimization software pipeline so that our customers’ models can be automatically prepared for on-chip inference. If you want to be working at the bleeding edge of AI, this role is for you!
Requirements
Computer Science, Engineering or related degree
5+ years experience, with a healthy amount of both deep learning and software engineering (2+ years each)
Experience with one or more deep learning frameworks (preferably Tensorflow and/or PyTorch)
Experience working with one or more of the following: CNNs, LSTMs/RNNs, Transformers
Experience reproducing academic work from the field of AI
Strong math background
Strong Python skills
Experience with C/C++
Preferred Skills & Experience
Masters or PhD in Computer Science, Engineering, or related field
Embedded or low-level programming experience
CUDA/OpenGL experience
Experience deploying neural networks to production
Experience with model compression techniques such as quantization, quantization-aware training, knowledge distillation, pruning, etc.
General Skills
Enthusiastic about all things AI
Passionate about solving hard problems and pushing the envelope of what is possible
Comfortable with both big picture thinking and in-the-weeds deep dives
Driven, organized and self-directed
Ability to build projects from the ground up
Comfortable working within a rapidly evolving project
Strong focus on testing and software quality
Team focus and desire to help build the team