Software Engineer Job at Acceler8 Talent, Boston, MA

  • Acceler8 Talent
  • Boston, MA

Job Description

Software Engineer

We’re hiring a Software Engineer to lead the optimization and deployment of machine learning inference across a range of modern hardware platforms. This role is critical as we bring our AI models into real-world applications that require high throughput and low latency.

You’ll be joining a team focused on building and shipping high-performance AI systems, with a strong emphasis on practical deployment and optimization. Our engineers work close to the hardware, blending systems expertise with machine learning fluency to deliver reliable and efficient solutions.

The Software Engineer will take ownership of optimizing inference pathways across GPU, CPU, and emerging accelerator platforms. You’ll be expected to work independently, experiment with performance techniques, and interface closely with model and systems teams. The work is technical, hands-on, and has direct impact.

What we can offer you

  • Deep ownership over core inference systems
  • Collaboration with experts in systems, ML, and compiler optimization
  • Access to a wide range of hardware accelerators and software stacks
  • A tight feedback loop from experimentation to production
  • Clear, technical impact on real-world AI deployment
  • Competitive salary and equity package

Key responsibilities

  • Optimize inference stacks for GPU, CPU, and NPU architectures
  • Build and maintain performant inference pipelines using CUDA, C++, and Triton
  • Interface with Python/PyTorch-based ML models to ensure smooth deployment
  • Tune low-level primitives for maximum hardware utilization
  • Deliver end-to-end optimized inference setups with minimal supervision
  • Stay current with developments in quantization, decoding strategies, and model execution
  • Improve throughput, minimize latency, and adapt solutions across diverse environments

Relevant keywords: Training Infrastructure Engineer, inference optimization, CUDA, Triton, C++, GPU, CPU, NPU, PyTorch, vLLM, DeepSpeed, ggml, low-latency, high-throughput, model deployment, ML systems
