Posted 1 month, 1 week ago
NVIDIA | vLLM + SGLang | Deep Learning Inference | Remote (North America preferred) Hi everyone — I’m Akbar, Senior Manager of Deep Learning Inference Software at NVIDIA. I lead our engineering efforts around vLLM and SGLang, two of the most widely …
Remote (North America preferred), Santa Clara, CA
Posted 1 month, 1 week ago
Building coding agent systems and an agentic cloud. Small senior team (ex-DeepMind, OpenAI, Microsoft Research, Amazon, Cambridge University; multiple PhDs). Work includes distributed systems, OS/sandboxing, ML and LLM inference/post-training, long-…
London, UK
Posted 5 months, 1 week ago
We're building an AI inference service that uses confidential computing to ensure that prompts remain encrypted end-to-end. Our core engineering stack includes Go, Kubernetes, gRPC, and vLLM, with some web development using Next.js and Svelte. Most …
Posted 7 months, 1 week ago
We are a team of ~20 people, building cutting-edge open-source tools for confidential computing and a 'confidential GenAI' service on top of those. Our products span unusually far across the tech stack, starting at measured boot, through Kubernetes …
Posted 7 months, 1 week ago
We build Africa’s leading identity verification service, expanding access to secure financial services across the continent in a fast-growing market. We have built proprietary machine learning algorithms and a technology platform to cater for all sk…
Remote (US, Europe, Africa)
Posted 10 months ago
We’re democratizing SME financing in MENA by building the Moody’s for micro-financing. Our team comes from Revolut, NVIDIA, Ubisoft, Bloomberg, OakNorth, and other leading companies. We’re looking for an MLOps wizard to help us scale our ML infrastr…
Riyadh, Saudi Arabia; Global Remote