Key Responsibilities: Inference Platform & Optimization: Build and optimize enterprise LLM serving platforms (e.g., vLLM, TensorRT-LLM) using techniques like PagedAttention, continuous batching, and quantization (AWQ/FP8) for high throughput and low latency. GPU Pooling & AI Infra:
Job Responsibilities: 1. Responsible for the research, deployment, and optimization of large models, including training, fine-tuning, prompt engineering, and private knowledge accumulation; 2. Undertake the implementation of large models and AI tools in enterprise business scenarios,
Who we are looking for: We are seeking a Senior Architect Lead with strong technical depth and proven leadership experience in building and evolving enterprise-scale, cloud-native platforms. The ideal candidate is passionate about solving complex business
At HUBER+SUHNER, we design and create essential components that transport power and data through networks. This is how our employees around the globe contribute to a world where people get and stay connected. Head of Design
Veeva Systems is a mission-driven organization and pioneer in industry cloud, helping life sciences companies bring therapies to patients faster. As one of the fastest-growing SaaS companies in history, we surpassed $3B in revenue in our last
About Us: We are brand builders who focus our passion and creativity to build Calvin Klein and TOMMY HILFIGER into the most desirable lifestyle brands in the world and at the same time position PVH as