Seeking a Developer with extensive experience in and understanding of the asset management investment business to join our Client Tech team. Job Responsibilities: Be responsible for the architecture design, development, and optimization of client-facing applications. Focus
Job Responsibilities: 1. Responsible for the research, deployment, and optimization of large models, including training, fine-tuning, prompt engineering, and private knowledge accumulation; 2. Undertake the implementation of large models and AI tools in enterprise business
Key Responsibilities: Inference Platform & Optimization: Build and optimize enterprise LLM serving platforms (e.g., vLLM, TensorRT-LLM) using techniques like PagedAttention, continuous batching, and quantization (AWQ/FP8) for high throughput and low latency. GPU Pooling & AI Infra: