Refine Reset All
Recent Searches clear
Sort by
Job Type
Employer/Recruiter
Experience
Date Posted
Job Type
Employer/Recruiter
Experience
All Filters

Devops Aws Jobs In 上海 - 22 Job Positions Available

1 – 18 of 22 jobs
智元创新(上海)科技有限公司 jobs

DevOps / SRE 实习生上海实习职位描述参与 CI/CD 流水线的搭建、优化与日常维护(Jenkins / GitHub Actions / ArgoCD)协助维护 Kubernetes 集群,处理 Pod 调度、资源配额、健康检查等日常问题 参与监控告警体系建设,配置 Prometheus / Grafana 告警规则和 Dashboard 协助故障排查与复盘,输出 Postmortem 文档 编写和维护基础设施自动化脚本(Shell / Python) 参与值班轮班,学习线上问题响应和处置流程 整理内部运维文档与 Runbook 职位要求在校本科或研究生,计算机、软件工程、网络工程等相关专业熟悉 Linux 基础命令,能独立完成文件管理、进程排查、网络诊断有至少一门编程语言基础:Python / Go / Shell理解

智元创新(上海)科技有限公司  25 days ago
Nagarro jobs

Job Description CANDIDATE PROFILE Required Qualifications • 5+ years of experience in platform engineering, DevOps, or infrastructure automation roles • Expert-level proficiency in Terraform, including advanced features such as workspaces, remote state management, and module composition •

Nagarro  10 days ago
Renesas Electronics jobs

Job Description Have clear and solid relationships with software development departments. Plan and document work and projects. Build and continuously optimize CI/CD process and streamline automation effort for server provisioning and applications deployment. Build a resilient

Renesas Electronics  3 days ago
酷睿程 (CARIZON) jobs

后台开发工程师上海社招全职互联网 / 电子 / 网游职位描述1、负责研发效能平台后端系统的架构设计、开发和优化,支撑高并发、高可用的业务场景。2、使用Golang开发高性能微服务,优化API响应速度,保障系统稳定性和扩展性。3、负责数据库设计及优化,熟练使用 MySQL/PostgreSQL,并合理应用 Redis/Kafka 等中间件提升系统性能。4、与前端、测试、产品团队协作,完成需求分析、接口设计及联调,确保项目按时高质量交付。5、持续优化系统架构,提升代码质量,参与技术方案评审,推动DevOps及自动化测试落地。6、结合业务发展,能够独立进行项目规划与设计。职位要求技术能力1、5年以上后端开发经验,至少3年Go语言开发经验,熟悉 Gin/Echo/Fiber 等框架,熟悉领域驱动设计和应用2、熟悉 RESTful API 设计,了解 gRPC 或 GraphQL 加分。3、精通 MySQL/PostgreSQL,熟悉MongoDB,掌握索引优化、事务处理及分库分表策略。4、熟悉 Redis 缓存、消息队列(Kafka/RabbitMQ)及分布式锁应用。5、有微服务架构经验,熟悉使用 Docker/Kubernetes 及云服务(AWS/阿里云/腾讯云)。加分项1、拥有效能平台开发经验者优先(如流水线、监控、日志、测试管理等),如有智能驾驶或互联网相关行业更佳。2、熟悉 Prometheus/Grafana 监控,或 ELK 日志分析。3、了解 CI/CD 流程,有 Jenkins/argoCD等实践经验。4、熟悉使用大模型工具提升工作效率。软技能1、良好的编码习惯,追求高性能、可维护的代码。2、逻辑清晰,能独立解决问题,具备良好的团队协作精神投递...

酷睿程 (CARIZON)  26 days ago
Polymer Capital jobs

Company IntroductionPolymer Capital Management is a market-neutral, multi-manager investment platform focused on Asia. Our goal is to discover and nurture the regions top investment talent by combining established institutional support with extensive local financial market knowledge.

Polymer Capital  26 days ago
Sia jobs

Company Description Sia is a next-generation, global management consulting group. Founded in 1999, we were born digital. Today our strategy and management capabilities are augmented by data science, enhanced by creativity and driven by responsibility. We’re

Sia  20 days ago
DXC Technology jobs

Job Description: • 具备大型数据平台及云环境(如Azure、AWS、Snowflake、Databricks)架构和管理经验,具备敏捷、DevOps和DataOps实践的实际知识。 • 熟练使用容器化和云自动化工具(Docker、Kubernetes、Terraform等),并对数据生命周期管理和数据治理最佳实践有深刻理解。 • 云计算平台(如AWS、Azure、GCP) • 基础设施即代码(IaC)工具(如 Terraform、CloudFormation) • Kubernetes和容器化(例如Docker) • CI/CD工具(例如,Jenkins、GitLab CI、GitHub Actions) • 脚本和编程语言(例如 Python、Go 语言) • 系统架构与设计 • 解决问题和故障排除 • 沟通与协作 At DXC Technology, we believe strong connections and community are key

DXC Technology  19 days ago
Bambu Lab jobs

资深运维开发工程师 热招 上海 全职 研发 职位描述 我们正在寻找一位兼具 稳定性治理能力 与 运维开发能力 的资深工程师,加入云端 SRE 团队,负责支撑业务增长阶段下的多云、多集群云原生基础设施稳定运行与持续优化。你将面向业务增长带来的稳定性、性能、容量和成本挑战,参与 Kubernetes 集群治理、Elasticsearch 等关键基础组件优化、线上故障治理、容量规划和变更风险控制。同时,你也将推动自动化运维平台和工具链建设,将线上问题沉淀为平台能力、工程规范和长期机制,提升研发、数据、安全、合规等团队的协作效率。1. 稳定性治理:负责云端基础设施及关键基础组件的稳定性建设,定位并解决线上性能瓶颈、容量风险和可用性问题,保障业务系统稳定运行;2. 性能优化:针对 Elasticsearch 等核心组件开展性能调优、容量评估、资源治理和架构优化,提升系统吞吐、查询效率和服务可靠性;3. 云原生基础设施:负责 Kubernetes 集群及 CNCF 云原生生态组件的日常运维、架构优化和稳定性提升,支撑并保障多个 Kubernetes 集群的可靠运行;4. 多云平台治理:参与阿里云 ACK、AWS EKS、GCP GKE 等多云托管 Kubernetes 环境的运维、治理和优化,提升多云环境下的可观测性、弹性、成本效率和运维一致性;5. 故障与变更管理:负责线上告警处理、故障应急、根因分析、复盘改进和生产变更管理,建立可持续的稳定性改进机制;6. 自动化与平台建设:开发和维护自动化运维平台、工具链和流程系统,提升发布、变更、巡检、告警、权限、资源交付等环节的自动化水平;7. 跨团队协作:与后端研发、数据、安全、合规等团队紧密协作,推动基础设施问题定位、流程规范、权限治理、合规要求和稳定性改进落地。

Bambu Lab  12 days ago
Reckitt jobs

We are Reckitt Home to the worlds best loved and trusted hygiene, health, and nutrition brands. Our purpose defines why we exist: to protect, heal and nurture in the relentless pursuit of a cleaner, healthier world.

Reckitt  10 days ago
Scopely jobs

Scopely is looking for a Server Engineer, Tooling to join an unannounced title based in our Shanghai office (5 days in the office). At Scopely, we care deeply about what we do and want to inspire

Scopely  5 days ago
Nagarro jobs

Job Description Location: Shanghai China Employment Type: Full-Time About the Role seeking a Product Owner to own and evolve core hotel business systems (including PMS, POS, CRM, loyalty, and back-office operations) across global properties. You will

Nagarro  3 days ago
VAST jobs

高级 SRE 工程师 (AI-INF-基础设施) Beijing、Shanghai Experienced Full-time Responsibilities 岗位职责1、多云架构管理与业务落地:负责公司在 AWS、阿里云等主流公有云上的基础设施规划、建设与日常运维;能够独立对接业务团队,完成复杂业务系统的架构设计、资源规划、部署上线及全生命周期管理。2、K8s 集群稳定性保障:负责公司海量/大规模Kubernetes集群的构建、稳定性优化、容量规划与调度策略调优;负责服务容器化改造及网络、存储等云原生组件的疑难问题排查。3、AI 算力基础设施运维:保障大模型训练和推理任务的稳定运行,熟悉异构算力(如 NVIDIA GPU)服务器的驱动、网络(InfiniBand/RoCE)及监控排障,优化GPU资源调度与利用率。4、CI/CD 与自动化流水线:设计并优化持续集成与持续交付(CI/CD)流水线(如 GitHub Actions, GitLab CI, ArgoCD),推动基础设施即代码(IaC,如Terraform)的落地,提升研发交付效率。5、可观测性系统建设:负责构建和优化全链路监控与告警体系,深入应用 Prometheus、Grafana、Alertmanager等开源工具,制定核心业务的SLI/SLO/SLA,建立高效的故障应急响应流程。6、运维平台自研开发:深入研发团队痛点,能够使用Python或Go语言独立设计并开发运维自动化平台、底座工具或 AI-Agent 智能巡检系统,用工程化手段消除组织内的“脏活累活”(Toil)。7、应急响应与 On-Call:参与生产环境的On-Call值班,对线上突发事件进行快速响应、定位、止血与复盘,沉淀故障知识库。 Qualifications 任职要求1、教育背景与经验:计算机或相关专业本科及以上学历,5年以上SRE、DevOps或运维开发经验(有AI算力集群或大规模 K8s 运维经验者优先)。2、公有云专长:熟练掌握AWS、阿里云等至少两家主流公有云厂商的架构体系,熟悉其 IAM、VPC、EKS/ACK、RDS等核心服务及跨云互联方案。3、云原生深度掌握:深入理解 Kubernetes 架构与底层原理,熟练掌握常用组件(Ingress, CoreDNS, Flannel/Calico等),具备强大的Pod/Node级别排错、性能调优和网络抓包能力。4、AI / 算力经验(硬性加分项):熟悉大模型分布式训练(如 Megatron-LM, DeepSpeed)或模型推理(如 vLLM, TensorRT-LLM)的基础设施支撑,有4090或

VAST  4 days ago
BBGC jobs

We are a global technology consultancy firm with offices in Middle East, Asia, Europe and USA. We deliver business benefits through innovation. We leverage cutting-edge technology led solutions delivered by a team of skilled professionals, from

BBGC  3 days ago
LexisNexis jobs

Do you have exceptional data capabilities? Would you like to join a Global leader in Legal Analytics and Technology? Role Overview The Machine Learning Engineer develops, deploys, and maintains scalable AI/ML solutions that meet production standards.

LexisNexis  2 days ago
Amazon.com jobs

We are seeking a highly technical Senior Process Engineer to lead AI-driven engineering initiatives in the Compliance domain. This senior role focuses on designing, building, and operating AI-powered solutions that improve Seller compliance outcomes and significantly

Amazon.com  2 days ago
RELX jobs

Do you have exceptional data capabilities? Would you like to join a Global leader in Legal Analytics and Technology? Role Overview The Machine Learning Engineer develops, deploys, and maintains scalable AI/ML solutions that meet production standards.

RELX  2 days ago
Turner & Townsend jobs

Company Description Turner & Townsend is a global leading programme management company with over 22,000 people in more than 60 countries. Working with our clients across real estate, infrastructure, energy and natural resources, we transform together

Turner & Townsend  26 days ago
Western Digital jobs

Company Description At Western Digital, our vision is to power global innovation and push the boundaries of technology to make what you thought was once impossible, possible. At our core, Western Digital is a company of

Western Digital  10 days ago

Subscribe for job alerts and resources to make your job search easier!

Confirmation email sent to

Check your email and click on the link to start receiving your job alerts

Receive the latest job openings for:

devops aws jobs in 上海

You also might be interested in:

AI

Python

Product Development

Data Science

Data Analytics

Automation

Confirmation email sent to

Check your email and click on the link to start receiving your job alerts

All Filters Apply
Sort by
Job Type
Employer/Recruiter
Experience