【Platform Team】Site Reliability Engineer

iKala 愛卡拉

Job Description

We are looking for a Site Reliability Engineer (SRE) to keep our cloud-based commerce platform running and healthy.

As an SRE for iKala Commerce, you will be responsible for everything from our cloud infrastructure and operating systems to developing tools for code deployment and service monitoring. You will also review our code and system design and partner with developers to build our applications.

The SRE is an integral member of our product development team. You will be part of the team that makes crucial decisions about how to manage and scale complex, high-performance distributed systems. You will also bring your own perspective on our backend systems and continually develop better ways to manage the underlying infrastructure. Our ideal candidate can develop applications independently, but is more eager to accelerate the whole team by building systems that improve performance and operational efficiency.

Ultimately, you should be involved in all stages of software development to define and improve our SLOs, SLAs & SLIs.

Our current tech stack includes:
GCP, Kubernetes, Helm, Terraform, Stackdriver, Grafana, Prometheus, Elastic.

Requirement

【Responsibilities】 
1. Designing & implementing infrastructure for collecting metrics, crunching data and improving service monitoring to detect problems before they're visible to our customers.
2. Building systems to automate our server lifecycle, from configuration management and CI/CD to server bootstrapping and decommissioning.
3. Troubleshooting, performing root cause analysis, and resolving production issues from the application and network layers all the way down to the system level.
4. Participating in solution design and advising other developers when building new features so that they are scalable, maintainable, and performant.
5. Improving the observability of our applications through monitoring, alerting, logging, tracing and profiling, and building such observability features into a common platform.
6. Practicing sustainable incident response and blameless postmortems.
7. Proactively identifying and reducing issues through design, testing, and implementation of software-based solutions.

【Requirements】
1. BS/MS degree in Computer Science or Engineering, or equivalent practical experience.
2. 3+ years of experience with UNIX/Linux systems.
3. 1+ years of experience in software development, and familiarity with shell scripting or at least one programming language.
4. 3+ years of experience operating and building software in cloud environments including GCP or AWS.
5. Experience in system / relational database administration.
6. Experience with configuration management software such as Terraform, Ansible, Puppet, or Chef.
7. 1+ years of production experience with Docker & Kubernetes.

More info: https://www.ikala.tv

Remote type

Hybrid Interview
Hybrid Job

Flexible remote work opportunities are available.

Benefits

Labor insurance, health insurance, annual leave, labor pension, marriage leave

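【Leave Policy】
In addition to the annual leave required by the Labor Standards Act, we provide five extra days of fully paid flexible leave each year, as well as fully paid sick leave, fully paid menstrual leave for female employees, prenatal checkup leave, and volunteer leave.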

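【Work Environment】
A bright, open office, with flexible remote work (Work-from-Home) depending on the needs of the role, plus unlimited snacks and coffee and on-site massage services from visually impaired massage therapists.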

【Overseas Rotation Opportunities】
Employees can apply for overseas rotations to experience workplace cultures abroad.

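【Open Communication】
A company-wide all-hands meeting is held every two weeks, with good food provided, so that employees can keep up with how the company is doing. An anonymous question and feedback system lets anyone make suggestions without worrying about being ignored.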

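【Training Courses】
Comprehensive onboarding training helps new colleagues get up to speed and settle into the team quickly. External instructors or internal colleagues are invited to teach courses, and employees can apply for subsidies for courses related to job skills, so they can keep sharpening their expertise as their careers develop.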

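【Recreational Activities】
Domestic company trips and fun activities organized by the Employee Welfare Committee, such as Christmas gift exchanges and office escape rooms.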

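【Bonus System】
Generous referral bonuses and peer bonuses that reward outstanding colleagues, plus a year-end bonus and cash gifts for the Dragon Boat Festival, the Mid-Autumn Festival, and birthdays.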

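【Occupational Health and Safety】
Annual health checkups are provided for new and current employees. Licensed physicians are invited to hold annual health talks at the company, and an on-site nurse offers physical and mental health consultations.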
 

Salary Range

Negotiable (Above 40K TWD)