Site Operations Engineer
LinkedIn is the world's largest professional network, committed to creating economic opportunities for its members. LinkedIn is seeking a Site Operations Engineer to ensure maximum availability of the company's platform by proactively responding to alerts and resolving application, system, and network issues.
Responsibilities
- Proactively collaborate with engineering and non-engineering teams to maintain availability and performance SLAs of the LinkedIn application stack and infrastructure, using key homegrown and commercially available tools
- Adeptly perform application and website troubleshooting to quickly resolve issues, per documented procedures
- Work with development teams to resolve issues affecting site availability and performance, and coordinate notifications and status updates on those issues to customers and executive management
- Develop tools to improve our ability to rapidly deploy and effectively monitor custom applications in a large-scale UNIX environment
- Identify and automate repetitive tasks and processes to improve issue resolution time
- Work effectively as a team player, collaborating closely with peers and other operations or engineering teams
- Apply strong organizational skills to handle multiple tasks in an interrupt-driven, real-time environment
Skills
- BA/BS Degree in Computer Science or related technical discipline
- 1+ years of experience with incident management and escalation process
- 1+ years of experience with UNIX/Linux systems administration and/or DevOps
- Experience with network concepts, IPv4/6, the TCP/IP stack, and common Internet protocols (HTTP, HTTPS, DNS, FTP, etc.)
- 1+ years of experience in a technical operations or similar role
- Programming experience in shell scripting or an object-oriented language (such as Python, Perl, Ruby, Java/Scala, or C/C++)
- Excellent interpersonal, analytical, and communication skills - both written and verbal
- Ability to clearly articulate problems and solutions to engineering and non-engineering teams at LinkedIn
- Good troubleshooting and analysis skills to deep dive into a problem and drive it to resolution
- Experience in documentation and streamlining processes
- Ability to review, recommend, and implement new processes to achieve or improve site-level SLAs
- Object-oriented programming
- Unix/Linux systems administration
- Networking concepts: TCP/IP, IPv4/6, HTTP, HTTPS, TLS, DNS, FTP
- Communication skills
- Observability tooling
Benefits
- Generous health and wellness programs
- Time away for employees of all levels
- Annual performance bonus
- Stock
- Benefits and/or other applicable incentive compensation plans