Jobs NYC

Job Information

StubHub inc. Lead Senior Staff Site Reliability Engineer in New York, New York

Lead Senior Staff Site Reliability Engineer (StubHub, Inc., New York, NY)Develop, design, implement, and maintain observability solutions to ensure reliability, availability, and overall performance of critical systems including continuously improving observability capabilities to proactively identify and address potential issues before they impact users. Work closely with cross-functional teams to identify bottlenecks, optimize resource utilization, address potential scalability challenges, and improve overall system reliability. Communicate effectively across teams to ensure smooth operations and resolve complex technical and development issues. Leverage automation tools and scripting languages to reduce manual intervention and increase the efficiency of operations across the engineering organization. Seek opportunities to improve system reliability, robustness, scalability, and performance through innovative solutions and best practices including evaluating new technologies, conducting performance tuning, and implementing optimizations to enhance the overall system infrastructure. Lead the business by ensuring systems are highly available and always functioning. Foster a culture of software engineering excellence that ensures our customers, partners, and internal teams have a consistent and high-quality experience on a global scale. Manage and mentor a team of Site Reliability Engineers, fostering a collaborative and high-performance culture. Conduct regular one-on-one meetings, provide feedback, and support professional growth through training and career development opportunities. Ensure team alignment with organizational objectives, establish clear performance goals, and promote accountability. Oversee team workload distribution, on-call scheduling, and resource allocation to maintain a sustainable work-life balance while achieving operational excellence. Act as a primary point of contact for escalations, guiding the team through complex problem-solving scenarios and driving resolution. Telecommuting may be permitted up to 2 days per week. When not telecommuting, must report to StubHub, Inc. at 3 World Trade Center - 175 Greenwich Street, 59th Floor, New York, New York 10007. Salary: $275,000 to $350,000 per year.MINIMUM REQUIREMENTS: Bachelor’s degree or U.S. equivalent in Computer Science, Computer Engineering, Computer Information Systems, or a related field, plus 5 years of professional experience as a Software Engineer, Systems Engineer, or any occupation/position/job title involving the reliability and maintenance of software infrastructure.In lieu of a Bachelor's degree plus 5 years of experience, the employer will accept 7 years of professional experience as a Software Engineer, Systems Engineer, or any occupation/position/job title involving the reliability and maintenance of software infrastructure.Must also have experience in the following: 5 years of professional experience with statistical sampling (including simple random sampling, systematic sampling, clustered sampling and probabilistic sampling including HyperLogLog), analysis tools (including BigQuery, Jupyter, Numpy, Scipy, scikit-learn, and R), and methods (including regression modelling, correlation analysis, probability analysis, ETL, dataprep, and dashboarding) to support data-driven decision making; 5 years of professional experience implementing automation tools and infrastructure including Infrastructure as Code (IaC) practices to streamline resource deployment processes; 5 years of professional experience utilizing Go, Python, or Node.js; 5 years of professional experience testing and adapting new technologies and tools including microservices, gRPC, Docker, Kubernetes, Cassandra, BigTable, Hadoop, Dataproc, and Dataflow as well as conducting performance tuning and implementing optimizations to enhance overall system infrastructure; 5 years of professional experience designing and improving reliability and robustness of core infrastructure (including Kubernetes and gRPC); and 5 years of professional experience implementing standardized processes and frameworks including enterprise risk management, business continuity planning, LFI incident analysis and FEMA ICS methodology for incident management and analysis.CONTACT: Apply online at: https://jobs.lever.co/StubHub

Minimum Salary: 275,000 Maximum Salary: 350,000 Salary Unit: Yearly

DirectEmployers