Site Reliability Engineer

St Leonards, New South Wales Mastercard

Posted today

Job Viewed

Tap Again To Close

Job Description

**Our Purpose**
_Mastercard powers economies and empowers people in 200+ countries and territories worldwide. Together with our customers, we're helping build a sustainable economy where everyone can prosper. We support a wide range of digital payments choices, making transactions secure, simple, smart and accessible. Our technology and innovation, partnerships and networks combine to deliver a unique set of products and services that help people, businesses and governments realize their greatest potential._
**Title and Summary**
Site Reliability Engineer
The BizOps team is looking for a Site Reliability Engineer who can help us solve problems, build our CI/CD pipeline and lead Mastercard in DevOps automation and best practices.
- Are you a born problem solver who loves to figure out how something works?
- Are you a CI/CD geek who loves all things automation?
- Do you have a low tolerance for manual work and look to automate everything you can?
Business Operations is leading the DevOps transformation at Mastercard through our tooling and by being an advocate for change & standards throughout the development, quality, release, and product organizations. We need team members with an appetite for change and pushing the boundaries of what can be done with automation. Experience in working across development, operations, and product teams to prioritize needs and to build relationships is a must.
Mission
The role of business operations is to be the production readiness steward for the platform. This is accomplished by closely partnering with developers to design, build, implement, and support technology services.
A business operations engineer will ensure operational criteria like system availability, capacity, performance, monitoring, self-healing, and deployment automation are implemented throughout the delivery process.
Business Operations plays a key role in leading the DevOps transformation at Mastercard through our tooling and by being an advocate for change and standards throughout the development, quality, release, and product organizations.
We accomplish this transformation through supporting daily operations with a hyper focus on triage and then root cause by understanding the business impact of our products.
The goal of every biz ops team is to shift left to be more proactive and upfront in the development process, and to proactively manage production and change activities to maximize customer experience, and increase the overall value of supported applications.
Biz Ops teams also focus on risk management by tying all our activities together with an overarching responsibility for compliance and risk mitigation across all our environments.
A biz ops focus is also on streamlining and standardizing traditional application specific support activities and centralizing points of interaction for both internal and external partners by communicating effectively with all key stakeholders.
Ultimately, the role of biz ops is to align Product and Customer Focused priorities with Operational needs. We regularly review our run state not only from an internal perspective, but also understanding and providing the feedback loop to our development partners on how we can improve the customer experience of our applications.
Responsibilities
- Engage in and improve the whole lifecycle of services-from inception and design, through deployment, operation and refinement.
- Analyze ITSM activities of the platform and provide feedback loop to development teams on operational gaps or resiliency concerns
- Support services before they go live through activities such as system design consulting, capacity planning and launch reviews.
- Maintain services once they are live by measuring and monitoring availability, latency and overall system health.
- Scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity.
- Support the application CI/CD pipeline for promoting software into higher environments through validation and operational gating, and lead Mastercard in DevOps automation and best practices.
- Practice sustainable incident response and blameless postmortems.
- Take a holistic approach to problem solving, by connecting the dots during a production event thru the various technology stack that makes up the platform, to optimize mean time to recover
- Work with a global team spread across tech hubs in multiple geographies and time zones
- Share knowledge and mentor junior resources
Qualifications
- BS degree in Computer Science or related technical field involving coding (e.g., physics or mathematics), or equivalent practical experience.
- Experience with algorithms, data structures, scripting, pipeline management, and software design.
- Systematic problem-solving approach, coupled with strong communication skills and a sense of ownership and drive.
- Ability to help debug and optimize code and automate routine tasks.
- We support many different stakeholders. Experience in dealing with difficult situations and making decisions with a sense of urgency is needed.
- Experience in one or more of the following is preferred: C, C++, Java, Python, Go, Perl or Ruby.
- Interest in designing, analyzing and troubleshooting large-scale distributed systems.
- We need team members with an appetite for change and pushing the boundaries of what can be done with automation.
- Experience in working across development, operations, and product teams to prioritize needs and to build relationships is a must.
- Experience in industry standard CI/CD tools like Git/BitBucket, Jenkins, Maven, Artifactory, and Chef.
- Experience designing and implementing an effective and efficient CI/CD flow that gets code from dev to prod with high quality and minimal manual effort is desired.
**Corporate Security Responsibility**
All activities involving access to Mastercard assets, information, and networks comes with an inherent risk to the organization and, therefore, it is expected that every person working for, or on behalf of, Mastercard is responsible for information security and must:
+ Abide by Mastercard's security policies and practices;
+ Ensure the confidentiality and integrity of the information being accessed;
+ Report any suspected information security violation or breach, and
+ Complete all periodic mandatory security trainings in accordance with Mastercard's guidelines.
This advertiser has chosen not to accept applicants from your region.

Senior Site Reliability Engineer

Sydney, New South Wales ServiceNow, Inc.

Posted 1 day ago

Job Viewed

Tap Again To Close

Job Description

It all started in sunny San Diego, California in 2004 when a visionary engineer, Fred Luddy, saw the potential to transform how we work. Fast forward to today - ServiceNow stands as a global market leader, bringing innovative AI-enhanced technology to over 8,100 customers, including 85% of the Fortune 500®. Our intelligent cloud-based platform seamlessly connects people, systems, and processes to empower organizations to find smarter, faster, and better ways to work. But this is just the beginning of our journey. Join us as we pursue our purpose to make the world work better for everyone.
**Do you**
+ know **Linux** in various levels of diagnostics and troubleshooting?
+ write code to **automate** repetitive tasks every time you face repetitive work?
+ smile when you solve an issue in Frankfurt from your laptop in Sydney?
**Answer 'yes' to these questions and we would like to hear from you. Go ahead, hit the Apply button and let's have a chat about your skills and experiences.**
**Want to know more about us?**
Now that we have set the pace, keep reading if you want to understand more about the role and the SRE team. We hope it will be helpful.
**Let's start with the role**
**As reliability engineer in the SRE team you will**
+ Provide relief and sustainable resolution to issues within our infrastructure.
+ Use your experience in software development, systems engineering and networking to proactively prevent repeatable issues.
+ Drive initiatives with partner teams to improve the reliability and performance of the infrastructure through improved system design.
+ Drive a culture of intolerance to manual activity which results in a highly automated environment delivering scalable solutions.
**Now a bit about the SRE team**
The SRE team is a group of highly technical engineers who are tasked with maintaining and developing the reliability, scalability and performance of the ServiceNow infrastructure. The SRE is empowered to drive technical resolutions across the technology stack from hardware through to application and all stops in between. They are also tasked with driving forward the operability of the platform to drive down the number of incidents and to reduce MTTR.
To accomplish this the team combines software development, networking and systems engineering expertise with a strong desire to be challenged by problems of scale and complexity and to make services better for our customers.
**To be successful in this role you have**
+ Knowledge of Linux systems.
+ Coding experience, we normally prefer Python or JavaScript.
+ Networking skills, IP addressing, routing protocols.
+ Monitoring of systems, applications and networks.
+ Uncompromising attention to detail.
+ Ability to work one weekend day on a 4 days/week work.
**We also have pluses!**
These are not a 'must', but please highlight them on your resume if you have:
Experience in cloud architecture or web applications engineering. Experience in databases performance, replication, high availability. A bachelor's or master's degree in a technical area.
Experience in leveraging or critically thinking about how to integrate AI into work processes, decision-making, or problem-solving. This may include using AI-powered tools, automating workflows, analyzing AI-driven insights, or exploring AI's potential impact on the function or industry.
**_Why ServiceNow_**
_ServiceNow's DNA is built in purpose and values. We offer a culture of belonging, inclusivity, collaboration, and customer focus._
_Work-life balance and well-being are our topmost priorities._
_We offer flexible work arrangements._
_We provide competitive compensation, generous benefits, and a professional atmosphere. This is a very collaborative and inclusive work environment where individuals strong in aptitude and attitude can grow their careers through working with some of the most advanced technologies and talented professionals in the business._
**Work Personas**
We approach our distributed world of work with flexibility and trust. Work personas (flexible, remote, or required in office) are categories that are assigned to ServiceNow employees depending on the nature of their work. Learn more here ( .
**Equal Opportunity Employer**
ServiceNow is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, creed, religion, sex, sexual orientation, national origin or nationality, ancestry, age, disability, gender identity or expression, marital status, veteran status, or any other category protected by law. In addition, all qualified applicants with arrest or conviction records will be considered for employment in accordance with legal requirements.
**Accommodations**
We strive to create an accessible and inclusive experience for all candidates. If you require a reasonable accommodation to complete any part of the application process, or are unable to use this online application and need an alternative method to apply, please contact for assistance.
**Export Control Regulations**
For positions requiring access to controlled technology subject to export control regulations, including the U.S. Export Administration Regulations (EAR), ServiceNow may be required to obtain export control approval from government authorities for certain individuals. All employment is contingent upon ServiceNow obtaining any export license or other approval that may be required by relevant export control authorities.
From Fortune. ©2025 Fortune Media IP Limited. All rights reserved. Used under license.
This advertiser has chosen not to accept applicants from your region.

Sr Site Reliability Engineer

Sydney, New South Wales Cisco

Posted 1 day ago

Job Viewed

Tap Again To Close

Job Description

Meet The Team
The Site Reliability Engineering (SRE) team at Duo, a part of Cisco, plays a crucial role in maintaining the reliability, availability, and performance of Duo's security services. They are responsible for ensuring service reliability by implementing robust monitoring and alerting systems to proactively detect and address issues. The team leads incident management efforts to resolve service outages and degradations swiftly. They focus on developing automation tools to streamline operations and improve efficiency. The SRE team continuously optimizes service performance and collaborates with development teams to ensure new features are designed with scalability and reliability in mind. Additionally, they conduct post-incident reviews to identify root causes and implement preventive measures, ensuring Duo's solutions remain dependable, secure at every layer and high-performing to meet user expectations.
Your Impact
As a Site Reliability Engineer on our Site Reliability Engineering team, you will develop software and tools to empower Duo's product development teams to run and maintain their services in production. You will collaborate with a wide range of internal partners to engineer automated solutions in an effort to remove toil and enhance stability for a variety of infrastructure, with an emphasis on scalability. You will face challenges that require an engineering mindset and a desire to automate everything possible.
Skills you have:
You have designed components in cloud based services including infrastructure
You can contribute to a meeting where an outcome is a technical decision made
You have a history of writing performant, maintainable, testable code
You enjoy learning and elevating your team by contributing to code reviews
You are passionate about automation and reducing toil
You are committed to quality and experienced with modern software testing practices
You care about contributing to an amazing work culture and environment
Minimum Qualifications
* 7+ years in Site Reliability Engineering (SRE) or a related IT field.
* Proficient with 6+ years of experience in Python.
* 4+ years of experience with AWS and SaaS solutions.
* Previous experience with automated configuration tools, specifically Terraform and Ansible.
Preferred Qualifications
* Experience with Container Orchestration including Kubernetes and Docker
* Design and own Technical Solutions for broad or complex requirements with insightful and strategic approaches
* Able to write Performant, Maintainable, Testable code
* Prior experience deploying Cloud Services, Monitoring, Alerting, and Handling Escalations
* Experience supporting a High-Availability SaaS environment
* Charting new DevOps practices without a well-defined roadmap
Cisco is an Affirmative Action and Equal Opportunity Employer and all qualified applicants will receive consideration for employment without regard to race, color, religion, gender, sexual orientation, national origin, genetic information, age, disability, veteran status, or any other legally protected basis.
Cisco will consider for employment, on a case by case basis, qualified applicants with arrest and conviction records.
This advertiser has chosen not to accept applicants from your region.

Site Reliability Engineer, Google Cloud Storage

Sydney, New South Wales Google

Posted 1 day ago

Job Viewed

Tap Again To Close

Job Description

At Google, we have a vision of empowerment and equitable opportunity for all Aboriginal and Torres Strait Islander peoples and commit to building reconciliation through Google's technology, platforms and people and we welcome Indigenous applicants. Please see our Reconciliation Action Plan ( for more information.
Minimum qualifications:
+ Bachelor's degree in Computer Science, a related field, or equivalent practical experience.
+ 2 years of experience with software development in one or more programming languages.
+ 2 years of experience with data structures or algorithms.
Preferred qualifications:
+ Master's degree in Computer Science or Engineering.
+ 2 years of experience designing, analyzing, and troubleshooting distributed systems.
Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Google Cloud's services-both our internally critical and our externally-visible systems-have reliability, uptime appropriate to customer's needs and a fast rate of improvement. Additionally SRE's will keep an ever-watchful eye on our systems capacity and performance.
Much of our software development focuses on optimizing existing systems, building infrastructure and eliminating work through automation. On the SRE team, you'll have the opportunity to manage the complex challenges of scale which are unique to Google Cloud, while using your expertise in coding, algorithms, complexity analysis and large-scale system design. SRE's culture of intellectual curiosity, problem solving and openness is key to its success. Our organization brings together people with a wide variety of backgrounds, experiences and perspectives. We encourage them to collaborate, think big and take risks in a blame-free environment. We promote self-direction to work on meaningful projects, while we also strive to create an environment that provides the support and mentorship needed to learn and grow.
+ Write product or system development code.
+ Review code developed by other engineers and provide feedback to ensure best practices (e.g., style guidelines, checking code in, accuracy, testability, and efficiency).
+ Contribute to existing documentation or educational content and adapt content based on product/program updates and user feedback.
+ Triage product or system issues and debug/track/resolve by analyzing the sources of issues and the impact on hardware, network, or service operations and quality.
+ Participate in, or lead design reviews with peers and stakeholders to decide amongst available technologies.
Google is proud to be an equal opportunity workplace and is an affirmative action employer. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity or Veteran status. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. See also and If you have a need that requires accommodation, please let us know by completing our Accommodations for Applicants form:
This advertiser has chosen not to accept applicants from your region.

Site Reliability Engineer, Enterprise Cloud Platforms, Global Technology, Australia

Sydney, New South Wales Bank of America

Posted 1 day ago

Job Viewed

Tap Again To Close

Job Description

Site Reliability Engineer, Enterprise Cloud Platforms, Global Technology, Australia
Sydney, Australia
**Job Description:**
At Bank of America, we are guided by a common purpose to help make financial lives better through the power of every connection. We do this by driving Responsible Growth and delivering for our clients, teammates, communities and shareholders every day.
Being a Great Place to Work is core to how we drive Responsible Growth. This includes our commitment to being a diverse and inclusive workplace, attracting and developing exceptional talent, supporting our teammates' physical, emotional, and financial wellness, recognizing and rewarding performance, and how we make an impact in the communities we serve.
At Bank of America, you can build a successful career with opportunities to learn, grow, and make an impact. Join us!
**Enterprise Cloud Platforms Team:**
Our team designs, builds, and maintains Public Cloud platforms for Bank of America's. We provide our customers an innovative platform with bult-in integrations that allow for a faster time-to-market with reduced complexity. We believe in a high-quality engineering culture, a customer focused mindset, and building for scale and resiliency. As part of this team, you will have a large impact on the evolution of next generation Cloud services for Bank of America and explore an extensive list of new technologies that will drive innovation across our company.
We are seeking Site Reliability Engineers (SREs) to design, build, and maintain our next-gen platforms. The role provides opportunity to work with wide range of technologies and build a unique perspective that comes with integrating disparate services (both on-prem/off-prem) which must interact seamlessly with each other. You will work with colleagues that are fun, smart, hardworking, and driven. You will be part of a global team that is growing, giving you room to innovate and be creative.
**Position Summary**
+ Collaborates with a diverse set of engineers, architects, and teams to design, develop, test, and implement secure, robust, highly available and scalable solutions for BofA's External Cloud Platform
+ Collaborates other software engineers and teams to design and implement deployment approaches using highly scalable, automated, continuous integration and continuous delivery pipelines.
+ Responsible for all aspects of reliability, collaborates with technical experts, key stakeholders, and team members to resolve complex problems, owning the issue until you are sure it will not reoccur.
+ Deep understanding of SRE practices, service level indicators, and service level objectives; proactively utilize them to resolve issues before they impact customers.
+ Gather, analyze, synthesize, and develop visualizations and reporting from large, diverse data sets in service of continuous improvement of the platform.
+ Implement infrastructure, configuration, and network as code for the applications and platforms in your remit.
+ Identify opportunities to eliminate toil and automate the triage of issues to improve overall operational stability.
+ Collaborate with a global team to identify, analyze, and resolve platform vulnerabilities.
+ Proactively promotes the adoption of site reliability engineering best practices within the team and organization.
+ Participate in 24x7 on-call coverage follow the sun model and performs blameless Postmortems (RCAs) as needed.
**Required Skills:**
+ 7 years of combined experience in either SRE, software development, or infrastructure engineering (4 years with an advanced degree in Computer Science or related technical field).
+ 3+ years of hands-on experience building and maintaining cloud platforms on a major cloud service provider.
+ Strong experience in implementing, monitoring, and maintaining a highly scalable and resilient Data Services platform on major CSP's like AWS, Azure or GCP.
+ Strong experience with monitoring tools such as Grafana, Prometheus, Splunk, or Dynatrace, as well as cloud native tools like CloudWatch & CloudTrail, Azure Monitor and Log Analytics
+ Proficiency in implementing, monitoring, and maintaining a Databricks, RDS, or OpenAI platform.
+ Proficient in at least one programming language such as Python, Java/Spring Boot, and .Net; 5+ years applied experience in Python/Java
+ Proficiency in implementing CI/CD pipelines with tools such as git and Jenkins, familiarity with using a GitOps model.
+ Advanced knowledge of networking (firewalls, DNS, Load Balancing, Proxies, etc.)
+ Advanced understanding of Linux & Windows operating systems including shell scripting
+ Excellent interpersonal, organizational and communication (written, verbal, and presentation) skills are a must.
+ Proven ability to work independently with minimal supervision and as part of a global team with direct responsibilities and an ability to juggle competing priorities and adapt to changes in project scope.
**Desired Skills**
+ Strong experience working with a complex IAM infrastructure, including Active Directory, Azure AD Connect, Azure AD, and PingIdentity, Okta, or other SSO solutions.
+ Proficiency in creating automation using Python, Terraform, or Ansible
+ Proficiency in implementing, monitoring, and maintaining a Databricks, CosmosDB, or OpenAI platform.
+ Experience in implementing, monitoring, and maintaining a highly scalable and resilient enterprise platform on Microsoft Azure using native services related to compute, storage, networking, security, and observability.
+ Experience with containerization technologies such as EC2, EKS, Fargate, Openshift, or Kubernetes.
+ Understanding of cost management, inventory management, FinOps model
Bank of America and its affiliates consider for employment and hire qualified candidates without regard to race, religious creed, religion, color, sex, sexual orientation, genetic information, gender, gender identity, gender expression, age, national origin, ancestry, citizenship, protected veteran or disability status or any factor prohibited by law, and as such affirms in policy and practice to support and promote the concept of equal employment opportunity, in accordance with all applicable federal, state, provincial and municipal laws. The company also prohibits discrimination on other bases such as medical condition, marital status or any other factor that is irrelevant to the performance of our teammates.
To view the "Know your Rights" poster, CLICK HERE ( .
View the LA County Fair Chance Ordinance ( .
Bank of America aims to create a workplace free from the dangers and resulting consequences of illegal and illicit drug use and alcohol abuse. Our Drug-Free Workplace and Alcohol Policy ("Policy") establishes requirements to prevent the presence or use of illegal or illicit drugs or unauthorized alcohol on Bank of America premises and to provide a safe work environment.
To view Bank of America's Drug-free Workplace and Alcohol Policy, CLICK HERE .
Bank of America is committed to an in-office culture with specific requirements for office-based attendance and which allows for an appropriate level of flexibility for our teammates and businesses based on role-specific considerations. Should you be offered a role with Bank of America, your hiring manager will provide you with information on the in-office expectations associated with your role. These expectations are subject to change at any time and at the sole discretion of the Company. To the extent you have a disability or sincerely held religious belief for which you believe you need a reasonable accommodation from this requirement, you must seek an accommodation through the Bank's required accommodation request process before your first day of work.
This communication provides information about certain Bank of America benefits. Receipt of this document does not automatically entitle you to benefits offered by Bank of America. Every effort has been made to ensure the accuracy of this communication. However, if there are discrepancies between this communication and the official plan documents, the plan documents will always govern. Bank of America retains the discretion to interpret the terms or language used in any of its communications according to the provisions contained in the plan documents. Bank of America also reserves the right to amend or terminate any benefit plan in its sole discretion at any time for any reason.
This advertiser has chosen not to accept applicants from your region.

Site Reliability Engineer, Enterprise Cloud Platforms, Global Technology, Australia

Sydney, New South Wales Bank of America

Posted 1 day ago

Job Viewed

Tap Again To Close

Job Description

Site Reliability Engineer, Enterprise Cloud Platforms, Global Technology, Australia
Sydney, Australia
**Job Description:**
At Bank of America, we are guided by a common purpose to help make financial lives better through the power of every connection. We do this by driving Responsible Growth and delivering for our clients, teammates, communities and shareholders every day.
Being a Great Place to Work is core to how we drive Responsible Growth. This includes our commitment to being a diverse and inclusive workplace, attracting and developing exceptional talent, supporting our teammates' physical, emotional, and financial wellness, recognizing and rewarding performance, and how we make an impact in the communities we serve.
At Bank of America, you can build a successful career with opportunities to learn, grow, and make an impact. Join us!
**Enterprise Cloud Platforms Team:**
Our team designs, builds, and maintains Public Cloud platforms for Bank of America's. We provide our customers an innovative platform with bult-in integrations that allow for a faster time-to-market with reduced complexity. We believe in a high-quality engineering culture, a customer focused mindset, and building for scale and resiliency. As part of this team, you will have a large impact on the evolution of next generation Cloud services for Bank of America and explore an extensive list of new technologies that will drive innovation across our company.
We are seeking Site Reliability Engineers (SREs) to design, build, and maintain our next-gen platforms. The role provides opportunity to work with wide range of technologies and build a unique perspective that comes with integrating disparate services (both on-prem/off-prem) which must interact seamlessly with each other. You will work with colleagues that are fun, smart, hardworking, and driven. You will be part of a global team that is growing, giving you room to innovate and be creative.
**Position Summary**
+ Collaborates with a diverse set of engineers, architects, and teams to design, develop, test, and implement secure, robust, highly available and scalable solutions for BofA's External Cloud Platform
+ Collaborates other software engineers and teams to design and implement deployment approaches using highly scalable, automated, continuous integration and continuous delivery pipelines.
+ Responsible for all aspects of reliability, collaborates with technical experts, key stakeholders, and team members to resolve complex problems, owning the issue until you are sure it will not reoccur.
+ Deep understanding of SRE practices, service level indicators, and service level objectives; proactively utilize them to resolve issues before they impact customers.
+ Gather, analyze, synthesize, and develop visualizations and reporting from large, diverse data sets in service of continuous improvement of the platform.
+ Implement infrastructure, configuration, and network as code for the applications and platforms in your remit.
+ Identify opportunities to eliminate toil and automate the triage of issues to improve overall operational stability.
+ Collaborate with a global team to identify, analyze, and resolve platform vulnerabilities.
+ Proactively promotes the adoption of site reliability engineering best practices within the team and organization.
+ Participate in 24x7 on-call coverage follow the sun model and performs blameless Postmortems (RCAs) as needed.
**Required Skills:**
+ 7 years of combined experience in either SRE, software development, or infrastructure engineering (4 years with an advanced degree in Computer Science or related technical field).
+ 3+ years of hands-on experience building and maintaining cloud platforms on a major cloud service provider.
+ Strong experience in implementing, monitoring, and maintaining a highly scalable and resilient Data Services platform on major CSP's like AWS, Azure or GCP.
+ Strong experience with monitoring tools such as Grafana, Prometheus, Splunk, or Dynatrace, as well as cloud native tools like CloudWatch & CloudTrail, Azure Monitor and Log Analytics
+ Proficiency in implementing, monitoring, and maintaining a Databricks, RDS, or OpenAI platform.
+ Proficient in at least one programming language such as Python, Java/Spring Boot, and .Net; 5+ years applied experience in Python/Java
+ Proficiency in implementing CI/CD pipelines with tools such as git and Jenkins, familiarity with using a GitOps model.
+ Advanced knowledge of networking (firewalls, DNS, Load Balancing, Proxies, etc.)
+ Advanced understanding of Linux & Windows operating systems including shell scripting
+ Excellent interpersonal, organizational and communication (written, verbal, and presentation) skills are a must.
+ Proven ability to work independently with minimal supervision and as part of a global team with direct responsibilities and an ability to juggle competing priorities and adapt to changes in project scope.
**Desired Skills**
+ Strong experience working with a complex IAM infrastructure, including Active Directory, Azure AD Connect, Azure AD, and PingIdentity, Okta, or other SSO solutions.
+ Proficiency in creating automation using Python, Terraform, or Ansible
+ Proficiency in implementing, monitoring, and maintaining a Databricks, CosmosDB, or OpenAI platform.
+ Experience in implementing, monitoring, and maintaining a highly scalable and resilient enterprise platform on Microsoft Azure using native services related to compute, storage, networking, security, and observability.
+ Experience with containerization technologies such as EC2, EKS, Fargate, Openshift, or Kubernetes.
+ Understanding of cost management, inventory management, FinOps model
Bank of America and its affiliates consider for employment and hire qualified candidates without regard to race, religious creed, religion, color, sex, sexual orientation, genetic information, gender, gender identity, gender expression, age, national origin, ancestry, citizenship, protected veteran or disability status or any factor prohibited by law, and as such affirms in policy and practice to support and promote the concept of equal employment opportunity, in accordance with all applicable federal, state, provincial and municipal laws. The company also prohibits discrimination on other bases such as medical condition, marital status or any other factor that is irrelevant to the performance of our teammates.
To view the "Know your Rights" poster, CLICK HERE ( .
View the LA County Fair Chance Ordinance ( .
Bank of America aims to create a workplace free from the dangers and resulting consequences of illegal and illicit drug use and alcohol abuse. Our Drug-Free Workplace and Alcohol Policy ("Policy") establishes requirements to prevent the presence or use of illegal or illicit drugs or unauthorized alcohol on Bank of America premises and to provide a safe work environment.
To view Bank of America's Drug-free Workplace and Alcohol Policy, CLICK HERE .
Bank of America is committed to an in-office culture with specific requirements for office-based attendance and which allows for an appropriate level of flexibility for our teammates and businesses based on role-specific considerations. Should you be offered a role with Bank of America, your hiring manager will provide you with information on the in-office expectations associated with your role. These expectations are subject to change at any time and at the sole discretion of the Company. To the extent you have a disability or sincerely held religious belief for which you believe you need a reasonable accommodation from this requirement, you must seek an accommodation through the Bank's required accommodation request process before your first day of work.
This communication provides information about certain Bank of America benefits. Receipt of this document does not automatically entitle you to benefits offered by Bank of America. Every effort has been made to ensure the accuracy of this communication. However, if there are discrepancies between this communication and the official plan documents, the plan documents will always govern. Bank of America retains the discretion to interpret the terms or language used in any of its communications according to the provisions contained in the plan documents. Bank of America also reserves the right to amend or terminate any benefit plan in its sole discretion at any time for any reason.
This advertiser has chosen not to accept applicants from your region.

Software Engineer, Site Reliability Engineering, Campus

Sydney, New South Wales Google

Posted 1 day ago

Job Viewed

Tap Again To Close

Job Description

For Australia applicants:
At Google, we have a vision of empowerment and equitable opportunity for all Aboriginal and Torres Strait Islander peoples and commit to building reconciliation through Google's technology, platforms and people and we welcome Indigenous applicants. Please see our Reconciliation Action Plan ( for more information.
Minimum qualifications:
+ Bachelor's degree in Computer Science, a related field, or equivalent practical experience.
+ Experience with software development in one or more programming languages during coursework/projects, research, internships, or practical experience in school, work, or open source projects.
+ Experience with data structures or algorithms.
Preferred qualifications:
+ Master's degree in Computer Science or Engineering, or a related field.
Hope is not a strategy. Engineering solutions to design, build, and maintain efficient large-scale systems is a true strategy, and a good one.
Site Reliability Engineering (SRE) is an engineering discipline that combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Google's services-both our internally critical and our externally-visible systems-have reliability and uptime appropriate to users' needs and a fast rate of improvement while keeping an ever-watchful eye on capacity and performance.
SRE is also a mindset and a set of engineering approaches to running better production systems-we build our own creative engineering solutions to operations problems. Much of our software development focuses on optimizing existing systems, building infrastructure and eliminating work through automation. As SREs are responsible for the big picture of how our systems relate to each other, we use a breadth of tools and approaches to solve a broad spectrum of problems. Practices such as limiting time spent on operational work, blameless postmortems and proactive identification of potential outages factor into iterative improvement that is key to both product quality and interesting and dynamic day-to-day work.
SRE's culture of intellectual curiosity, problem solving and openness is key to its success. Our organization brings together people with a wide variety of backgrounds, experiences and perspectives. We encourage them to collaborate, think big and take risks in a blame-free environment. We promote self-direction to work on meaningful projects, while we also strive to create an environment that provides the support and mentorship needed to learn and grow.
To learn more:
+ Check out Site Reliability Engineering ( , written by Google SREs.
+ Watch a recorded Hangout on Air ( to meet some of our SREs.
+ Read a career profile ( about why a software engineer chose to join SRE.
In this role, with your technical expertise you will manage project priorities, deadlines, and deliverables. You will design, develop, test, deploy, maintain, and enhance software solutions.
For United States applicants:
The US base salary range for this full-time position is $118,000-$170,000 + bonus + equity + benefits. Our salary ranges are determined by role, level, and location. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific salary range for your preferred location during the hiring process.
Please note that the compensation details listed in US role postings reflect the base salary only, and do not include bonus, equity, or benefits. Learn more about benefits at Google ( .
+ Write product or system development code.
+ Review code developed by other developers and provide feedback to ensure best practices (e.g., style guidelines, checking code in, accuracy, testability, and efficiency).
+ Contribute to existing documentation or educational content and adapt content based on product/program updates and user feedback.
+ Triage product or system issues and debug/track/resolve by analyzing the sources of issues and the impact on hardware, network, or service operations and quality.
+ Participate in, or lead design reviews with peers and stakeholders to decide amongst available technologies.
Google is proud to be an equal opportunity workplace and is an affirmative action employer. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity or Veteran status. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. See also and If you have a need that requires accommodation, please let us know by completing our Accommodations for Applicants form:
This advertiser has chosen not to accept applicants from your region.
Be The First To Know

About The Latest Site engineer Jobs in Parramatta !

Software Engineer, Site Reliability Engineering, Caching

Sydney, New South Wales Google

Posted 1 day ago

Job Viewed

Tap Again To Close

Job Description

At Google, we have a vision of empowerment and equitable opportunity for all Aboriginal and Torres Strait Islander peoples and commit to building reconciliation through Google's technology, platforms and people and we welcome Indigenous applicants. Please see our Reconciliation Action Plan ( for more information.
Minimum qualifications:
+ Bachelor's degree in Computer Science, a related field, or equivalent practical experience.
+ 2 years of experience with data structures/algorithms and software development in one or more programming languages.
Preferred qualifications:
+ Master's degree in Computer Science or Engineering, or a related field.
+ Experience with object-oriented programming languages such as C++ and Python.
+ Experience with Borg and the Google production environment.
+ Experience in designing, analyzing and maintaining large-scale distributed systems.
+ Experience in owning a small-to-medium area and deliver projects separately with some guidance from executive team members.
+ Excellent problem-solving and troubleshooting skills in software systems.
Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Google Cloud's services-both our internally critical and our externally-visible systems-have reliability, uptime appropriate to customer's needs and a fast rate of improvement. Additionally SRE's will keep an ever-watchful eye on our systems capacity and performance.
Much of our software development focuses on optimizing existing systems, building infrastructure and eliminating work through automation. On the SRE team, you'll have the opportunity to manage the complex challenges of scale which are unique to Google Cloud, while using your expertise in coding, algorithms, complexity analysis and large-scale system design. SRE's culture of intellectual curiosity, problem solving and openness is key to its success. Our organization brings together people with a wide variety of backgrounds, experiences and perspectives. We encourage them to collaborate, think big and take risks in a blame-free environment. We promote self-direction to work on meaningful projects, while we also strive to create an environment that provides the support and mentorship needed to learn and grow.
The Caching Site Reliability Engineering (SRE) is a team in Core Data Foundations that manages critical, business, and user-impacting services. We provide SRE partnership for the caching and caching related services like Static Content Service, Laelaps, Punctual, and Memstore. These services underpin Search, Ads, Gaea Identity, Workspace, and many other critical systems.
+ Work with development partners to improve the reliability, scalability, and efficiency of the services, and make new services meet production best practices.
+ Develop automation and improve next-generation services reliability to accelerate service convergence and migration.
+ Identify and automate away operational toil.
+ Mitigate production outages.
Google is proud to be an equal opportunity workplace and is an affirmative action employer. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity or Veteran status. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. See also and If you have a need that requires accommodation, please let us know by completing our Accommodations for Applicants form:
This advertiser has chosen not to accept applicants from your region.

Senior Site Reliability Engineer, Enterprise Cloud Platforms, Global Technology, Australia

Sydney, New South Wales Bank of America

Posted 1 day ago

Job Viewed

Tap Again To Close

Job Description

Senior Site Reliability Engineer, Enterprise Cloud Platforms, Global Technology, Australia
Sydney, Australia
**Job Description:**
At Bank of America, we are guided by a common purpose to help make financial lives better through the power of every connection. We do this by driving Responsible Growth and delivering for our clients, teammates, communities and shareholders every day.
Being a Great Place to Work is core to how we drive Responsible Growth. This includes our commitment to being a diverse and inclusive workplace, attracting and developing exceptional talent, supporting our teammates' physical, emotional, and financial wellness, recognizing and rewarding performance, and how we make an impact in the communities we serve.
At Bank of America, you can build a successful career with opportunities to learn, grow, and make an impact. Join us!
**Enterprise Cloud Platforms Team:**
Our team designs, builds, and maintains Public Cloud platforms for Bank of America's. We provide our customers an innovative platform with bult-in integrations that allow for a faster time-to-market with reduced complexity. We believe in a high-quality engineering culture, a customer focused mindset, and building for scale and resiliency. As part of this team, you will have a large impact on the evolution of next generation Cloud services for Bank of America and explore an extensive list of new technologies that will drive innovation across our company.
We are seeking Senior Site Reliability Engineers (SREs) to design, build, and maintain our next-gen platforms. The role provides opportunity to work with wide range of technologies and build a unique perspective that comes with integrating disparate services (both on-prem/off-prem) which must interact seamlessly with each other. You will work with colleagues that are fun, smart, hardworking, and driven. You will be part of a global team that is growing, giving you room to innovate and be creative.
**Position Summary**
+ Collaborates with a diverse set of engineers, architects, and teams to design, develop, test, and implement secure, robust, highly available and scalable solutions for BofA's External Cloud Platform
+ Collaborates other software engineers and teams to design and implement deployment approaches using highly scalable, automated, continuous integration and continuous delivery pipelines.
+ Responsible for all aspects of reliability, collaborates with technical experts, key stakeholders, and team members to resolve complex problems, owning the issue until you are sure it will not reoccur.
+ Deep understanding of SRE practices, service level indicators, and service level objectives; proactively utilize them to resolve issues before they impact customers.
+ Gather, analyze, synthesize, and develop visualizations and reporting from large, diverse data sets in service of continuous improvement of the platform.
+ Implement infrastructure, configuration, and network as code for the applications and platforms in your remit.
+ Identify opportunities to eliminate toil and automate the triage of issues to improve overall operational stability.
+ Collaborate with a global team to identify, analyze, and resolve platform vulnerabilities.
+ Proactively promotes the adoption of site reliability engineering best practices within the team and organization.
+ Participate in 24x7 on-call coverage follow the sun model and performs blameless Postmortems (RCAs) as needed.
**Required Skills:**
+ 15 years of combined experience in either SRE, software development, or infrastructure engineering (10 years with an advanced degree in Computer Science or related technical field).
+ 7+ years of hands-on experience building and maintaining cloud platforms on a major cloud service provider.
+ Strong experience in implementing, monitoring, and maintaining a highly scalable and resilient Data Services platform on major CSP's like AWS, Azure or GCP.
+ Strong experience with monitoring tools such as Grafana, Prometheus, Splunk, or Dynatrace, as well as cloud native tools like CloudWatch & CloudTrail, Azure Monitor and Log Analytics
+ Proficiency in implementing, monitoring, and maintaining a Databricks, RDS, or OpenAI platform.
+ Proficient in at least one programming language such as Python, Java/Spring Boot, and .Net; 5+ years applied experience in Python/Java
+ Proficiency in implementing CI/CD pipelines with tools such as git and Jenkins, familiarity with using a GitOps model.
+ Advanced knowledge of networking (firewalls, DNS, Load Balancing, Proxies, etc.)
+ Advanced understanding of Linux & Windows operating systems including shell scripting
+ Excellent interpersonal, organizational and communication (written, verbal, and presentation) skills are a must.
+ Proven ability to work independently with minimal supervision and as part of a global team with direct responsibilities and an ability to juggle competing priorities and adapt to changes in project scope.
**Desired Skills**
+ Strong experience working with a complex IAM infrastructure, including Active Directory, Azure AD Connect, Azure AD, and PingIdentity, Okta, or other SSO solutions.
+ Proficiency in creating automation using Python, Terraform, or Ansible
+ Proficiency in implementing, monitoring, and maintaining a Databricks, CosmosDB, or OpenAI platform.
+ Experience in implementing, monitoring, and maintaining a highly scalable and resilient enterprise platform on Microsoft Azure using native services related to compute, storage, networking, security, and observability.
+ Experience with containerization technologies such as EC2, EKS, Fargate, Openshift, or Kubernetes.
+ Understanding of cost management, inventory management, FinOps model
Bank of America and its affiliates consider for employment and hire qualified candidates without regard to race, religious creed, religion, color, sex, sexual orientation, genetic information, gender, gender identity, gender expression, age, national origin, ancestry, citizenship, protected veteran or disability status or any factor prohibited by law, and as such affirms in policy and practice to support and promote the concept of equal employment opportunity, in accordance with all applicable federal, state, provincial and municipal laws. The company also prohibits discrimination on other bases such as medical condition, marital status or any other factor that is irrelevant to the performance of our teammates.
To view the "Know your Rights" poster, CLICK HERE ( .
View the LA County Fair Chance Ordinance ( .
Bank of America aims to create a workplace free from the dangers and resulting consequences of illegal and illicit drug use and alcohol abuse. Our Drug-Free Workplace and Alcohol Policy ("Policy") establishes requirements to prevent the presence or use of illegal or illicit drugs or unauthorized alcohol on Bank of America premises and to provide a safe work environment.
To view Bank of America's Drug-free Workplace and Alcohol Policy, CLICK HERE .
Bank of America is committed to an in-office culture with specific requirements for office-based attendance and which allows for an appropriate level of flexibility for our teammates and businesses based on role-specific considerations. Should you be offered a role with Bank of America, your hiring manager will provide you with information on the in-office expectations associated with your role. These expectations are subject to change at any time and at the sole discretion of the Company. To the extent you have a disability or sincerely held religious belief for which you believe you need a reasonable accommodation from this requirement, you must seek an accommodation through the Bank's required accommodation request process before your first day of work.
This communication provides information about certain Bank of America benefits. Receipt of this document does not automatically entitle you to benefits offered by Bank of America. Every effort has been made to ensure the accuracy of this communication. However, if there are discrepancies between this communication and the official plan documents, the plan documents will always govern. Bank of America retains the discretion to interpret the terms or language used in any of its communications according to the provisions contained in the plan documents. Bank of America also reserves the right to amend or terminate any benefit plan in its sole discretion at any time for any reason.
This advertiser has chosen not to accept applicants from your region.

Senior Site Reliability Engineer, Enterprise Cloud Platforms, Global Technology, Australia

Sydney, New South Wales Bank of America

Posted 1 day ago

Job Viewed

Tap Again To Close

Job Description

Senior Site Reliability Engineer, Enterprise Cloud Platforms, Global Technology, Australia
Sydney, Australia
**Job Description:**
At Bank of America, we are guided by a common purpose to help make financial lives better through the power of every connection. We do this by driving Responsible Growth and delivering for our clients, teammates, communities and shareholders every day.
Being a Great Place to Work is core to how we drive Responsible Growth. This includes our commitment to being a diverse and inclusive workplace, attracting and developing exceptional talent, supporting our teammates' physical, emotional, and financial wellness, recognizing and rewarding performance, and how we make an impact in the communities we serve.
At Bank of America, you can build a successful career with opportunities to learn, grow, and make an impact. Join us!
**Enterprise Cloud Platforms Team:**
Our team designs, builds, and maintains Public Cloud platforms for Bank of America's. We provide our customers an innovative platform with bult-in integrations that allow for a faster time-to-market with reduced complexity. We believe in a high-quality engineering culture, a customer focused mindset, and building for scale and resiliency. As part of this team, you will have a large impact on the evolution of next generation Cloud services for Bank of America and explore an extensive list of new technologies that will drive innovation across our company.
We are seeking Senior Site Reliability Engineers (SREs) to design, build, and maintain our next-gen platforms. The role provides opportunity to work with wide range of technologies and build a unique perspective that comes with integrating disparate services (both on-prem/off-prem) which must interact seamlessly with each other. You will work with colleagues that are fun, smart, hardworking, and driven. You will be part of a global team that is growing, giving you room to innovate and be creative.
**Position Summary**
+ Collaborates with a diverse set of engineers, architects, and teams to design, develop, test, and implement secure, robust, highly available and scalable solutions for BofA's External Cloud Platform
+ Collaborates other software engineers and teams to design and implement deployment approaches using highly scalable, automated, continuous integration and continuous delivery pipelines.
+ Responsible for all aspects of reliability, collaborates with technical experts, key stakeholders, and team members to resolve complex problems, owning the issue until you are sure it will not reoccur.
+ Deep understanding of SRE practices, service level indicators, and service level objectives; proactively utilize them to resolve issues before they impact customers.
+ Gather, analyze, synthesize, and develop visualizations and reporting from large, diverse data sets in service of continuous improvement of the platform.
+ Implement infrastructure, configuration, and network as code for the applications and platforms in your remit.
+ Identify opportunities to eliminate toil and automate the triage of issues to improve overall operational stability.
+ Collaborate with a global team to identify, analyze, and resolve platform vulnerabilities.
+ Proactively promotes the adoption of site reliability engineering best practices within the team and organization.
+ Participate in 24x7 on-call coverage follow the sun model and performs blameless Postmortems (RCAs) as needed.
**Required Skills:**
+ 15 years of combined experience in either SRE, software development, or infrastructure engineering (10 years with an advanced degree in Computer Science or related technical field).
+ 7+ years of hands-on experience building and maintaining cloud platforms on a major cloud service provider.
+ Strong experience in implementing, monitoring, and maintaining a highly scalable and resilient Data Services platform on major CSP's like AWS, Azure or GCP.
+ Strong experience with monitoring tools such as Grafana, Prometheus, Splunk, or Dynatrace, as well as cloud native tools like CloudWatch & CloudTrail, Azure Monitor and Log Analytics
+ Proficiency in implementing, monitoring, and maintaining a Databricks, RDS, or OpenAI platform.
+ Proficient in at least one programming language such as Python, Java/Spring Boot, and .Net; 5+ years applied experience in Python/Java
+ Proficiency in implementing CI/CD pipelines with tools such as git and Jenkins, familiarity with using a GitOps model.
+ Advanced knowledge of networking (firewalls, DNS, Load Balancing, Proxies, etc.)
+ Advanced understanding of Linux & Windows operating systems including shell scripting
+ Excellent interpersonal, organizational and communication (written, verbal, and presentation) skills are a must.
+ Proven ability to work independently with minimal supervision and as part of a global team with direct responsibilities and an ability to juggle competing priorities and adapt to changes in project scope.
**Desired Skills**
+ Strong experience working with a complex IAM infrastructure, including Active Directory, Azure AD Connect, Azure AD, and PingIdentity, Okta, or other SSO solutions.
+ Proficiency in creating automation using Python, Terraform, or Ansible
+ Proficiency in implementing, monitoring, and maintaining a Databricks, CosmosDB, or OpenAI platform.
+ Experience in implementing, monitoring, and maintaining a highly scalable and resilient enterprise platform on Microsoft Azure using native services related to compute, storage, networking, security, and observability.
+ Experience with containerization technologies such as EC2, EKS, Fargate, Openshift, or Kubernetes.
+ Understanding of cost management, inventory management, FinOps model
Bank of America and its affiliates consider for employment and hire qualified candidates without regard to race, religious creed, religion, color, sex, sexual orientation, genetic information, gender, gender identity, gender expression, age, national origin, ancestry, citizenship, protected veteran or disability status or any factor prohibited by law, and as such affirms in policy and practice to support and promote the concept of equal employment opportunity, in accordance with all applicable federal, state, provincial and municipal laws. The company also prohibits discrimination on other bases such as medical condition, marital status or any other factor that is irrelevant to the performance of our teammates.
To view the "Know your Rights" poster, CLICK HERE ( .
View the LA County Fair Chance Ordinance ( .
Bank of America aims to create a workplace free from the dangers and resulting consequences of illegal and illicit drug use and alcohol abuse. Our Drug-Free Workplace and Alcohol Policy ("Policy") establishes requirements to prevent the presence or use of illegal or illicit drugs or unauthorized alcohol on Bank of America premises and to provide a safe work environment.
To view Bank of America's Drug-free Workplace and Alcohol Policy, CLICK HERE .
Bank of America is committed to an in-office culture with specific requirements for office-based attendance and which allows for an appropriate level of flexibility for our teammates and businesses based on role-specific considerations. Should you be offered a role with Bank of America, your hiring manager will provide you with information on the in-office expectations associated with your role. These expectations are subject to change at any time and at the sole discretion of the Company. To the extent you have a disability or sincerely held religious belief for which you believe you need a reasonable accommodation from this requirement, you must seek an accommodation through the Bank's required accommodation request process before your first day of work.
This communication provides information about certain Bank of America benefits. Receipt of this document does not automatically entitle you to benefits offered by Bank of America. Every effort has been made to ensure the accuracy of this communication. However, if there are discrepancies between this communication and the official plan documents, the plan documents will always govern. Bank of America retains the discretion to interpret the terms or language used in any of its communications according to the provisions contained in the plan documents. Bank of America also reserves the right to amend or terminate any benefit plan in its sole discretion at any time for any reason.
This advertiser has chosen not to accept applicants from your region.

Nearby Locations

Other Jobs Near Me

View all Site Engineer jobs View all jobs in Parramatta