As a Site Reliability Engineer, you will be responsible for setting up and maintaining the company's product components in customer IT infrastructure and managing data pipeline.What you will do :- Being responsible for the deployment of the software in a cloud infrastructure (AWS or Azure) and making sure it is running correctly- Being answerable for the architecture and technical leadership of the complete DevOps infrastructure (designing, building, optimizing data platforms)- Installation, configuration, and maintenance of dependency packages/software- Database (PostgreSQL, MongoDB), Message Broker (RMQ, Kafka), Cache (Redis), HTTP Server (NGINX) etc- Being responsible for maintaining high availability of production environment and coordinating with Dev/QA leads to review/ approve deployments- Being responsible for creating software deployment strategies essential for the successful deployment of software in the cloud- Identifying issues in the production phase of the system and implementing monitoring solutions to overcome those issues- Being involved in creating technology infrastructure, automation tools, and maintaining configuration management- Being responsible for continuous deployment and integration frameworks across sites, automating all tasks for deploying code and dataWhat you need to have :- 2 - 4 years of experience in dockerizing, cloud deployment, security, system configuration, automation and implementation- Sound knowledge of containerizing of solution (packing, deploying, and running applications using Docker, Docker Swarm and Kubernetes)- Sound knowledge of various tools and technologies with awareness of technologies like Python, PostgreSQL and MongoDB- Experience in configuration, maintaining and securing Linux and configuring/ automating the monitoring tools using scripts- Ability to strategize, monitor and maintain deployment of micro-services eco system on premise and on cloud (AWS, Azure, GCP).- Ability to monitor health of PCs/VMs and awareness of usage of standard tools to generate periodic health report- Thorough understanding of cloud platforms, databases, and data clusters and expertise in scaling distributed data systems- Sound knowledge of Unix and Window OS administration- Proficiency in Bash Schell scripting and Windows services- Sound knowledge of distributed deployment paradigm setting up cluster of nodes, high- availability, load-balancing, master-follower concepts- Sound knowledge of continuous integration/deployment and ability to set up CI-CD pipelines in Jenkins and Python source codeless deployment- Awareness of the importance, sensitivity, and sanctity of a production environment, taking responsibility and ownership, pride- Ability to detect and protect deployment from Cyber-attacks and ability to find and bridge security loopholes in the deployment- Ability to resolve deployment platform and infrastructure related issues as per service level agreements (SLAs) with customers- Ability to remain calm during uncertainties, and work in a collaborative environment- Capability of developing a good rapport with other people in the organization, good task prioritization and delegation skills- Good oral and written communication and crisis management skills with an in-person or at least virtual availability at the time of need (ref:hirist.com)