Who is a Site Reliability Engineer (SRE) - Flagship.io


A site reliability engineer (SRE) creates a bridge between development and IT operations by taking on the tasks typically done by operations. Rather than being handled manually by an operations team, such tasks are given to engineers who use automation tools to solve problems by creating scalable and reliable software systems.

Standardization and automation are at the heart of what an SRE does, especially as systems migrate to the cloud. SREs therefore often have a background in software engineering, systems engineering, or system administration, combined with IT operations experience.

What is site reliability engineering?

We will start with a definition of this type of engineering before moving on to the role and responsibilities of a site reliability engineer.

Site reliability engineering is a term first coined by Google, where it is described as “when you treat operations as if it’s a software problem.”

The main purpose of SRE is to develop software systems and automated solutions for operational concerns. SRE thus covers the work traditionally done by operations, but relies on engineers with software expertise to solve complex problems.

Site reliability engineering can therefore be considered a set of practices that incorporates aspects of software engineering into operations, thereby increasing the efficiency and reliability of software systems and improving workflow.

SRE and DevOps

Site reliability engineering is closely related to DevOps, another concept that links software development and operations and that can be seen as a generalization of core SRE principles. Consequently, SRE plays a large part in successfully implementing DevOps practices.

Additionally, both DevOps and SRE seek to bridge the gap between operations and development teams in order to deliver software faster.
However, an article by Google makes a distinction between the two terms, stating that SRE “happens to embody the philosophies of DevOps, but has a much more prescriptive way of measuring and achieving reliability through engineering and operations work. In other words, SRE prescribes how to succeed in the various DevOps areas.”

What does a site reliability engineer do?

A site reliability engineer (SRE) works between development and operations. The SRE, then, is a software developer with experience in and knowledge of IT operations.

A lot of this role revolves around writing code to automate processes such as analyzing logs, testing production environments, and responding to issues, so this engineer needs to be an expert in writing code.

Such automation, in turn, allows developers to focus exclusively on feature development, enabling them to bring new features to production as quickly as possible.

The operations team, for its part, will find its workload decreasing, as an SRE will automate solutions for any recurring problem.

The SRE thus shifts between development and operations work and maintains a balance between the two.

Because an SRE's main focus is on automation, the SRE improves the performance, efficiency, and monitoring of software development processes.

Required skill set

SREs dedicate their time to creating software that improves the reliability of systems, fixing issues, and responding to incidents. As such, they need a range of technical skills.

They need knowledge of various automation tools, as they are usually responsible for building and integrating software tools that enhance the reliability and scalability of an organization's systems.

As mentioned above, an SRE needs to know how to code and should be familiar with most of the common programming languages, including Ruby, JavaScript, and PHP.

An SRE also needs expertise in the major cloud providers, such as AWS and Google Cloud.
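
To make the log-analysis automation mentioned above more concrete, here is a minimal sketch in Python (chosen only for brevity). It counts error entries per service and lists the services that reach a given error count. The log format, the sample entries, and the function names count_errors and services_over_threshold are assumptions made for this illustration; they are not taken from any particular tool.

```python
from collections import Counter

# Hypothetical log format assumed for this sketch:
# "<timestamp> <LEVEL> <service> <message>"
SAMPLE_LOG = """\
2024-01-01T00:00:01Z INFO checkout request served
2024-01-01T00:00:02Z ERROR checkout upstream timeout
2024-01-01T00:00:03Z ERROR checkout upstream timeout
2024-01-01T00:00:04Z INFO search request served
2024-01-01T00:00:05Z ERROR search index unavailable
"""


def count_errors(log_text):
    """Return a Counter mapping service name to its number of ERROR lines."""
    errors = Counter()
    for line in log_text.splitlines():
        parts = line.split(maxsplit=3)
        # Skip blank or malformed lines rather than failing on them.
        if len(parts) < 3:
            continue
        _timestamp, level, service = parts[:3]
        if level == "ERROR":
            errors[service] += 1
    return errors


def services_over_threshold(errors, threshold):
    """Return the sorted names of services with at least `threshold` errors."""
    return sorted(name for name, count in errors.items() if count >= threshold)


if __name__ == "__main__":
    errors = count_errors(SAMPLE_LOG)
    print(dict(errors))
    print(services_over_threshold(errors, threshold=2))
```

In practice, a script like this would read from real log files or a log aggregation service, and the threshold check would feed an alerting tool rather than print to the console.
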

Daily roles and responsibilities of an SRE

Automation

As mentioned previously, SREs build automation tools to manage IT operations. Instead of performing these functions manually, their aim is to automate them. Such functions include:

Continuous integration and continuous delivery
Monitoring
Incident response
Alerts

Monitoring

SREs are responsible for ensuring that the underlying infrastructure is running smoothly and that systems and tools are working as expected.

They also monitor critical applications and services to minimize downtime and ensure their availability.

Issue resolution

SREs work closely with developers, especially when issues arise: they help with troubleshooting and provide consultation when alerts are issued.

If a developer runs into a problem, the SRE investigates and then resolves the issue.

Once the incident is resolved, the SRE revisits the issue and determines its cause to ensure it doesn't happen again.

Cross-team collaboration

As the above suggests, SREs work across different teams, mainly operations and development. By building reliable systems and supporting these teams, SREs give them more time to focus on building new features and getting them out to customers faster.

Common tools used by SREs

Monitoring: AWS CloudWatch and New Relic
Incident management/on-call: PagerDuty and VictorOps
Project management and issue tracking: Jira and Trello
Infrastructure orchestration: Terraform and SaltStack

How much does an SRE make?

According to PayScale, this type of engineer earns a salary of between $76,000 and $158,000 a year in the United States, with the average being $117,768 per year.

Conclusion

The site reliability engineer is becoming an increasingly important role within organizations. It is a challenging role that requires a passion for coding and automation.
Having such engineers in your organization will help reduce your operational costs while improving the reliability of your systems.


