Site Reliability Engineering: How Google Runs Production ...


Site Reliability Engineering: How Google Runs Production Systems
April 2016

Authors: Betsy Beyer, Chris Jones, Jennifer Petoff, Niall Richard Murphy
Publisher: O'Reilly Media, Inc.
ISBN: 978-1-4919-2912-4
Pages: 552

Abstract

The overwhelming majority of a software system's lifespan is spent in use, not in design or implementation. So, why does conventional wisdom insist that software engineers focus primarily on the design and development of large-scale computing systems? In this collection of essays and articles, key members of Google's Site Reliability Team explain how and why their commitment to the entire lifecycle has enabled the company to successfully build, deploy, monitor, and maintain some of the largest software systems in the world. You'll learn the principles and practices that enable Google engineers to make systems more scalable, reliable, and efficient, lessons directly applicable to your organization.

This book is divided into four sections:

Introduction: Learn what site reliability engineering is and why it differs from conventional IT industry practices.
Principles: Examine the patterns, behaviors, and areas of concern that influence the work of a site reliability engineer (SRE).
Practices: Understand the theory and practice of an SRE's day-to-day work: building and operating large distributed computing systems.
Management: Explore Google's best practices for training, communication, and meetings that your organization can use.


