This is Fine! A podcast about resilience engineering and software

Colette Alexander and Clint Byrum
Available Episodes

5 of 23
  • Root Cause Analysis vs. Resilience Engineering w/special guest Lorin Hochstein
    A history of the 5 whys and root cause analysis from papers
    Some critiques of the 5 whys:
    From John Allspaw: https://www.oreilly.com/radar/the-infinite-hows/
    From Alan J Card: https://qualitysafety.bmj.com/content/26/8/671
    James Reason and the Swiss Cheese Model: https://pmc.ncbi.nlm.nih.gov/articles/PMC8514562/
    James Reason’s book Human Error: https://bookshop.org/p/books/human-error/9e06d8a100a07537?ean=9780521314190&next=t
    And a classic from Sidney Dekker (et al.) on the implication of complexity within safety investigations: https://www.sciencedirect.com/science/article/abs/pii/S0925753511000105?via%3Dihub
    We always recommend the Howie Guide: https://howie-guide.pagerduty.com/
    STAMP is starting to get popular: https://functionalsafetyengineer.com/introduction-to-stamp/
    Google’s STAMP paper: https://www.usenix.org/publications/loginonline/evolution-sre-google
    Google’s STAMP discussion on ProdCast: https://sre.google/prodcast/#season4-episode7
    And presentation at SRECon: https://www.usenix.org/conference/srecon25americas/presentation/klein
    Nancy Leveson’s google scholar is always worth browsing: https://scholar.google.com/citations?user=78y4sEcAAAAJ&hl=en
    Allspaw’s LinkedIn post that we quoted: https://www.linkedin.com/posts/jallspaw_important-reminders-about-learning-effectively-activity-7378775591447183360-c_eD
    Lorin’s Law: https://surfingcomplexity.blog/2017/06/24/a-conjecture-on-why-reliable-systems-fail/
    Want to talk more about this subject? We’re doing a live event co-sponsored by RISF and you can sign up for it here: https://resilienceinsoftware.org/networks/events/146485
    --------  
    59:43
  • First Stories/Second Stories
    More robustness than resilience, but worth repeating that you should always check your earthquake go-bag: https://www.earthquakeauthority.com/blog/2019/how-to-make-an-earthquake-emergency-kit
    Clint did ASA 103: https://americansailing.com/learn-to-sail/certifications/asa-103-coastal-cruising/
    Since this is a science podcast, there is a scientific reason people get emotional on airplanes: https://www.cntraveler.com/story/why-do-we-always-cry-on-planes
    52 Hertz Whale documentary: https://en.wikipedia.org/wiki/The_Loneliest_Whale:_The_Search_for_52
    And Leslie Jamison wrote 52 Blue as a chapter in one of her essay collections (you can read it excerpted here: https://slate.com/technology/2014/08/52-blue-the-loneliest-whale-in-the-world.html)
    Colette was wrong: Jamison referenced a famous Kathryn Schulz piece in one of her own essays, which was the source of confusion - The Big One: https://www.newyorker.com/magazine/2015/07/20/the-really-big-one about a cataclysmic earthquake on the west coast.
    In case you’re curious, Colette uses scholar.google.com and paperpile.com shamelessly live.
    We reference A Tale of Two Stories: Contrasting Views of Patient Safety by Richard Cook and Dave Woods: https://www.researchgate.net/publication/245102691_A_Tale_of_Two_Stories_Contrasting_Views_of_Patient_Safety?enrichId=rgreq-a699511fb5bc518bf1584a0a6613d8d0-XXX&enrichSource=Y292ZXJQYWdlOzI0NTEwMjY5MTtBUzoyMDYyMjM2NjExMTMzNDdAMTQyNjE3ODk2MDQ4NA%3D%3D&el=1_x_2&_esc=publicationCoverPdf
    The Beaumaiden report (that dives into a deeper, second story) is here: https://dmaib.com/reports/2021/beaumaiden-grounding-on-18-october-2021
    We will continue to point to DORA’s organizational model page: https://dora.dev/capabilities/generative-organizational-culture/
    Some Wikipedia on double loop learning: https://en.wikipedia.org/wiki/Double-loop_learning
    Colette mentioned Mads Møller’s Lund HFSS thesis on deaths and accountability: https://lup.lub.lu.se/student-papers/search/publication/9106422
    And Bram Couteaux’s Lund HFSS thesis on the drunk flight attendants/pilots court: https://lup.lub.lu.se/student-papers/search/publication/9111661
    J Paul Reed wrote about being ‘Blame Aware’ - https://medium.com/@jpaulreed/why-blameless-postmortems-might-feel-wrong-cbeee00d51b2
    --------  
    52:55
  • How (Not) to Introduce Resilience Engineering at Work with special guest Michelle Casey
    Lorikeets are pretty: https://en.wikipedia.org/wiki/Rainbow_lorikeet
    You think Colette’s kidding about the kangaroo? https://www.youtube.com/watch?v=DQjHVRHXbc8
    The Mackinac Bridge is long: https://en.wikipedia.org/wiki/Mackinac_Bridge
    Michelle’s Blog post: https://resilienceinsoftware.org/news/1288714
    DORA has some good writing on Westrum’s cultural models if you’re wondering about it: https://dora.dev/capabilities/generative-organizational-culture/
    The link to our TiF live event with Michelle where we will be discussing the blog post! https://resilienceinsoftware.org/networks/events/143194
    Please ask us questions! You can go to thisisfinepod.com to get the link to our anonymous google form!
    --------  
    53:21
  • How long should you wait after an incident to do your retro?
    Corn sweat is a real thing: https://www.scientificamerican.com/article/humidity-from-corn-sweat-intensifies-extreme-heat-wave-in-midwest-u-s/
    Also, plugging Tajin here, because: https://en.wikipedia.org/wiki/Taj%C3%ADn_seasoning
    Wikipedia tells me Tajin is Mexican. I dunno, Clint.
    Beaumaiden report, for those that didn’t listen to the prior episode where we mentioned it: https://dmaib.com/reports/2021/beaumaiden-grounding-on-18-october-2021
    John Allspaw’s talk at Spotify that we referenced: https://www.youtube.com/watch?v=M8mYPyRG1fQ
    Lorin’s Law is always a good plug: https://surfingcomplexity.blog/2017/06/24/a-conjecture-on-why-reliable-systems-fail/
    Clint’s book recommendation: https://bookshop.org/p/books/the-15-commitments-of-conscious-leadership-a-new-paradigm-for-sustainable-success-diana-chapman/14574335?ean=9780990976905&next=t
    Send us questions at thisisfinepod.com, or find us on LinkedIn here: https://www.linkedin.com/company/this-is-fine-a-podcast-about-software-and-resilience-engineering/
    You can come to the Lund panelist event for RISF by signing up here: https://resilienceinsoftware.org/networks/events/133948
    --------  
    44:24
  • Lund University - Academic Theory and Practice
    A huge thanks to our panelists: John Allspaw, Jed Needle, Chad Todd
    RISF and TiF will host a live follow up to this episode on July 31st! You can sign up here: https://resilienceinsoftware.org/networks/events/133948
    If you’re interested in Lund’s Masters of Science program in Human Factors and Systems Safety, or any of their learning labs, you can check out more info here: https://www.humanfactors.lth.se/
    Adaptive Capacity Labs is how Jed was introduced to some of the concepts of LFI & Resilience Engineering, which eventually landed him at Lund.
    John mentioned SciShow Tangents, a podcast by Hank Green and Ceri Riley: https://www.youtube.com/c/scishowtangents
    As well as Conway’s Law: https://en.wikipedia.org/wiki/Conway%27s_law
    And Dunbar’s Number: https://en.wikipedia.org/wiki/Dunbar%27s_number
    And the Theory of Graceful Extensibility, which you can read about here: https://infoscience.epfl.ch/server/api/core/bitstreams/87cfe245-c138-43cb-87c9-4062dc1a0519/content
    Lund theses list: https://www.humanfactors.lth.se/ny-sajt/msc-programme/msc-theses/
    Our panel’s select theses that they love:
    Colette’s pick: https://lup.lub.lu.se/student-papers/search/publication/9106422
    Chad’s pick: https://lup.lub.lu.se/student-papers/search/publication/9009930
    John’s picks were all of the software theses, I’m probably missing some but this is my attempt:
    John’s (was the first): https://lup.lub.lu.se/student-papers/search/publication/8084520
    J Paul Reed: https://lup.lub.lu.se/student-papers/search/publication/8966930
    Chad’s thesis on handovers in software: https://lup.lub.lu.se/student-papers/search/publication/9076274
    Michael Wettick: https://lup.lub.lu.se/student-papers/search/publication/9150096
    Colette’s thesis on QRA: https://lup.lub.lu.se/student-papers/search/publication/9148570
    Jessica De Vita: https://lup.lub.lu.se/student-papers/search/publication/9149521
    Dr. Raymer’s I want to Treat the Patient and Not the Alarm: https://lup.lub.lu.se/student-papers/search/publication/2861164
    --------  
    1:04:32

About This is Fine! A podcast about resilience engineering and software

A podcast about resilience engineering and software. Ever wondered why things on the internet break? Do you work in software and wish that you could have a Dear-Abby-Like call-in show that could answer your deepest questions about how to make your workplace suck less? We're here to help! Write us anonymously at our open question form. Email us at: [email protected]. Call us and leave a voicemail, or text us at: (401) 592-7574

Generated: 10/20/2025 - 7:20:05 AM