Skip to content

Commit

Permalink
fix some spelling and grammer
Browse files Browse the repository at this point in the history
  • Loading branch information
electron0zero authored Aug 17, 2024
1 parent d97786e commit 1d95f63
Show file tree
Hide file tree
Showing 4 changed files with 49 additions and 49 deletions.
25 changes: 13 additions & 12 deletions pages/about.md
Original file line number Diff line number Diff line change
Expand Up @@ -4,40 +4,41 @@ title: About
permalink: /about/
---

## What is Failure modes?
## What is Failure Modes?

Failure modes is community of software practitioners that likes share and discuss about the failure modes they have seen in real life.
Failure Modes is a community of software practitioners that likes to share and discuss the failure modes they have seen in production systems.

We host meetup in Bangalore, India. Meetups are invite only and follow [Chatham House Rule](https://en.wikipedia.org/wiki/Chatham_House_Rule) to ensure an safe space to share failures.
We host meetups in Bangalore, India.

Meetups are invite-only and follow [Chatham House Rule](https://en.wikipedia.org/wiki/Chatham_House_Rule) to ensure a safe space to share failures.

> **Chatham House Rule:** When a meeting, or part thereof, is held under the Chatham House Rule,
> participants are free to use the information received, but neither the identity nor the affiliation of
> the speaker(s), nor that of any other participant, may be revealed.
>
> [Source - chathamhouse.org](https://www.chathamhouse.org/about/chatham-house-rule)
See [Meetup](/meetup/) page for more details about meetup.
See the [meetup](/meetup/) page for more details about the meetup.

Browse around this website, and expore the collection of incidents and [learn](/learn/) from literature
Browse around this website and explore the collection of incidents and [learn](/learn/) from literature
on how and why software systems fail, and how we can build better systems

## Why Failure modes?
## Why Failure Modes?

Running things in production is hard and running distributed systems in production is extra hard.
Running software systems in production is hard, and running distributed systems in production is even harder.

Failure Modes is an effort to curate resources and stories from the community,
to learn and get better at running large scale software in production.

## Contributing to Failure modes

- Observed an interesting failure mode in your day to day job?, saw and intesting post incident report from a company?, [create an issue](https://github.com/electron0zero/failure-modes/issues/new) to add that to our collection.
- Observed an interesting failure mode in your day-to-day job? Saw an interesting post-incident report from a company?, [create an issue](https://github.com/electron0zero/failure-modes/issues/new) to add that to our collection.

- Wrote a blogpost about a failure you saw in production? [create an issue](https://github.com/electron0zero/failure-modes/issues/new) with blogpost link to add it to our collection.
- Wrote a blog post about a failure you saw in production? [create an issue](https://github.com/electron0zero/failure-modes/issues/new) with blog post link to add it to our collection.

- Saw something intesting about failure modes on the internet? [create an issue](https://github.com/electron0zero/failure-modes/issues/new) and share the link. It can be anything from incident postmortems, blog posts, projects, talks, tweets, research, etc.
- Saw something interesting about failure modes on the internet? [create an issue](https://github.com/electron0zero/failure-modes/issues/new) and share the link. It can be anything from incident postmortems, blog posts, projects, talks, tweets, research, and more.

Huge thanks to our [contributors](https://github.com/electron0zero/failure-modes/graphs/contributors) :bowing_man: :bowing_woman: :tada:

Have suggestions or questions, reach out to Suraj on twitter [@electron0zero](https://twitter.com/electron0zero) or open [an issue](https://github.com/electron0zero/failure-modes/issues)
Have suggestions or questions? Reach out to Suraj on twitter [@electron0zero](https://twitter.com/electron0zero) or open [an issue](https://github.com/electron0zero/failure-modes/issues)

:boom: :boom: :boom: :boom: :boom: :boom: :boom: :boom: :boom: :boom: :boom: :boom: :boom: :boom: :boom:
29 changes: 14 additions & 15 deletions pages/learn.md
Original file line number Diff line number Diff line change
Expand Up @@ -6,11 +6,11 @@ permalink: /learn/

## Learn about building resilient systems

Collection of resources to learn about failures, and failure modes of software systems.
A collection of resources to learn about failures and failure modes of software systems.

## Blog Posts

Blog Posts on failures, reliability, testing and other relevant topics
Blog posts on failures, reliability, testing, and other relevant topics

- [Chaos Engineering — Review Lineage Driven Failure Injection(LDFI)](https://medium.com/becloudy/chaos-engineering-review-lineage-driven-failure-injection-ldfi-a1c831abe504)

Expand All @@ -22,14 +22,13 @@ Blog Posts on failures, reliability, testing and other relevant topics

- [Lessons learned in incident management - Dropbox](https://dropbox.tech/infrastructure/lessons-learned-in-incident-management)

- [Post Mortem - The Cloudflare Blog](https://blog.cloudflare.com/tag/postmortem), list postmortems from cloudflare
- [Post Mortem - The Cloudflare Blog](https://blog.cloudflare.com/tag/postmortem), lists postmortems from cloudflare

- [How we’re building a production readiness review process at Grafana Labs](https://grafana.com/blog/2021/10/13/how-were-building-a-production-readiness-review-process-at-grafana-labs/)

## Talks

Talks on how systems fail, demo of systems, and other wisdom on how
we can build better systems -
Talks on how systems fail, demos of systems, and other wisdom on how we can build better systems -

- [Debugging Under Fire: Keep your Head when Systems have Lost their Mind - Bryan Cantrill](https://www.youtube.com/watch?v=30jNsCVLpAE)

Expand All @@ -53,7 +52,7 @@ we can build better systems -

## Tools & Projects

Tools and projects focused on failures, and failure modes of software systems.
Tools and projects focused on failures and failure modes of software systems.

### [Chaos Toolkit](https://github.com/chaostoolkit)
The Open Source Platform for Chaos Engineering
Expand All @@ -63,8 +62,8 @@ Chaos Monkey is a resiliency tool that helps applications tolerate random instan

### [Learning from Incidents in Software](https://www.learningfromincidents.io/)
Incidents are costly. Without spending time analyzing and determining the conditions
that exist in order for an incident to take place, we won't learn how to successfully
remove nor recover from these conditions in the future.
that exist for an incident to take place, we won't learn how to successfully
remove or recover from these conditions in the future.

Let's help each other learn.

Expand All @@ -74,8 +73,8 @@ understanding and coping with the immense levels of complexity involved in the
operation of critical digital services.

### [Resilience engineering papers](https://github.com/lorin/resilience-engineering)
Contains notes about people active in resilience engineering, as well as
some influential researchers who are no longer with us
Contains notes about people active in resilience engineering as well as
some influential researchers who are no longer with us.


### [Kubernetes Failure Stories](https://github.com/hjacobs/kubernetes-failure-stories)
Expand All @@ -86,7 +85,7 @@ Chaos Mesh is a cloud-native Chaos Engineering platform that orchestrates chaos
on Kubernetes environments.

### [Debugging stories - Dan Luu](https://github.com/danluu/debugging-stories)
Collection of links to various debugging stories.
A collection of links to various debugging stories.

### [A List of Post-mortems! - Dan Luu](https://github.com/danluu/post-mortems)
A collection of postmortems.
Expand All @@ -99,7 +98,7 @@ Curated list of resources on testing distributed systems

## Research

Research on failures and how to test, build and operate reliable systems -
Research on failures and how to test, build, and operate reliable systems -

- [Lineage-driven Fault Injection - the morning paper](https://blog.acolyer.org/2015/03/26/lineage-driven-fault-injection/)

Expand All @@ -113,7 +112,7 @@ Research on failures and how to test, build and operate reliable systems -

- [Report from the SNAFU catchers Workshop on Coping With Complexity](https://snafucatchers.github.io/)

- [How Complex Systems Fail - Richard I. Cook](https://how.complexsystems.fail/), [Original pdf](https://www.gwern.net/docs/technology/2000-cook.pdf)
- [How Complex Systems Fail - Richard I. Cook](https://how.complexsystems.fail/), ([Original pdf](https://www.gwern.net/docs/technology/2000-cook.pdf))

### Fault Isolation using Shuffule Sharding
- [AWS re:Invent 2018: How AWS Minimizes the Blast Radius of Failures (ARC338)](https://www.youtube.com/watch?v=swQbA4zub20)
Expand All @@ -127,7 +126,7 @@ Research on failures and how to test, build and operate reliable systems -

## Systems

Real world failure stories and incident postmortems of widely used systems
Real-world failure stories and incident postmortems of widely used systems

### PostgreSQL
- [Transaction ID wraparound outage at mandrill](https://mailchimp.com/what-we-learned-from-the-recent-mandrill-outage/)
Expand All @@ -141,7 +140,7 @@ Real world failure stories and incident postmortems of widely used systems
### Kubernetes
- [Compilation of public failure/horror stories related to Kubernetes](https://github.com/hjacobs/kubernetes-failure-stories)
- [10 Ways to Shoot Yourself in the Foot with Kubernetes, #9 Will Surprise You - Laurent Bernaille](https://www.youtube.com/watch?v=QKI-JRs2RIE)
- [Kubernetes pods /etc/resolv.conf ndots:5 option and why it may negatively affect your application performances](https://pracucci.com/kubernetes-dns-resolution-ndots-options-and-why-it-may-affect-application-performances.html), also see [DNS Lookups in Kubernetes](https://mrkaran.dev/posts/ndots-kubernetes/)
- [Kubernetes pods /etc/resolv.conf ndots:5 option and why it may negatively affect your application performance](https://pracucci.com/kubernetes-dns-resolution-ndots-options-and-why-it-may-affect-application-performances.html), also see [DNS Lookups in Kubernetes](https://mrkaran.dev/posts/ndots-kubernetes/)

### YugabyteDB
- [How Plume Handled Billions of Operations Per Day Despite an AWS Zone Outage](https://blog.yugabyte.com/how-plume-handled-billions-of-operations-per-day-despite-an-aws-zone-outage/)
16 changes: 8 additions & 8 deletions pages/meetup.md
Original file line number Diff line number Diff line change
Expand Up @@ -8,11 +8,11 @@ permalink: /meetup/

We have occasional meetups in Bangalore, India.

We meet whenever I have time to manage the logistics of the meetup, and have a venue that can host us. :smile:
We meet whenever I have time to manage the logistics of the meetup and have a venue that can host us. :smile:

Meetups are invite-only and follow [Chatham House Rule](https://en.wikipedia.org/wiki/Chatham_House_Rule) to ensure an safe space to share failiures.
Meetups are invite-only and follow [Chatham House Rule](https://en.wikipedia.org/wiki/Chatham_House_Rule) to ensure a safe space to share failures.

Interested in hosting one of our meetups in Bangalore? [Send me a direct message on Twitter](https://twitter.com/electron0zero).
Are you interested in hosting one of our meetups in Bangalore? [Send me a direct message on Twitter](https://twitter.com/electron0zero).

## Next Meetup

Expand All @@ -24,7 +24,7 @@ Date: 24th August, 2024

Join the [WhatsApp Community](https://chat.whatsapp.com/IQOeAnHctWu2FSbgZ0Brro) to stay in the loop.

You can also subscribe to the Meetup Calendar at [meetup.ics](/meetup.ics). Use the following links:
You can also subscribe to the meetup calendar at [meetup.ics](/meetup.ics). Use the following links:

- [iOS/MacOS](webcal://failuremodes.dev/meetup.ics)
- [Google Calendar](https://calendar.google.com/calendar/u/0?cid=webcal%3A%2F%2Ffailuremodes.dev%2Fmeetup.ics)
Expand All @@ -38,9 +38,9 @@ For other apps, you can import the generic [ICS file](https://failuremodes.dev/m

Date: 17th March, 2024

Hosted by [IG Group](https://www.ig.com) with the help from [Srivatsa RV](https://twitter.com/rv_srivatsa), and [Mehul Ved](https://twitter.com/mehulved), thank you :bow:
Hosted by [IG Group](https://www.ig.com) with help from [Srivatsa RV](https://twitter.com/rv_srivatsa), and [Mehul Ved](https://twitter.com/mehulved), thank you :bow:

### Failure Modes - 3nd Meetup
### Failure Modes - 3rd Meetup

[Announcement Tweet](https://twitter.com/electron0zero/status/1746149250201715007)

Expand All @@ -58,7 +58,7 @@ Hosted by [IG Group](https://www.ig.com) with the help from [Srivatsa RV](https:

Date: 5th Feb, 2023

Hosted by [DeepSource](https://deepsource.com/), thank you :bow:
Hosted by [DeepSource](https://deepsource.com/). thank you :bow:

### Failure Modes - 1st Meetup

Expand All @@ -68,4 +68,4 @@ Hosted by [DeepSource](https://deepsource.com/), thank you :bow:

Date: 25th Jan, 2020

Hosted By [Clarisights](https://clarisights.com/), thank you :bow:
Hosted By [Clarisights](https://clarisights.com/). thank you :bow:
Loading

0 comments on commit 1d95f63

Please sign in to comment.