Practical Implementation of Chaos Engineering at Qunar Travel

Practical Implementation of Chaos Engineering at Qunar Travel

Author Introduction Yu Haiying joined Qunar Travel in 2014 as a test development engineer, responsible for testing the Qunar flight service backend. Since 2021, he has been responsible for promoting and implementing chaos engineering at Qunar, focusing on quality gap detection and construction based on chaos engineering. 1. Introduction Qunar’s chaos engineering relies on ChaosBlade … Read more

Practical Implementation and Development Thoughts of Chaos Engineering at Ping An Bank

Practical Implementation and Development Thoughts of Chaos Engineering at Ping An Bank

In recent years, with the popularization of new technologies such as distributed system architecture, microservices concepts, and PaaS service components in the financial industry, traditional monolithic business system architectures based on mainframes and storage are rapidly evolving towards distributed system architectures based on cloud foundations (referred to as new system architecture). The new system architecture … Read more

Practical Application Scenarios of Chaos Engineering

Practical Application Scenarios of Chaos Engineering

We welcome fintech professionals to actively submit articles! Submission Email: [email protected] —— Financial Digitalization Written by / Software Development Center of Zhongyuan Bank Li Zhou Background and Objectives Currently, the digital economy is developing rapidly, and customers’ demand for online and scenario-based financial services is continuously increasing. Financial institutions in the banking industry are accelerating their … Read more

Practical Experience with HTTP Timeout and Fault Testing

Practical Experience with HTTP Timeout and Fault Testing

In fault testing, the HTTP protocol is an extremely common and important testing target. Whether it is inter-service communication in a microservices architecture or external API calls, HTTP plays a critical role in data exchange. When issues such as slow responses, abnormal connections, or request failures occur, anomalies at the HTTP layer are often the … Read more

K8S Lecture 24: Introduction to Chaos Engineering – Core Features of Chaos Mesh and Fault Injection

K8S Lecture 24: Introduction to Chaos Engineering - Core Features of Chaos Mesh and Fault Injection

1. Introduction: Why is Chaos Engineering Needed? In the cloud-native era, system complexity is growing exponentially, and traditional testing methods can no longer cover all failure scenarios. Chaos Engineering is a proactive experimental approach that injects faults to help us discover system weaknesses before real failures occur in production environments. As a CNCF incubated project, … Read more

Practical Implementation of Network Isolation Fault Injection in Database Clusters

Practical Implementation of Network Isolation Fault Injection in Database Clusters

1. Introduction In database clusters, network isolation is one of the common fault scenarios. For example, when a node cannot communicate with other nodes due to a network interruption, the cluster may trigger a master-slave switch, data synchronization interruption, or split-brain issues. By actively injecting network isolation faults, we can verify the cluster’s high availability, … Read more

Fault Injection Testing in Go: A Practical Approach to Chaos Engineering for System Resilience

Fault Injection Testing in Go: A Practical Approach to Chaos Engineering for System Resilience

Click the “blue text” above to follow us Server down! Database unreachable! Network timeout! — Do these words make your scalp tingle? In a production environment, systems can face various bizarre failures at any time. But how do we know if the system can withstand these “critical hits”? Waiting until something goes wrong to regret? … Read more

Chaos Engineering Tools: Implementing Pod-Level Fault Injection with Go

Chaos Engineering Tools: Implementing Pod-Level Fault Injection with Go

Click the “blue text” above to follow us Late-night overtime, just about to leave, suddenly the production environment alarms go off! Checking logs, monitoring, troubleshooting… After two hours of hassle, I found out it was a cascading failure caused by a timeout in a dependent service. Sigh! Does this situation sound familiar? In a microservices … Read more

Injecting Faults and Debugging with ChaosBlade-Operator in K8S

Injecting Faults and Debugging with ChaosBlade-Operator in K8S

▌Introduction: A New Tool for Chaos Engineering ChaosBlade, as an open-source chaos engineering toolchain from Alibaba, abstracts chaos experiments into Kubernetes CRD resources through the ChaosBlade-Operator project. Utilizing the open capabilities of K8S, CRD resources can manage all software and hardware resources and interact with various resources, achieving declarative chaos experiment management that makes fault … Read more

Innovative Development: Fudian Bank Completes Its First Chaos Attack and Defense Drill for Business Systems and Pre-Production Fault Injection for the Next-Generation Core System

Innovative Development: Fudian Bank Completes Its First Chaos Attack and Defense Drill for Business Systems and Pre-Production Fault Injection for the Next-Generation Core System

In the context of continuous innovation and development in financial digitization, Fudian Bank’s information system has gradually transitioned from the original monolithic centralized architecture to a distributed architecture, and from the original IOE architecture to a fully domestically produced architecture. In 2024, Fudian Bank will take the lead in launching the cloud migration of the … Read more