Author: White Japan

System observability and fault analysis are important measurement standards in system operation and maintenance. With the evolution of technology architecture, resource unit, resource acquisition mode, and communication mode, the system encounters great challenges. These challenges are also forcing the development of o&M related technologies. Throughout the development history of operation and maintenance monitoring, monitoring and observability have been developed for nearly 30 years.

Nowadays, with the implementation of the cloud native architecture, the application architecture is gradually transformed from a single system to microservices, and the business logic then becomes the call and request between microservices. At the same time, virtualization becomes more thorough, container management platform is accepted by more and more enterprises, three-party components are gradually evolving into cloud services, and the entire application architecture becomes cloud native architecture. The service invocation path becomes longer, which makes the traffic direction uncontrollable and difficult to troubleshoot. The ability to continuously analyze the entire application lifecycle of development, test and maintenance by covering the entire stack of various observable data (metrics, logs, links, events) becomes a must for all enterprises.

More and more enterprises have realized that observable capability has become the infrastructure and necessary capability of cloud native, and the observable capability domain has evolved from simple operation and maintenance mode to test and development mode. The observable purpose has expanded from supporting the normal operation of the business to accelerating business innovation and making the business iterate quickly.

Looking back on 2021, Ali Cloud continues to explore in the field of observation. While serving tens of millions of customers, it actively summarizes and refines its own exploration and practice in the field of observation. With the help of application real-time monitoring service (ARMS), enterprises are helped to build a full-stack cloud native observable platform.

Therefore, today we take stock of the highlights of 2021, with you a complete review of Ali Cloud observable 2021.

Part 1: Overview of key new products

(1) Grafana service: What exactly is Grafana service?

​​https://developer.aliyun.com/article/795852​​

(2) Kubernetes monitoring: How do we monitor containers as they become more and more widely used?

​​https://developer.aliyun.com/article/786530​​

(3) Yundiao: Refuse to be a Backburner! How to Use Website Performance optimization to drive product Experience Improvement

​​https://developer.aliyun.com/article/785937​​

(4) Intelligent alarm: How to build a highly coordinated and accurate alarm system in the face of high wind?

​​https://developer.aliyun.com/article/794013​​

(5) Application security: How to Strengthen application Security capability and Fully intercept Log4j Vulnerability Attacks

​​https://developer.aliyun.com/article/845136​​

Part 2: Comprehensive review of the series of courses

(1) Observable series of open courses

Vol.1ALL in one: how to build an end-to-end observable system

​​https://yqh.aliyun.com/live/detail/26691​​

Vol.2 Best Practices for Service Full Link Tracing

​​https://yqh.aliyun.com/live/detail/26692​​

Vol.3 Business & Interpretation of User Experience Observable Scenarios

​​https://yqh.aliyun.com/live/detail/26696​​

Vol.4 How to Establish an Efficient Alarm System to Improve the Efficiency of Daily Operation and Maintenance

​​https://developer.aliyun.com/live/248036​​

(2) Website performance and experience optimization series of open courses

How to Use Performance Optimization to Drive User Experience Improvement

​​https://yqh.aliyun.com/live/detail/26181​​

Vol.2 How to Conduct CDN and Download Optimization Analysis

​​https://yqh.aliyun.com/live/detail/26215​​

Best Practices for The Discovery, Location, and Resolution of Vol.3 Hijacking

​​https://yqh.aliyun.com/live/detail/26706​​

(3) “Kubernetes Monitoring Series Open Course”

Vol.1 Explore Application Architectures and Discover Unexpected Network Traffic

Yqh.aliyun.com/live/detail…

How to Find Service and Workload Exceptions in Kubernetes Vol.2

​​https://yqh.aliyun.com/live/detail/26421​​

Vol.3 Using Kubernetes to monitor the problem of Resource usage and Uneven traffic Distribution

​​https://yqh.aliyun.com/live/detail/26606​​

Vol.4 How to Use Kubernetes monitoring location slow call?

​​https://yqh.aliyun.com/live/detail/26835​​

Vol.5 Use Kubernetes to monitor and Locate the Root Cause of abnormal Pod Status

​​https://yqh.aliyun.com/live/detail/27006​​

Part3: Industry practice case sharing

ARMS Escorts Deep Drawing Intelligent System and Brings ultimate User Experience

​​https://developer.aliyun.com/article/781580​​

Walnut Programming: The Road to Building Front-end Observability

​​https://developer.aliyun.com/article/781396​​

“Cloud Dial-test power-saving card robot to optimize the performance of overseas websites”

​​https://developer.aliyun.com/article/792477​​

Cloud Dial-test helps Weidong Cloud Education to comprehensively improve global user Experience

​​https://developer.aliyun.com/article/858688​​

Part4: Classic ebook download

Website Performance and Experience Optimization

​​https://developer.aliyun.com/topic/download?spm=5176.25571740.J_8000295170.2.7ccf363cC16UPK&id=7952​​

“Double Eleven E-commerce Website User Experience Report”

​​https://developer.aliyun.com/special/download?spm=a2c6h.14164896.0.0.3d434ffbfs2YMg&id=8196​​

For more on what to see, click here!