×
Community Blog SysOM 2.0 Supports One-Stop Operating System Migration

SysOM 2.0 Supports One-Stop Operating System Migration

This article discusses upgrades in SysOM version 2.0 and the importance of achieving one-stop migration assessment, migration implementation, and post-migration optimization through tools.

By System O&M SIG

1

CentOS has announced the end of life (EOL) for the CentOS Linux project. Both enterprises and individual users face problems, including CentOS updates, maintenance, and migration. It is inefficient while you carry out the migration manually, for the reason that the migration process cannot be standardized, and in-place migration cannot be implemented. It will also consume plenty of manpower and resources. So, it is impractical to migrate systems manually. It is imperative to achieve one-stop migration assessment, migration implementation, and post-migration optimization through tools.

Based on this, OpenAnolis launched SysOM 2.0, an automated O&M platform centered around operating system migration and O&M. The architecture and core features of version 2.0 have been upgraded. There are three core capabilities: operating system migration, upgraded diagnostic center, and overall architecture upgrade. SysOM 2.0 provides complete migration capabilities, which include migration assessment, migration tools, comparison of pre-migration with post-migration, and system optimization, ensuring a closed loop of operating system management from migration to O&M. For migration scenarios, SysOM 2.0 also enriches related features in existing modules (such as the monitoring center and diagnostic center), improving the O&M experience of the operating system.

Operating System Migration

After CentOS stops service, are you still worried about what, whether or how to change the system, and whether the system will go wrong after changing it? The new Operating System Migration feature of SysOM 2.0 can give you the answer. SysOM 2.0 allows you to migrate all versions of operating systems from CentOS 7 and CentOS 8 to Anolis OS 7 and Anolis OS 8, providing users with a simple and visualized interface to complete one-stop migration.

The features of the operating system migration module in SysOM 2.0 include migration assessment and migration implementation. It supports in-place migration and batch migration to solve the problem that the number of user machines is too large to rotate. It supports diagnostic analysis and system tuning for system exceptions after migration.

Migration Assessment: Before migration, the automatic migration assessment feature allows users to understand the compatibility of the Anolis OS with the original system after migration (such as software compatibility and hardware compatibility). In addition, detailed compatibility reports are provided for users to make informed decisions before the subsequent migration to Anolis OS.

Migration assessment features include:

  • Migration Risk Assessment: It performs a comprehensive migration risk assessment for the operating system.
  • System Assessment: It assesses system-level configurations (such as built-in environment variables, service commands, and kernel system calls before and after the migration).
  • Hardware Assessment: It assesses system hardware and board information.
  • Application Assessment: It assesses the compatibility of installed applications.

2
Figure: Migration Assessment

3
Figure: Migration Assessment Report

Migration Implementation: After completing the migration assessment, you can complete the system migration through the interface operation of migration implementation. You can perform a system backup in advance through the interface to avoid failure or unexpected migration results during the migration process. The migration implementation feature supports standalone migration, batch migration, one-step migration, one-click migration, backup restoration, and offline migration.

The migration implementation process includes the following:

  • Implementation Configuration: Some operations for the implementation configuration.
  • System Backup: If it is necessary, the current system will be backed up.
  • Environment Preparation: Environment preparation and tool deployment before migration.
  • Risk Assessment: A risk assessment is performed to implement the migration.
  • Migration Implementation: After the risk assessment is passed, the system migration operation is performed.
  • Machine Restart: After the migration is completed, you need to restart the machine. When the machine is successfully restarted, the system switches to Anolis OS, marking the completion of the system migration.

If you have backed up the system, you can use the system restore feature to restore the current system to the unmigrated state at any time.

4
Figure: Batch Migration Implementation

5
Figure: Migration Implementation

Monitoring Center

SysOM 2.0 has a new monitoring report feature. This feature collects and visualizes the total resource usage, basic metric trends, and metric fluctuations of the system before and after the migration. This allows you to visualize the changes in the operating system before and after the migration. At the same time, by running a test task for a period of time before and after the migration, you can get an intuitive comparison of the performance of the actual business running on the two operating systems.

Resource Change Monitoring

Migration monitoring visualizes the changes and trends of common resources used before and after the migration. You can visually compare the changes in system resources before and after the migration.

6
Figure: Migration Monitoring

Basic Metric Monitoring

At the same time, migration monitoring monitors common metrics (CPU, memory, network, I/O, and disk) and visualizes the real-time value, change trend, and fluctuation range of each metric. You can visually compare the fluctuation of each metric in the time dimension before and after the migration.

7
Figure: Basic Monitoring

Diagnostic Center

SysOM 2.0 provides a comprehensive diagnosis of scheduling, storage, network, and memory to help operating system users troubleshoot and locate problems in a comprehensive manner. New diagnostic features include scheduling jitter diagnosis, I/O latency analysis, I/O hang diagnosis, network packet loss diagnosis, network jitter diagnosis, network retransmission diagnosis, Cache analysis, Out of Memory (OOM) diagnosis, and the support for custom command sending.

Scheduling Diagnostic Center

Scheduling Jitter Diagnosis: In system O&M scenarios, if the CPU runs in the sys mode for a long time, the programs in the user mode cannot be scheduled. If interrupts are disabled for a long time, the CPU cannot receive TICK interrupts normally, causing scheduling jitter. These two cases are often accompanied by sudden scheduling latency in the business process or even transient system hangs. Scheduling jitter records the time point at which the scheduling jitter occurs, the number of occurrences, and the specific value of the jitter to help users better locate the problems that occur in this scenario.

8

Storage Diagnostic Center

I/O Latency Analysis: High I/O latency generally indicates I/O performance bottlenecks (such as excessive I/O traffic, backlogs, storage device bottlenecks, storage device anomalies, OS storage stack anomalies, etc.), resulting in slow processing of I/O requests and high IO latency. It monitors the historical I/O latency level of each storage device and counts the number of times the I/O latency deviates from the historical latency level per minute. This helps you quickly locate at which level the IO latency is consumed the most, which is easy to locate problems.

I/O Traffic Analysis: If the system I/O traffic is too high and the disk I/O is full, it is easy to cause competition for I/O resources, and user processes with I/O requirements may be blocked. This situation generally means I/O resources have not been properly allocated, causing some processes to occupy a large amount of I/O resources beyond expectations. I/O traffic analysis monitors the I/O resources (such as IOPS and throughput) usage of each storage device at the process level and can analyze the process with the largest resource usage to facilitate problem location.

I/O Hang Diagnosis: I/O hang is a disaster for the system. It is important to detect the I/O hang, switch I/O traffic to normal storage devices in time, and isolate abnormal storage devices. This monitoring item monitors whether there is I/O hang problem on the I/O access path of each storage device of the system.

Network Diagnostic Center

Network Packet Loss Diagnosis: Packet loss diagnosis monitors and records packet loss events, hardware or network interface card devices that lose packets, packet loss points and times, and causes of packet loss. This helps diagnose and locate network packet loss problems.

Network Jitter Diagnosis: Jitter diagnosis supports the ICMP packet. It consists of two parts. One part is the message time delay of the ping initiator (the path of sending messages). The other part is the message time delay of the ping receiver (the path of receiving messages).

Network Retransmission Diagnosis: Retransmission diagnosis records the retransmission time, IP, TCP socket status, and congestion status to help you understand the occurrence of network retransmission.

Memory Diagnostic Center

Cache Analysis in Memory: You can use the cache analysis feature to resolve the files corresponding to the file cache and shared memory in the system or within containers and container groups (as well as the percentage of active and inactive file caches).

OOM Diagnosis: OOM is a common exception in the production environment. When an OOM occurs, it is accompanied by a large number of kernel logs, which are often difficult to analyze. This diagnosis helps locate OOM caused by improper settings (such as memory leaks, cpuset, and mempolicy in system cgroups).

Custom Diagnostic Center

Command Diagnosis: Considering there are various scenarios when O&M personnel diagnose problems, and these scenarios may not be accurately covered by some of SysOM's existing diagnostic features, a new command diagnosis feature has been introduced to support users to input the commands they need as they normally do in the terminal. Then, they can view the returned results.

Overall Architecture Upgrade

The SysOM 1.0 architecture is suitable for deploying full functions on a single machine. It can integrate powerful O&M functions (such as host management, host monitoring, host diagnosis, downtime analysis, and security detection) with one click. As SysOM has been implemented in multiple scenarios and the popularity of the open-source community has increased, new features and growth in management scale have posed new demands on SysOM's architectural design.

  • Support for large-scale scenarios
  • Support for quick feature extension

In order to address the requirements above, SysOM 2.0 upgraded the overall architecture design to enable the platform to deploy and access new services more flexibly and quickly.

  1. SysOM updates backend components to microservices to implement deployment separation and supports distributed containerized deployment in large-scale scenarios. It can also scale out specified microservices based on the load of each microservice.
  2. SysOM introduces the Common Event Center (CEC) to support asynchronous communication among microservices, promote decoupling among microservices, and ensure high cohesion, low coupling, single responsibility, and a clear relationship. At the same time, the pluggable design can be compatible with various types of Message Queue (MQ) technologies and can flexibly switch among various MQs without modifying codes.
  3. SysOM provides a unified channel capability. Each microservice can use the Channel SDK to perform command execution, file delivery, file download, and other functions on the Node. Its pluggable design can support various types of channels, and the channel microservice adopts fully asynchronous programming, which significantly improves the parallel processing capability.

9
Figure: SysOM 2.0 Architecture

Use Practice

Download the rpm package:

wget https://gitee.com/anolis/sysom/releases/download/v2.0/sysom-2.0-1.an8.x86_64.rpm

Install the rpm package:

rpm -ivh sysom-2.0-1.an8.x86_64.rpm
# or yum install -y sysom-2.0-1.an8.x86_64.rpm
  • The default installation path is /usr/local/sysom.
  • The default configuration of nginx external port is 80, which can be configured by export SERVER_PORT=xxx.
  • The default configuration of intranet IP is the first IP found by the ip -4 route command and can be configured by export SERVER_LOCAL_IP=xxx.xxx.xxx.xxx.

Start:

# Run the following command to start:
bash -x /usr/local/sysom/init_scripts/server/init.sh

If the following log is generated by Log Service, the deployment is successful:

Oct 10 12:58:51 mfeng bash[3217754]: /usr/local/sysom/init_scripts/server
Oct 10 12:58:51 mfeng bash[3217754]: + for dir in `ls`
Oct 10 12:58:51 mfeng bash[3217754]: + '[' -d init.sh ']'
Oct 10 12:58:51 mfeng bash[3217754]: + for dir in `ls`
Oct 10 12:58:51 mfeng bash[3217754]: + '[' -d stop.sh ']'
Oct 10 12:58:51 mfeng bash[3217754]: + sed -i 's/^FIRST_INIT_DONE=0/FIRST_INIT_DONE=1/g'     /usr/local/sysom/init_scripts/server/init.sh

Access through the WEB frontend:

After the deployment is successful, you can access the SysOM frontend by accessing the public or private network IP address specified during deployment (such as http://172.22.3.238).

Default username and password: admin/123456

SysOM provides a Demo website (In Chinese): http://sysom.openanolis.cn/

10

0 2 1
Share on

OpenAnolis

83 posts | 5 followers

You may also like

Comments