[1]LIU Xi-wen, YANG Jin-gang, LIU Bo-yu, et al. Design and Implementation of Observability System[J]. Computer Technology and Development, 2025, (01): 208-214. [doi:10.20165/j.cnki.ISSN1673-629X.2024.0317]

Design and Implementation of Observability System

Computer Technology and Development [ISSN:1006-6977/CN:61-1281/TN]

Volume:
Issue:
2025, No. 01
Pages:
208-214
Column:
New Computing Application Systems
Publication date:
2025-01-10

Article Info

Title:
Design and Implementation of Observability System
Article ID:
1673-629X(2025)01-0208-07
Author(s):
LIU Xi-wen (刘锡文), YANG Jin-gang (杨金刚), LIU Bo-yu (刘博宇), LI Ming-zhe (李明哲)
Information Technology Institute, China Railway Harbin Bureau Group Co., Ltd., Harbin 150000, Heilongjiang, China
Keywords:
observability; fault detection; performance optimization; security management; monitoring
CLC number:
TP393
DOI:
10.20165/j.cnki.ISSN1673-629X.2024.0317
Abstract:
As information security threats grow in number and complexity, the shift from traditional monitoring to observability has become an inevitable trend. To address problems in fault detection, performance optimization, and security management in information systems, an observability system with separated front end and back end was designed and implemented on a B/S (browser/server) architecture. The system provides four main functions. The first is data collection: logs, metrics, and traces are collected to monitor both software and hardware comprehensively. The second is data processing: log processing and data aggregation allow the collected data to be managed and analyzed effectively. The third is data storage: log data, performance metrics, and trace data are stored and managed to support system monitoring, analysis, and optimization. The fourth is data visualization: interactive monitoring and real-time data display are used to configure monitoring metrics and show system health. The system has been applied to the information systems developed and operated by China Railway Harbin Bureau Group Co., Ltd., and the application results show that it can quickly identify and diagnose potential problems, improving the reliability and performance of those information systems.
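The four functions named in the abstract (collection, processing, storage, visualization) can be illustrated with a toy pipeline. The Python sketch below is only an assumption-laden illustration, not the authors' implementation: the abstract does not describe the system's code, so every name here (`Signal`, `Store`, `process_log`, `aggregate_metrics`, `health_summary`), the "LEVEL message" log format, the in-memory storage, and the CPU threshold are hypothetical.

```python
from collections import defaultdict
from dataclasses import dataclass, field
from statistics import mean


@dataclass
class Signal:
    """One collected record; kind is "log", "metric", or "trace" (hypothetical schema)."""
    kind: str
    source: str
    payload: dict


@dataclass
class Store:
    """In-memory stand-in for the storage layer, keyed by signal kind."""
    records: dict = field(default_factory=lambda: defaultdict(list))

    def save(self, signal: Signal) -> None:
        self.records[signal.kind].append(signal)


def process_log(line: str, source: str) -> Signal:
    """Processing step: parse an assumed 'LEVEL message' line into a structured log signal."""
    level, _, message = line.partition(" ")
    return Signal("log", source, {"level": level, "message": message})


def aggregate_metrics(store: Store, name: str) -> dict:
    """Processing step: average all stored samples of one metric, per source."""
    samples = defaultdict(list)
    for s in store.records["metric"]:
        if s.payload["name"] == name:
            samples[s.source].append(s.payload["value"])
    return {src: mean(vals) for src, vals in samples.items()}


def health_summary(store: Store, metric: str, threshold: float) -> list:
    """Text stand-in for a dashboard: a source is unhealthy if it logged an
    ERROR or its average metric value exceeds the threshold."""
    averages = aggregate_metrics(store, metric)
    errors = {s.source for s in store.records["log"]
              if s.payload["level"] == "ERROR"}
    lines = []
    for src in sorted(set(averages) | errors):
        avg = averages.get(src)
        unhealthy = src in errors or (avg is not None and avg > threshold)
        lines.append(f"{src}: {'UNHEALTHY' if unhealthy else 'OK'}")
    return lines


# Collection step: made-up sample data covering all three signal types.
store = Store()
store.save(Signal("metric", "web-1", {"name": "cpu", "value": 40.0}))
store.save(Signal("metric", "web-1", {"name": "cpu", "value": 60.0}))
store.save(Signal("metric", "db-1", {"name": "cpu", "value": 95.0}))
store.save(process_log("ERROR connection pool exhausted", "db-1"))
store.save(process_log("INFO request served", "web-1"))
store.save(Signal("trace", "web-1", {"trace_id": "t-001", "duration_ms": 12}))

summary = health_summary(store, "cpu", threshold=80.0)
for line in summary:
    print(line)
```

In this sketch the three signal types share one record shape so that a single store can hold them; a production system would more plausibly use separate backends for logs, metrics, and traces.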


Last Update: 2025-01-10