Journal articles on the topic 'Fault resilience'

To see the other types of publications on this topic, follow the link: Fault resilience.

Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles

Select a source type:

Consult the top 50 journal articles for your research on the topic 'Fault resilience.'

Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.

You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.

Browse journal articles on a wide variety of disciplines and organise your bibliography correctly.

1

Lee, Yen-Lin, Shinta Nuraisya Arizky, Yu-Ren Chen, Deron Liang, and Wei-Jen Wang. "High-Availability Computing Platform with Sensor Fault Resilience." Sensors 21, no. 2 (January 13, 2021): 542. http://dx.doi.org/10.3390/s21020542.

Full text
Abstract:
Modern computing platforms usually use multiple sensors to report system information. In order to achieve high availability (HA) for the platform, the sensors can be used to efficiently detect system faults that make a cloud service not live. However, a sensor may fail and disable HA protection. In this case, human intervention is needed, either to change the original fault model or to fix the sensor fault. Therefore, this study proposes an HA mechanism that can continuously provide HA to a cloud system based on dynamic fault model reconstruction. We have implemented the proposed HA mechanism on a four-layer OpenStack cloud system and tested the performance of the proposed mechanism for all possible sets of sensor faults. For each fault model, we inject possible system faults and measure the average fault detection time. The experimental result shows that the proposed mechanism can accurately detect and recover an injected system fault with disabled sensors. In addition, the system fault detection time increases as the number of sensor faults increases, until the HA mechanism is degraded to a one-system-fault model, which is the worst case as the system layer heartbeating.
APA, Harvard, Vancouver, ISO, and other styles
2

Lu, Wei, Weidong Wang, Ergude Bao, Weiwei Xing, and Kai Zhu. "Improving Resilience of Software Systems: A Case Study in 3D-Online Game System." International Journal of Software Engineering and Knowledge Engineering 27, no. 01 (February 2017): 1–22. http://dx.doi.org/10.1142/s0218194017500012.

Full text
Abstract:
Resilience is the property that enables a system to continue operating properly when one or more faults occur. Nowadays, as software systems become more and more complex, their hardware execution platforms also become more heterogenous with larger scale. Software systems may fail due to some faults such as node breakdown, communication failure, or data processing failure. In this paper, we propose a ring-based resilience mechanism, which implements fault detection and recovery. (1) To solve the problem that the central server may have high burden of network traffic, we design a ring-based heartbeat algorithm for crash fault detection. (2) We also design a light-weight recovery mechanism to recover from crash faults as compared with the current system-specific mechanisms. To evaluate our mechanism, we use a 3D-online game system as a case study. By injecting faults, we test the effectiveness and overhead of the proposed mechanism. Compared with other mechanisms, the experimental results show that our mechanism can support resilience very well and is better at dealing with the crash fault caused by high cluster workload with acceptable overhead.
APA, Harvard, Vancouver, ISO, and other styles
3

Wu, Weiqiang, Ning Huang, Lina Sun, and Xiaolu Zheng. "Measurement and Analysis of MANET Resilience with Fault Tolerance Strategies." Mathematical Problems in Engineering 2017 (2017): 1–10. http://dx.doi.org/10.1155/2017/9806365.

Full text
Abstract:
Resilience is usually considered as the ability of network fault tolerance. To improve the resilience of MANET, fault tolerance strategies such as routing protocols are usually employed which will impact resilience of MANET. For resilience measurement and fault tolerance strategies’ efficiency evaluation, the impact of fault tolerance strategies deserves a detailed study. However, the general MANET resilience measurement methods do not consider the fault tolerance strategies as individual resilience influence factors, let alone reflecting the interplay among strategies that deployed on different network layers. Thus, it results in a limitation on efficiency assessment of fault tolerance strategies. In this paper, it models fault tolerance strategies for MANET resilience measurement with considering strategies as individual resilience influence factors. Firstly, through analyzing the features of fault tolerance strategies that deployed on physical and logical layers of network, we built a hierarchical network model to describe the resilience impact of strategies. Then, based on this network model, we proposed fault tolerance strategies model to measure resilience of MANET. Particularly, the model can well support the interplay study among different strategies through contrasting the quantitative value defined by strategy model. At last, a case study was given for verification and analysis.
APA, Harvard, Vancouver, ISO, and other styles
4

Ding, Dong, Lei Wang, Zhijie Yang, Kai Hu, and Hongjun He. "ACIMS: Analog CIM Simulator for DNN Resilience." Electronics 10, no. 6 (March 15, 2021): 686. http://dx.doi.org/10.3390/electronics10060686.

Full text
Abstract:
Analog Computing In Memory (ACIM) combines the advantages of both Compute In Memory (CIM) and analog computing, making it suitable for the design of energy-efficient hardware accelerators for computationally intensive DNN applications. However, their use will introduce hardware faults that decrease the accuracy of DNN. In this work, we take Sandwich-Ram as the real hardware example of ACIM and are the first to propose a fault injection and fault-aware training framework for it, named Analog Computing In Memory Simulator (ACIMS). Using this framework, we can simulate and repair the hardware faults of ACIM. The experimental results show that ACIMS can recover 91.0%, 93.7% and 89.8% of the DNN’s accuracy drop through retraining on the MNIST, SVHN and Cifar-10 datasets, respectively; moreover, their adjusted accuracy can reach 97.0%, 95.3% and 92.4%.
APA, Harvard, Vancouver, ISO, and other styles
5

SERVICE, TRAVIS, and DANIEL TAURITZ. "INCREASING INFRASTRUCTURE RESILIENCE THROUGH COMPETITIVE COEVOLUTION." New Mathematics and Natural Computation 05, no. 02 (July 2009): 441–57. http://dx.doi.org/10.1142/s1793005709001416.

Full text
Abstract:
The world is increasingly dependent on critical infrastructures such as the electric power grid, water, gas and oil transport systems. Due to this increasing dependence and inadequate infrastructure expansion, these systems are becoming increasingly stressed. These additional stresses leave these systems less resilient to external faults, both accidental and malicious than ever before. As a result of this increased vulnerability, many critical infrastructures are becoming susceptible to cascading failures, where an initial fault caused by an external force may induce a domino-effect of further component failures. An important implication is that traditional infrastructure risk analysis methods, often relying on Monte Carlo sampling of fault scenarios, are no longer sufficient. Instead, systematic analysis based on worst-case attacks by intelligent adversaries is essential. This paper describes a coevolutionary methodology to simultaneously discover low-effort high-impact faults and corresponding means of hardening infrastructures against them. We empirically validate our methodology through an electric power transmission system case study.
APA, Harvard, Vancouver, ISO, and other styles
6

Xie, Fei, Jun Yan, and Jun Shen. "A Novel PageRank-Based Fault Handling Strategy for Workflow Scheduling in Cloud Data Centers." International Journal of Web Services Research 18, no. 4 (October 2021): 1–26. http://dx.doi.org/10.4018/ijwsr.2021100101.

Full text
Abstract:
Unexpected faults result in unscheduled cloud outage, which negatively affects the completion of workflow tasks in the cloud. This paper presents a novel PageRank-based fault handling strategy to rescue workflow tasks at the faulty data center. The proposed approach uses a holistic view and considers the task attributes, the timeline scenario, and the overall cloud performance. A priority assignment system is developed based on the modified PageRank algorithm to prioritise workflow tasks. A min-max normalization method is applied to select the target data center and match the timeline at this data center. Additionally, a dynamic PageRank-constrained task scheduling algorithm is proposed to generate the task scheduling solution. The simulation results show that the proposed approach can achieve better fault handling performance, measured by task resilience ratio, workflow resilience ratio, and workflow continuity ratio in both the traditional 3-replica and the image backup cloud environment.
APA, Harvard, Vancouver, ISO, and other styles
7

Luo, Mon-Yen, and Chu-Sing Yang. "Enabling fault resilience for web services." Computer Communications 25, no. 3 (February 2002): 198–209. http://dx.doi.org/10.1016/s0140-3664(01)00363-2.

Full text
APA, Harvard, Vancouver, ISO, and other styles
8

Meilianda, Ella, Franck Lavigne, Biswajeet Pradhan, Patrick Wassmer, Darusman Darusman, and Marjolein Dohmen-Janssen. "Barrier Islands Resilience to Extreme Events: Do Earthquake and Tsunami Play a Role?" Water 13, no. 2 (January 13, 2021): 178. http://dx.doi.org/10.3390/w13020178.

Full text
Abstract:
Barrier islands are indicators of coastal resilience. Previous studies have proven that barrier islands are surprisingly resilient to extreme storm events. At present, little is known about barrier systems’ resilience to seismic events triggering tsunamis, co-seismic subsidence, and liquefaction. The objective of this study is, therefore, to investigate the morphological resilience of the barrier islands in responding to those secondary effects of seismic activity of the Sumatra–Andaman subduction zone and the Great Sumatran Fault system. Spatial analysis in Geographical Information Systems (GIS) was utilized to detect shoreline changes from the multi-source datasets of centennial time scale, including old topographic maps and satellite images from 1898 until 2017. Additionally, the earthquake and tsunami records and established conceptual models of storm effects to barrier systems, are corroborated to support possible forcing factors analysis. Two selected coastal sections possess different geomorphic settings are investigated: (1) Lambadeuk, the coast overlying the Sumatran Fault system, (2) Kuala Gigieng, located in between two segments of the Sumatran Fault System. Seven consecutive pairs of comparable old topographic maps and satellite images reveal remarkable morphological changes in the form of breaching, landward migrating, sinking, and complete disappearing in different periods of observation. While semi-protected embayed Lambadeuk is not resilient to repeated co-seismic land subsidence, the wave-dominated Kuala Gigieng coast is not resilient to the combination of tsunami and liquefaction events. The mega-tsunami triggered by the 2004 earthquake led to irreversible changes in the barrier islands on both coasts.
APA, Harvard, Vancouver, ISO, and other styles
9

Caseiro, Luís, and André Mendes. "Fault Analysis and Non-Redundant Fault Tolerance in 3-Level Double Conversion UPS Systems Using Finite-Control-Set Model Predictive Control." Energies 14, no. 8 (April 15, 2021): 2210. http://dx.doi.org/10.3390/en14082210.

Full text
Abstract:
Fault-tolerance is critical in power electronics, especially in Uninterruptible Power Supplies, given their role in protecting critical loads. Hence, it is crucial to develop fault-tolerant techniques to improve the resilience of these systems. This paper proposes a non-redundant fault-tolerant double conversion uninterruptible power supply based on 3-level converters. The proposed solution can correct open-circuit faults in all semiconductors (IGBTs and diodes) of all converters of the system (including the DC-DC converter), ensuring full-rated post-fault operation. This technique leverages the versatility of Finite-Control-Set Model Predictive Control to implement highly specific fault correction. This type of control enables a conditional exclusion of the switching states affected by each fault, allowing the converter to avoid these states when the fault compromises their output but still use them in all other conditions. Three main types of corrective actions are used: predictive controller adaptations, hardware reconfiguration, and DC bus voltage adjustment. However, highly differentiated corrective actions are taken depending on the fault type and location, maximizing post-fault performance in each case. Faults can be corrected simultaneously in all converters, as well as some combinations of multiple faults in the same converter. Experimental results are presented demonstrating the performance of the proposed solution.
APA, Harvard, Vancouver, ISO, and other styles
10

Rizzi, F., K. Morris, K. Sargsyan, P. Mycek, C. Safta, O. Le Maître, O. Knio, and B. Debusschere. "Partial differential equations preconditioner resilient to soft and hard faults." International Journal of High Performance Computing Applications 32, no. 5 (January 29, 2017): 658–73. http://dx.doi.org/10.1177/1094342016684975.

Full text
Abstract:
We present a domain-decomposition-based preconditioner for the solution of partial differential equations (PDEs) that is resilient to both soft and hard faults. The algorithm reformulates the PDE as a sampling problem, followed by a solution update through data manipulation that is resilient to both soft and hard faults. This reformulation allows us to recast the problem as a set of independent tasks, and exploit data locality to reduce global communication. We discuss two different parallel implementations: (a) a single program multiple data (SPMD) version based on a one-to-one mapping between subdomain and MPI processes responsible for both state and computation; and (b) an asynchronous server–client implementation where all state information is held by the servers and clients are designed solely as computational units. We present a scalability comparison of both implementations under nominal conditions, showing efficiency within ~80% for up to 12,000 cores. We present a resilience analysis under different fault scenarios based on the server–client implementation. This framework provides resiliency to hard faults such that if a client crashes, it stops asking for work, and the servers simply distribute the work among all of the other clients alive. Erroneous subdomain solves (e.g. due to soft faults) appear as corrupted data, which is either rejected if that causes a task to fail, or is seamlessly filtered out during the regression stage through a suitable noise model. Three different types of faults are modeled: hard faults modeling nodes (or clients) crashing; soft faults occurring during the communication of the tasks between server and clients; and soft faults occurring during task execution. We demonstrate the resiliency of the approach for a 2D elliptic PDE, and explore the effect of the faults at various failure rates.
APA, Harvard, Vancouver, ISO, and other styles
11

Yang, Lishan, Bin Nie, Adwait Jog, and Evgenia Smirni. "SUGAR." Proceedings of the ACM on Measurement and Analysis of Computing Systems 5, no. 1 (February 18, 2021): 1–29. http://dx.doi.org/10.1145/3447375.

Full text
Abstract:
As Graphics Processing Units (GPUs) are becoming a de facto solution for accelerating a wide range of applications, their reliable operation is becoming increasingly important. One of the major challenges in the domain of GPU reliability is to accurately measure GPGPU application error resilience. This challenge stems from the fact that a typical GPGPU application spawns a huge number of threads and then utilizes a large amount of potentially unreliable compute and memory resources available on the GPUs. As the number of possible fault locations can be in the billions, evaluating every fault and examining its effect on theapplication error resilience is impractical. Application resilience is evaluated via extensive fault injection campaigns based on sampling of an extensive fault site space. Typically, the larger the input of the GPGPU application, the longer the experimental campaign. In this work, we devise a methodology, SUGAR (Speeding Up GPGPU Application Resilience Estimation with input sizing), that dramatically speeds up the evaluation of GPGPU application error resilience by judicious input sizing. We show how analyzing a small fraction of the input is sufficient to estimate the application resilience with high accuracy and dramatically reduce the duration of experimentation. Key of our estimation methodology is the discovery of repeating patterns as a function of the input size. Using the well-established fact that error resilience in GPGPU applications is mostly determined by the dynamic instruction count at the thread level, we identify the patterns that allow us to accurately predict application error resilience for arbitrarily large inputs. For the cases that we examine in this paper, this new resilience estimation mechanism provides significant speedups (up to 1336 times) and 97.0 on the average, while keeping estimation errors to less than 1%.
APA, Harvard, Vancouver, ISO, and other styles
12

Yang, Lishan, Bin Nie, Adwait Jog, and Evgenia Smirni. "SUGAR: Speeding Up GPGPU Application Resilience Estimation with Input Sizing." ACM SIGMETRICS Performance Evaluation Review 49, no. 1 (June 22, 2022): 45–46. http://dx.doi.org/10.1145/3543516.3453917.

Full text
Abstract:
As Graphics Processing Units (GPUs) are becoming a de facto solution for accelerating a wide range of applications, their reliable operation is becoming increasingly important. One of the major challenges in the domain of GPU reliability is to accurately measure GPGPU application error resilience. This challenge stems from the fact that a typical GPGPU application spawns a huge number of threads and then utilizes a large amount of potentially unreliable compute and memory resources available on the GPUs. As the number of possible fault locations can be in the billions, evaluating every fault and examining its effect on the application error resilience is impractical. Application resilience is evaluated via extensive fault injection campaigns based on sampling of an extensive fault site space. Typically, the larger the input of the GPGPU application, the longer the experimental campaign. In this work, we devise a methodology, SUGAR (Speeding Up G PGPU Application Resilience Estimation with input sizing), that dramatically speeds up the evaluation of GPGPU application error resilience by judicious input sizing. We show how analyzing a small fraction of the input is sufficient to estimate the application resilience with high accuracy and dramatically reduce the duration of experimentation. Key to our estimation methodology is the discovery of repeating patterns as a function of the input size. Using the well-established fact that error resilience in GPGPU applications is mostly determined by the dynamic instruction (DI) count at the thread level, we discover the patterns that allow to accurately predict application error resilience for arbitrarily large inputs. For the cases that we examine in this paper, this new resilience estimation provides significant speedups (up to 1336 times) and 97.0 on the average, while keeping estimation errors to less than 1%, for details see the full version of this SIGMETRICS paper.
APA, Harvard, Vancouver, ISO, and other styles
13

Koo, Geun Wan, DongMyoung Joo, and Byoung Kuk Lee. "Enhanced Threshold Point Calculation Algorithm for Switch Fault Diagnosis in Grid Connected 3-Phase AC–DC PWM Converters." Energies 12, no. 10 (May 23, 2019): 1979. http://dx.doi.org/10.3390/en12101979.

Full text
Abstract:
The resilience of systems with alternating current (AC)–direct current (DC) converters has been investigated with the aim of improving switch fault diagnosis. To satisfy this aim, this paper proposes a switch fault diagnosis algorithm for three-phase AC–DC converters. The proposed algorithm operates using the phase current instead of the average current to reduce the calculation time required for fault switch detection. Moreover, a threshold point calculation method is derived using a theoretical analysis, which was lacking in previous research. Using the calculated threshold point, a switch fault diagnosis algorithm is obtained to detect faults independent of the load condition. Using the proposed algorithm, switch faults can be detected within 4 ms, which is the recommended value for uninterruptible power supply (UPS). The theoretical analysis, the operating principle, and the experimental results on a 3-kW grid-tied AC–DC converter test-bed are presented herein, which verify the performance of the proposed algorithm.
APA, Harvard, Vancouver, ISO, and other styles
14

Huang, Tianen, Zhenjie Wu, Yuantao Wang, Jian Tang, Xiang Li, Yajun Mo, Chengda Li, et al. "A Novel Pre-Dispatching Strategy of Power System under Extreme Weather." Processes 9, no. 12 (November 24, 2021): 2112. http://dx.doi.org/10.3390/pr9122112.

Full text
Abstract:
Pre-dispatch is an important way for distribution networks to cope with typhoon weather, enhance resilience and reduce economic losses. In order to accurately describe the faults and consequences of components’ failure in the distribution network, this paper establishes a pre-dispatch model to cope with typhoon weather based on line failures consequence analysis. First, Monte Carlo simulation is used to sample the typical fault scenarios of vulnerable lines. According to the location of switchgear, the distribution network is partitioned and a block breaker correlation matrix is established. Combined with the line fault status, a fault consequence model of distribution lines related to the pre-dispatching strategy is established. Then, the objective function is given to minimize the sum of the cost of the pre-dispatch operation and the power outage, and then establish a pre-dispatch model for the distribution network. In order to reduce the computational complexity, PH (Progressive Hedging) algorithm is used to solve the model. Finally, the IEEE-69 test system is used to analyze the effectiveness of the method. The results show that the proposed dispatching model can effectively avoid potential risks, reduce system economic losses and improve the resilience of power grids.
APA, Harvard, Vancouver, ISO, and other styles
15

Chapenko, V. P. "An algorithmic method for increasing ADC fault resilience." Automatic Control and Computer Sciences 45, no. 1 (February 2011): 53–59. http://dx.doi.org/10.3103/s0146411611010032.

Full text
APA, Harvard, Vancouver, ISO, and other styles
16

Najjar, W., and J. L. Gaudiot. "Network resilience: a measure of network fault tolerance." IEEE Transactions on Computers 39, no. 2 (1990): 174–81. http://dx.doi.org/10.1109/12.45203.

Full text
APA, Harvard, Vancouver, ISO, and other styles
17

Lacerda, Felipe G., Joseph M. Renes, and Renato Renner. "Classical Leakage Resilience from Fault-Tolerant Quantum Computation." Journal of Cryptology 32, no. 4 (August 8, 2019): 1071–94. http://dx.doi.org/10.1007/s00145-019-09310-6.

Full text
APA, Harvard, Vancouver, ISO, and other styles
18

Annarelli, Alessandro, Cinzia Battistella, and Fabio Nonino. "A Framework to Evaluate the Effects of Organizational Resilience on Service Quality." Sustainability 12, no. 3 (January 28, 2020): 958. http://dx.doi.org/10.3390/su12030958.

Full text
Abstract:
Disruptions and unexpected turbulences characterize the contemporary service industry and pose greater criticalities to organizations that have concerns about both their survival and their business sustainability. The answer to this challenge is to design an organizational system aimed at enhancing so-called organizational resilience. The aim of this work is to present and test an assessment methodology based on a framework that links key static and dynamic characteristics of firms as resilient systems and relates these characteristics with service quality preservation after disruptions. The framework adopts a set of indicators, namely resilience dimensions, to comprehend the real effects of resilience characteristics with service quality dimensions. Through the analysis of a real case, we provided evidence of how the model can be applied in a backward perspective to understand fault modes of specific events. Results show how inadequacies in terms of resilience characteristics determine losses of service quality, and also how the model can be used as a predictive tool to determine in which area companies should intervene in order to improve resilience and service quality.
APA, Harvard, Vancouver, ISO, and other styles
19

Ning, Xuefei, Guangjun Ge, Wenshuo Li, Zhenhua Zhu, Yin Zheng, Xiaoming Chen, Zhen Gao, Yu Wang, and Huazhong Yang. "FTT-NAS: Discovering Fault-tolerant Convolutional Neural Architecture." ACM Transactions on Design Automation of Electronic Systems 26, no. 6 (November 30, 2021): 1–24. http://dx.doi.org/10.1145/3460288.

Full text
Abstract:
With the fast evolvement of embedded deep-learning computing systems, applications powered by deep learning are moving from the cloud to the edge. When deploying neural networks (NNs) onto the devices under complex environments, there are various types of possible faults: soft errors caused by cosmic radiation and radioactive impurities, voltage instability, aging, temperature variations, malicious attackers, and so on. Thus, the safety risk of deploying NNs is now drawing much attention. In this article, after the analysis of the possible faults in various types of NN accelerators, we formalize and implement various fault models from the algorithmic perspective. We propose Fault-Tolerant Neural Architecture Search (FT-NAS) to automatically discover convolutional neural network (CNN) architectures that are reliable to various faults in nowadays devices. Then, we incorporate fault-tolerant training (FTT) in the search process to achieve better results, which is referred to as FTT-NAS. Experiments on CIFAR-10 show that the discovered architectures outperform other manually designed baseline architectures significantly, with comparable or fewer floating-point operations (FLOPs) and parameters. Specifically, with the same fault settings, F-FTT-Net discovered under the feature fault model achieves an accuracy of 86.2% (VS. 68.1% achieved by MobileNet-V2), and W-FTT-Net discovered under the weight fault model achieves an accuracy of 69.6% (VS. 60.8% achieved by ResNet-18). By inspecting the discovered architectures, we find that the operation primitives, the weight quantization range, the capacity of the model, and the connection pattern have influences on the fault resilience capability of NN models.
APA, Harvard, Vancouver, ISO, and other styles
20

Wang, Saiyi, Weiguo He, Haodong Shen, Tangyun Xu, Shichao Zhou, Jie Chen, and Yingying Zhao. "Resilience Assessment for HDG under Extreme Events." E3S Web of Conferences 256 (2021): 02020. http://dx.doi.org/10.1051/e3sconf/202125602020.

Full text
Abstract:
The resilience assessment method of hybrid AC-DC distribution grid is the key and basis of measuring the tolerance of the distribution network in face of extreme events and testing the effect of the resilience improvement strategy of the distribution network. This paper fully considers the characteristics of different extreme natural disasters and proposes a targeted resilience assessment method. For natural disasters, the fault rate model of the line under the influence of the disaster is constructed, and the fault set is generated based on Monte Carlo sampling. According to the operating characteristics, a set of evaluation indicators focusing on the core characteristics of resilience, including capability of strain, defense and recoverability, is constructed. The objective weight of each indicator and the comprehensive assessment results are obtained based relatively on the entropy weight method and the approximate ideal solution sorting method.
APA, Harvard, Vancouver, ISO, and other styles
21

SIPPER, MOSHE, MARCO TOMASSINI, and OLIVIER BEURET. "STUDYING PROBABILISTIC FAULTS IN EVOLVED NON-UNIFORM CELLULAR AUTOMATA." International Journal of Modern Physics C 07, no. 06 (December 1996): 923–39. http://dx.doi.org/10.1142/s0129183196000776.

Full text
Abstract:
We study the effects of random faults on the behavior of one-dimensional, non-uniform cellular automata (CA), where the local update rule need not be identical for all grid sites. The CA systems examined were obtained via an approach known as cellular programming, which involves the evolution of non-uniform CAs to perform non-trivial computational tasks. Using the "system replicas" methodology, involving a comparison between a perfect, non-perturbed version of the CA and a faulty one, we find that our evolved systems exhibit graceful degradation in performance, able to tolerate a certain level of faults. We then "zoom" into the fault-tolerant zone, where "good" computational behavior is exhibited, introducing measures to fine-tune our understanding of the faulty CAs' operation. We study the error level as a function of time and space, as well as the recuperation time needed to recover from faults. Our investigation reveals an intricate interplay between temporal and spatial factors, with the presence of different rules in the grid giving rise to complex dynamics. Studies along this line may have applications to future computing systems that will contain thousands or even millions of computing elements, rendering crucial the issue of resilience.
APA, Harvard, Vancouver, ISO, and other styles
22

Wang, Jing, Hongyun Huang, and Jian Hou. "Discriminator Based Resilient Multi-agent Deep Deterministic Policy Gradient Under Uncertain Faulty Agents." Journal of Physics: Conference Series 2258, no. 1 (April 1, 2022): 012064. http://dx.doi.org/10.1088/1742-6596/2258/1/012064.

Full text
Abstract:
Abstract In recent years, Multi-agent reinforcement learning (MARL) is widely applied in various of fields, to achieve a global goal in a centralized or distributed manner. However, during its application it is crucial to be fault-tolerance as some agents behave abnormal. In this paper, we propose a Resilient Multi-gent Deep Deterministic Policy Gradient (RMADDPG) algorithm to achieve a cooperative task in the presence of faulty agents via centralized training decentralized execution. At training stage, each normal agent observes and records information only from other normal ones, without access to the faulty ones. Meanwhile, a discriminator is generated based on the well-trained actor network to identify each faulty agent via supervised learning. Followed by executing stage, each normal agent selects its action based on local observation according to its actor network and its discriminator, so as to achieve certain system goal. Specifically, RMADDPG offers a scheme to train agents for improved resilience against arbitrary number of faulty agents. Finally, a cooperative navigation experiment is provided to validate the effectiveness of the proposed algorithm.
APA, Harvard, Vancouver, ISO, and other styles
23

Ibrahim, Mariam, and Asma Alkhraibat. "Resiliency Assessment of Microgrid Systems." Applied Sciences 10, no. 5 (March 6, 2020): 1824. http://dx.doi.org/10.3390/app10051824.

Full text
Abstract:
Measuring resiliency of smart grid systems is one of the vital topics towards maintaining a reliable and efficient operation under attacks. This paper introduces a set of factors that are utilized for resiliency quantification of microgrid (MG) systems. The level of resilience (LoR) measure is determined by examining the voltage sag percentage, the level of performance reduction (LoPR) as measured by percentage of reduction of load served, recovery time (RT), which is the time system takes to detect and recover from an attack/fault, and the time to reach Power Balance state (Tb) during the islanded mode. As an illustrative example, a comparison based on the resiliency level is presented for two topologies of MGs under an attack scenario.
APA, Harvard, Vancouver, ISO, and other styles
24

Liu, Huiqing, Ruiting Zhang, and Shunzhe Zhang. "On the global strong resilience of fault Hamiltonian graphs." Applied Mathematics and Computation 418 (April 2022): 126841. http://dx.doi.org/10.1016/j.amc.2021.126841.

Full text
APA, Harvard, Vancouver, ISO, and other styles
25

Moríñigo, José A., Andrés Bustos, and Rafael Mayo-García. "Error resilience of three GMRES implementations under fault injection." Journal of Supercomputing 78, no. 5 (November 5, 2021): 7158–85. http://dx.doi.org/10.1007/s11227-021-04148-x.

Full text
APA, Harvard, Vancouver, ISO, and other styles
26

Zhang, Hongyan, Lars Bauer, Michael Andreas Kochte, Eric Schneider, Hans-Joachim Wunderlich, and Jorg Henkel. "Aging Resilience and Fault Tolerance in Runtime Reconfigurable Architectures." IEEE Transactions on Computers 66, no. 6 (June 1, 2017): 957–70. http://dx.doi.org/10.1109/tc.2016.2616405.

Full text
APA, Harvard, Vancouver, ISO, and other styles
27

Jiang, Linshan, Rui Tan, and Arvind Easwaran. "Resilience Bounds of Network Clock Synchronization with Fault Correction." ACM Transactions on Sensor Networks 16, no. 4 (October 12, 2020): 1–30. http://dx.doi.org/10.1145/3409804.

Full text
APA, Harvard, Vancouver, ISO, and other styles
28

Sindia, Suraj, and Vishwani D. Agrawal. "Neural Network Guided Spatial Fault Resilience in Array Processors." Journal of Electronic Testing 29, no. 4 (July 25, 2013): 473–83. http://dx.doi.org/10.1007/s10836-013-5394-8.

Full text
APA, Harvard, Vancouver, ISO, and other styles
29

Chen, Pohan, and Kai Sun. "An active fault management for microgrids resilience safety-assurance." iEnergy 1, no. 4 (December 2022): 394. http://dx.doi.org/10.23919/ien.2022.0039.

Full text
APA, Harvard, Vancouver, ISO, and other styles
30

Damian, Ioan-Cătălin, Mircea Eremia, and Lucian Toma. "Fault Simulations in a Multiterminal High Voltage DC Network with Modular Multilevel Converters Using Full-Bridge Submodules." Energies 14, no. 6 (March 16, 2021): 1653. http://dx.doi.org/10.3390/en14061653.

Full text
Abstract:
The concept of high-voltage DC transmission using a multiterminal configuration is presently a central topic of research and investment due to rekindled interest in renewable energy resource integration. Moreover, great attention is given to fault analysis, which leads to the necessity of developing proper tools that enable proficient dynamic simulations. This paper leverages models and control system design techniques and demonstrates their appropriateness for scenarios in which faults are applied. Furthermore, this paper relies on full-bridge submodule topologies in order to underline the increase in resilience that such a configuration brings to the multiterminal DC network, after an unexpected disturbance. Therefore, strong focus is given to fault response, considering that converters use a full-bridge topology and that overhead power lines connect the terminals.
APA, Harvard, Vancouver, ISO, and other styles
31

Wódczak, Michał, Nikolay Tcholtchev, Bruno Vidalenc, and Yuhong Li. "Design and Evaluation of Techniques for Resilience and Survivability of the Routing Node." International Journal of Adaptive, Resilient and Autonomic Systems 4, no. 4 (October 2013): 36–63. http://dx.doi.org/10.4018/ijaras.2013100103.

Full text
Abstract:
This paper presents the final results achieved within the EFIPSANS project on the topic of resilience and survivability in autonomic networks. In particular, the outcome of the investigation of the following issues is presented: adaptive level of recovery, unified architecture for autonomic routing resilience, synergies between autonomic fault-management and resilience in self-managing networks, as well as network resilience through autonomic reroute mechanism enhanced with multi-path node-to-node cooperation. The described ideas are supported by exhaustive descriptions and analyses featuring extensive validation results aimed to prove the applicability of the proposed concepts.
APA, Harvard, Vancouver, ISO, and other styles
32

Sadashiva, Vinod, Richard Mowll, S. R. Uma, Sheng L. Lin, David Heron, Nick Horspool, Mostafa Nayyerloo, et al. "Improving Wellington region’s resilience through integrated infrastructure resilience investments." Bulletin of the New Zealand Society for Earthquake Engineering 54, no. 2 (June 1, 2021): 117–34. http://dx.doi.org/10.5459/bnzsee.54.2.117-134.

Full text
Abstract:
Infrastructure networks (e.g. transport, water, energy, telecommunications) support life and the economy of communities of all sizes. New Zealand has witnessed several damaging earthquakes in the last decade that provide a compelling case to accelerate building resilient infrastructures in the country, so we can minimize any adverse impacts from future earthquakes. One of the regions that is highly vulnerable to earthquakes is Wellington. With the region’s population continually expanding and placing increased demands on its ageing infrastructures, with limited redundancy in the networks, and with many of its assets close to and / or intersecting fault lines, a large earthquake in the region could be highly disruptive, potentially resulting in serious social and economic consequences. While it may not be possible to completely avoid the impacts, they can be reduced. This paper provides an overview of the process taken in delivering a Wellington Lifelines Group report that demonstrates how impacts from a future major earthquake can be reduced through integrated and targeted infrastructure resilience investments. To quantify the benefits that can be achieved by making the proposed investments, impact modelling on nine different lifeline utilities in the Wellington metropolitan area were conducted; the assessment approach taken, and results derived and their use to prioritise resilience investments, are shown in this paper for selected key networks. The time-stamped service outage maps and tables produced from this work formed an essential input to evaluate and demonstrate the impact of the proposed resilience initiatives on the regional and national economies.
APA, Harvard, Vancouver, ISO, and other styles
33

Najjar, Walid, and Pradip K. Srimani. "Conditional Disconnection Probability in Star Graphs." VLSI Design 1, no. 1 (January 1, 1993): 61–70. http://dx.doi.org/10.1155/1993/84924.

Full text
Abstract:
Recently a new interconnection topology has been proposed which compares very favorably with the well known n-cubes (hypercubes) in terms of degree, diameter, fault-tolerance and applicability in VLSI design. In this paper we use a new probabilistic measure of network fault tolerance expressed as the probability of disconnection to study the robustness of star graphs. We derive analytical approximation for the disconnection probability of star graphs and verify it with Monte Carlo simulation. We then compare the results with hypercubes [4]. We also use the measures of network resilience and relative network resilience to evaluate the effects of the disconnection probability on the reliability of star graphs.
APA, Harvard, Vancouver, ISO, and other styles
34

Gilberto Bratti Volken, Felipe, and Claudio Roberto do Rosário. "The use of resilience engineering tools for fault propagation mitigation in sociotechnical systems." Revista Gestão da Produção Operações e Sistemas 16, no. 2 (June 1, 2021): 39–73. http://dx.doi.org/10.15675/gepros.v16i2.2720.

Full text
Abstract:
Purpose – To demonstrate an application of resilience engineering, aiming to improve productivity and quality by decreasing variability and fault propagation in the production functions of a sociotechnical system, the object of the study, in an asphalt concrete production plant. Design/methodology/approach – The approach used was the functional resonance analysis method, where experts in sociotechnical systems were interviewed, eliciting their knowledge on the aspects of interaction between production functions. The elicited knowledge was entered into FRAM Model Visualizer 0.4.1 software, which presents a graphical map of the system and allows the number of couplings (NC) to be analyzed. Findings – The analysis of the NCs highlighted that the laboratory control, burner, drum dryer, and baghouse functions showed a high potential for variability absorption, while the PLC control system and laboratory control functions have a high potential for fault propagation. Both groups were examined and measured explored within the scope of resilience engineering that enhance their roles in mitigating resilience within the system. Originality/value – The replication of the resilience engineering discussed in this article enables gains in productivity and quality in sociotechnical systems for the same segment; the application of the functional resonance analysis methodology demonstrated can benefit further studies on productive systems with interaction between people and technology. Keywords - Resilience engineering, productive systems, sociotechnical systems, FRAM.
APA, Harvard, Vancouver, ISO, and other styles
35

Ananda, Giani, Taufika Ophiyandri, and Edi Hasymi. "Resilience measurement of Padang city’s infrastructures toward multi-hazard." E3S Web of Conferences 156 (2020): 01003. http://dx.doi.org/10.1051/e3sconf/202015601003.

Full text
Abstract:
The complexity of geographical conditions and regional morphology of Padang City have caused it to be at high risk of multi-hazard. Padang City is located near the meeting point of the Indo-Australian Plate and the Eurasian Plate, and also on the Sumatra Fault line (Semangko Fault). Therefore, strong infrastructures are needed in order to minimize the impact of the risk of multi-hazard. This study is conducted to measure the resilience of Padang City’s infrastructures toward multi-hazard and provide recommendations to improve the resilience of Padang City’s infrastructures toward Multi-hazard. This study was conducted with qualitative methods and presented quantitatively in the form of diagrams. The measurement is carried out based on the concept of city toughness measurements made by UNISDR known as the "Scorecard". This study only focuses on essential 8 about "Increase Infrastructure Resilience" and essential 10 about "Expedite Recovery and Build Back Better". From this study, it can be concluded that the resilience of Padang City infrastructures is still relatively low so several recommendations that are expected will increase the resilience of Padang City’s infrastructures are proposed, that are; in-depth assessment, data collection and supervision monitoring of infrastructures, important assets, and protective infrastructure.
APA, Harvard, Vancouver, ISO, and other styles
36

Rivera Torres, Pedro Juan, Carlos Gershenson García, María Fernanda Sánchez Puig, and Samir Kanaan Izquierdo. "Reinforcement Learning with Probabilistic Boolean Network Models of Smart Grid Devices." Complexity 2022 (June 13, 2022): 1–15. http://dx.doi.org/10.1155/2022/3652441.

Full text
Abstract:
The area of smart power grids needs to constantly improve its efficiency and resilience, to provide high quality electrical power in a resilient grid, while managing faults and avoiding failures. Achieving this requires high component reliability, adequate maintenance, and a studied failure occurrence. Correct system operation involves those activities and novel methodologies to detect, classify, and isolate faults and failures and model and simulate processes with predictive algorithms and analytics (using data analysis and asset condition to plan and perform activities). In this paper, we showcase the application of a complex-adaptive, self-organizing modeling method, and Probabilistic Boolean Networks (PBNs), as a way towards the understanding of the dynamics of smart grid devices, and to model and characterize their behavior. This work demonstrates that PBNs are equivalent to the standard Reinforcement Learning Cycle, in which the agent/model has an interaction with its environment and receives feedback from it in the form of a reward signal. Different reward structures were created to characterize preferred behavior. This information can be used to guide the PBN to avoid fault conditions and failures.
APA, Harvard, Vancouver, ISO, and other styles
37

Tan, Guang, and Stephen A. Jarvis. "Improving the Fault Resilience of Overlay Multicast for Media Streaming." IEEE Transactions on Parallel and Distributed Systems 18, no. 6 (June 2007): 721–34. http://dx.doi.org/10.1109/tpds.2007.1054.

Full text
APA, Harvard, Vancouver, ISO, and other styles
38

Savaliya, Abhishek, Rutvij H. Jhaveri, Qin Xin, Saad Alqithami, Sagar Ramani, and Tariq Ahamed Ahanger. "Securing industrial communication with software-defined networking." Mathematical Biosciences and Engineering 18, no. 6 (2021): 8298–313. http://dx.doi.org/10.3934/mbe.2021411.

Full text
Abstract:
<abstract> <p>Industrial Cyber-Physical Systems (CPSs) require flexible and tolerant communication networks to overcome commonly occurring security problems and denial-of-service such as links failure and networks congestion that might be due to direct or indirect network attacks. In this work, we take advantage of Software-defined networking (SDN) as an important networking paradigm that provide real-time fault resilience since it is capable of global network visibility and programmability. We consider OpenFlow as an SDN protocol that enables interaction between the SDN controller and forwarding plane of network devices. We employ multiple machine learning algorithms to enhance the decision making in the SDN controller. Integrating machine learning with network resilience solutions can effectively address the challenge of predicting and classifying network traffic and thus, providing real-time network resilience and higher security level. The aim is to address network resilience by proposing an intelligent recommender system that recommends paths in real-time based on predicting link failures and network congestions. We use statistical data of the network such as link propagation delay, the number of packets/bytes received and transmitted by each OpenFlow switch on a specific port. Different state-of-art machine learning models has been implemented such as logistic regression, K-nearest neighbors, support vector machine, and decision tree to train these models in normal state, links failure and congestion conditions. The models are evaluated on the Mininet emulation testbed and provide accuracies ranging from around 91–99% on the test data. The machine learning model with the highest accuracy is utilized in the intelligent recommender system of the SDN controller which helps in selecting resilient paths to achieve a better security and quality-of-service in the network. This real-time recommender system helps the controller to take reactive measures to improve network resilience and security by avoiding faulty paths during path discovery and establishment.</p> </abstract>
APA, Harvard, Vancouver, ISO, and other styles
39

Davies, Alistair, Conrad Zorn, Thomas Wilson, Liam Wotherspoon, Sarah Beavan, Tim Davies, and Matthew Hughes. "Infrastructure failure propagations and recovery strategies from an Alpine Fault earthquake scenario." Bulletin of the New Zealand Society for Earthquake Engineering 54, no. 2 (June 1, 2021): 82–96. http://dx.doi.org/10.5459/bnzsee.54.2.82-96.

Full text
Abstract:
While it is well established that community members should participate in resilience planning, participation with genuine decision-making power remains rare. We detail an end-to-end disaster impact reduction modelling framework for infrastructure networks, embedded within a scenario-based participatory approach. Utilising the AF8+ earthquake scenario, we simulate hazard exposure, asset failure and recovery of interdependent critical infrastructure networks. Quantifying service levels temporally offers insights into possible interdependent network performance and community disconnection from national networks, not apparent when studying each infrastructure in isolation. Sequencing participation enables feedbacks between integrated modelling and participants’ impact assessments. Shared ownership of modelling outputs advances stakeholders’ understanding of resilience measures, allowing real-time implementation, increasing community resilience. Readily understood by central government, this format may increase support and resourcing, if nationally significant. Finally, this method tested integrated modelling and impacts assessments, identifying and enabling improvements for both.
APA, Harvard, Vancouver, ISO, and other styles
40

Chopra, Shauhrat S., Trent Dillon, Melissa M. Bilec, and Vikas Khanna. "A network-based framework for assessing infrastructure resilience: a case study of the London metro system." Journal of The Royal Society Interface 13, no. 118 (May 2016): 20160113. http://dx.doi.org/10.1098/rsif.2016.0113.

Full text
Abstract:
Modern society is increasingly dependent on the stability of a complex system of interdependent infrastructure sectors. It is imperative to build resilience of large-scale infrastructures like metro systems for addressing the threat of natural disasters and man-made attacks in urban areas. Analysis is needed to ensure that these systems are capable of withstanding and containing unexpected perturbations, and develop heuristic strategies for guiding the design of more resilient networks in the future. We present a comprehensive, multi-pronged framework that analyses information on network topology, spatial organization and passenger flow to understand the resilience of the London metro system. Topology of the London metro system is not fault tolerant in terms of maintaining connectivity at the periphery of the network since it does not exhibit small-world properties. The passenger strength distribution follows a power law, suggesting that while the London metro system is robust to random failures, it is vulnerable to disruptions on a few critical stations. The analysis further identifies particular sources of structural and functional vulnerabilities that need to be mitigated for improving the resilience of the London metro network. The insights from our framework provide useful strategies to build resilience for both existing and upcoming metro systems.
APA, Harvard, Vancouver, ISO, and other styles
41

Lin, Yue, Ya Nan Gao, Song Tao Fan, and Chun Xu. "Design and Simulation of an Improved Five Redundancy GNC System Architecture." Applied Mechanics and Materials 543-547 (March 2014): 1423–27. http://dx.doi.org/10.4028/www.scientific.net/amm.543-547.1423.

Full text
Abstract:
Aerospace plane, the next generation manned spacecraft has the characteristics of high reliability and high security. Therefore, a five modular redundancy byzantine fault tolerant architecture is propose for the guidance, navigation and control (GNC) system. This architecture can give the system characteristic of double fault resilience, and is also suitable for solving Byzantine general problem. A semi-physical simulation is done, which is based on 1553B data bus and VxWorks operating system. The simulation result has verified the fault tolerant architecture and feasibility of the system implementation.
APA, Harvard, Vancouver, ISO, and other styles
42

Ali, Nazakat, Manzoor Hussain, and Jang-Eui Hong. "Fault-Tolerance by Resilient State Transition for Collaborative Cyber-Physical Systems." Mathematics 9, no. 22 (November 10, 2021): 2851. http://dx.doi.org/10.3390/math9222851.

Full text
Abstract:
Collaborative Cyber-Physical Systems (CCPS) are systems where several individual cyber-physical systems collaborate to perform a single task. The safety of a single Cyber-Physical System (CPS) can be achieved by applying a safety mechanism and following standard processes defined in ISO 26262 and IEC 61508. However, due to heterogeneity, complexity, variability, independence, self-adaptation, and dynamic nature, functional operations for CCPS can threaten system safety. In contrast to fail-safe systems, where, for instance, the system leads to a safe state when an actuator shuts down due to a fault, the system has to be fail-operational in autonomous driving cases, i.e., a shutdown of a platooning member vehicle during operation on the road is unacceptable. Instead, the vehicle should continue its operation with degraded performance until a safe state is reached or returned to its original state in case of temporal faults. Thus, this paper proposes an approach that considers the resilient behavior of collaborative systems to achieve the fail-operational goal in autonomous platooning systems. First, we extended the state transition diagram and introduced additional elements such as failures, mitigation strategies, and safe exit to achieve resilience in autonomous platooning systems. The extended state transition diagram is called the Resilient State Transition Diagram (R-STD). Second, an autonomous platooning system’s perception, communication, and ego-motion failures are modeled using the proposed R-STD to check its effectiveness. Third, VENTOS simulator is used to verify the resulting resilient transitions of R-STD in a simulation environment. Results show that a resilient state transition approach achieves the fail-operational goal in the autonomous platooning system.
APA, Harvard, Vancouver, ISO, and other styles
43

Benacchio, Tommaso, Luca Bonaventura, Mirco Altenbernd, Chris D. Cantwell, Peter D. Düben, Mike Gillard, Luc Giraud, et al. "Resilience and fault tolerance in high-performance computing for numerical weather and climate prediction." International Journal of High Performance Computing Applications 35, no. 4 (February 8, 2021): 285–311. http://dx.doi.org/10.1177/1094342021990433.

Full text
Abstract:
Progress in numerical weather and climate prediction accuracy greatly depends on the growth of the available computing power. As the number of cores in top computing facilities pushes into the millions, increased average frequency of hardware and software failures forces users to review their algorithms and systems in order to protect simulations from breakdown. This report surveys hardware, application-level and algorithm-level resilience approaches of particular relevance to time-critical numerical weather and climate prediction systems. A selection of applicable existing strategies is analysed, featuring interpolation-restart and compressed checkpointing for the numerical schemes, in-memory checkpointing, user-level failure mitigation and backup-based methods for the systems. Numerical examples showcase the performance of the techniques in addressing faults, with particular emphasis on iterative solvers for linear systems, a staple of atmospheric fluid flow solvers. The potential impact of these strategies is discussed in relation to current development of numerical weather prediction algorithms and systems towards the exascale. Trade-offs between performance, efficiency and effectiveness of resiliency strategies are analysed and some recommendations outlined for future developments.
APA, Harvard, Vancouver, ISO, and other styles
44

Melo, Douglas R., Cesar A. Zeferino, Luigi Dilillo, and Eduardo A. Bezerra. "Maximizing the Inner Resilience of a Network-on-Chip through Router Controllers Design." Sensors 19, no. 24 (December 9, 2019): 5416. http://dx.doi.org/10.3390/s19245416.

Full text
Abstract:
Reducing component size and increasing the operating frequency of integrated circuits makes the Systems-on-Chip (SoCs) more susceptible to faults. Faults can cause errors, and errors can be propagated and lead to a system failure. SoCs employing many cores rely on a Network-on-Chip (NoC) as the interconnect architecture. In this context, this study explores alternatives to implement the flow regulation, routing, and arbitration controllers of an NoC router aiming at minimizing error propagation. For this purpose, a router with Finite-State Machine (FSM)-based controllers was developed targeting low use of logical resources and design flexibility for implementation in FPGA devices. We elaborated and compared the synthesis and simulation results of architectures that vary their controllers on Moore and Mealy FSMs, as well as the Triple Modular Redundancy (TMR) hardening application. Experimental results showed that the routing controller was the most critical one and that migrating a Moore to a Mealy controller offered a lower error propagation rate and higher performance than the application of TMR. We intended to use the proposed router architecture to integrate cores in a fault-tolerant NoC-based system for data processing in harsh environments, such as in space applications.
APA, Harvard, Vancouver, ISO, and other styles
45

Chalishazar, Vishvas H., Ted K. A. Brekken, Darin Johnson, Kent Yu, James Newell, King Chin, Rob Weik, et al. "Connecting Risk and Resilience for a Power System Using the Portland Hills Fault Case Study." Processes 8, no. 10 (September 23, 2020): 1200. http://dx.doi.org/10.3390/pr8101200.

Full text
Abstract:
Active seismic faults in the Pacific Northwest area have encouraged electric utilities in the region to deeply contemplate and proactively intervene to support grid resilience. To further this effort this research introduces Monte Carlo (MC)-based power system modeling as a means to inform the Performance Based Earthquake Engineering method and simulates 100,000 sample earthquakes of a 6.8 magnitude (M6.8) Portland Hills Fault (PHF) scenario in the Portland General Electric (PGE) service territory as a proof of concept. This paper also proposes the resilience metric Seismic Load Recovery Factor (SLRF) to quantify the recovery of a downed power system and thus can be used to quantify earthquake economic risk. Using MC results, the SLRF was evaluated to be 19.7 h and the expected economic consequence cost of a M6.8 PHF event was found to be $180 million with an annualized risk of $90,000 given the event’s 1 in 2000 year probability of occurrence. The MC results also identified the eight most consequential substations in the PGE system—i.e., those that contributed to maximum load loss. This paper concludes that retrofitting these substations reduced the expected consequence cost of a M6.8 PHF event to $117 million.
APA, Harvard, Vancouver, ISO, and other styles
46

Ramani, Sagar, and Rutvij H. Jhaveri. "ML-Based Delay Attack Detection and Isolation for Fault-Tolerant Software-Defined Industrial Networks." Sensors 22, no. 18 (September 14, 2022): 6958. http://dx.doi.org/10.3390/s22186958.

Full text
Abstract:
Traditional security mechanisms find difficulties in dealing with intelligent assaults in cyber-physical systems (CPSs) despite modern information and communication technologies. Furthermore, resource consumption in software-defined networks (SDNs) in industrial organizations is usually on a larger scale, and the present routing algorithms fail to address this issue. In this paper, we present a real-time delay attack detection and isolation scheme for fault-tolerant software-defined industrial networks. The primary goal of the delay attack is to lower the resilience of our previously proposed scheme, SDN-resilience manager (SDN-RM). The attacker compromises the OpenFlow switch and launches an attack by delaying the link layer discovery protocol (LLDP) packets. As a result, the performance of SDN-RM is degraded and the success rate decreases significantly. In this work, we developed a machine learning (ML)-based attack detection and isolation mechanism, which extends our previous work, SDN-RM. Predicting and labeling malicious switches in an SDN-enabled network is a challenge that can be successfully addressed by integrating ML with network resilience solutions. Therefore, we propose a delay-based attack detection and isolation scheme (DA-DIS), which avoids malicious switches from entering the routes by combining an ML mechanism along with a route-handoff mechanism. DA-DIS increases network resilience by increasing success rate and network throughput.
APA, Harvard, Vancouver, ISO, and other styles
47

Ge, Xiaojun, Minyu Chen, Yaming Mao, Lizhen Lu, Xing He, and Qian Ai. "A Novel Fault Location Algorithm in Distribution Network Based on Power-on Signal and Power-off Signal." Journal of Physics: Conference Series 2031, no. 1 (September 1, 2021): 012033. http://dx.doi.org/10.1088/1742-6596/2031/1/012033.

Full text
Abstract:
Abstract In recent years, with the continuous promotion of clean and low-carbon energy strategy, power system’s operation mode has been transformed gradually. Consequently, the extended time of fault location when it occurs will jeopardize the fault restoration of distribution system and reduce the grid resilience. Hence, this paper proposes a novel fault location algorithm based on the combination of power-on signal and power-off signal, which can improve the efficiency of the fault location. First, the principle of numbering node is presented, making it convenient for computer to identify the relationship between device in the distribution network. Then, some assumptions and principles of the algorithm are shown, together with the termination condition of the proposed algorithm. Finally, case study involving an actual distribution network is carried out. Results show that the proposed algorithm is able to achieve accurate fault location meanwhile improve the efficiency of fault location.
APA, Harvard, Vancouver, ISO, and other styles
48

Bi, Shanshan, Tianchen Wang, Lei Wang, and Maciej Zawodniok. "Novel cyber fault prognosis and resilience control for cyber–physical systems." IET Cyber-Physical Systems: Theory & Applications 4, no. 4 (December 1, 2019): 304–12. http://dx.doi.org/10.1049/iet-cps.2018.5061.

Full text
APA, Harvard, Vancouver, ISO, and other styles
49

Chen, Sui, Greg Bronevetsky, Bin Li, Marc Casas Guix, and Lu Peng. "A framework for evaluating comprehensive fault resilience mechanisms in numerical programs." Journal of Supercomputing 71, no. 8 (April 24, 2015): 2963–84. http://dx.doi.org/10.1007/s11227-015-1422-z.

Full text
APA, Harvard, Vancouver, ISO, and other styles
50

Van Waes, Jonas, Dries Vanoost, Jens Vankeirsbilck, Jonas Lannoo, Davy Pissoort, and Jeroen Boydens. "Resilience of Error Correction Codes Against Harsh Electromagnetic Disturbances: Fault Mechanisms." IEEE Transactions on Electromagnetic Compatibility 62, no. 4 (August 2020): 1017–27. http://dx.doi.org/10.1109/temc.2019.2931369.

Full text
APA, Harvard, Vancouver, ISO, and other styles
We offer discounts on all premium plans for authors whose works are included in thematic literature selections. Contact us to get a unique promo code!

To the bibliography