*Condition-Based Maintenance for Data Center Operations Management DOI: http://dx.doi.org/10.5772/intechopen.93945*

*Operations Management - Emerging Trend in the Digital Era*

respectively. The failure limit line marks the boundary between the operating and failure zones. If the forecasted trend line reaches or exceeds the failure limit, appropriate maintenance can be planned and scheduled ahead of the forecast point [10]. The ability to predict the future deterioration trend is the core of the prognostics method in the preventive maintenance strategy.

**Figure 4.** *The principle of the prognostics method.*

PPM can be defined as a strategic approach to improving the availability and reliability performance of a particular data center device or system. CBM is one type of PPM that extrapolates and predicts device or system condition over time, utilizing probability equations to assess and predict the downtime risks.

How can those causes of data center failure be prevented? First, redundant system design is the foremost solution for preventing primary failures, while selecting devices and systems with the highest MTBF is the next best option. The Uptime Tier Classification [11] and BICSI-002 [12] classify the solutions that protect against the causes of failure. **Figure 5** presents the Uptime levels of protection, in which Tier 4 is the highest and Tier 1 the lowest level of system protection, while **Table 1** presents the BICSI 002 levels of protection, in which Class F0 is the lowest and Class F4 the highest level of system protection. The annual allowable time planned for maintenance is the crucial factor in preventing data center downtime.

**Figure 5.** *Uptime data center tier classification.*

#### **Table 1.** *BICSI 002 system reliability classification.*

To reinforce system reliability, Class F4 and Class F3 specify PDS topologies of 2(N + 1) and of 2N or N + 1, respectively. These topologies make CBM more robust, because maintenance can be carried out with minimal downtime effect on the entire system.
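The redundancy notations named above (N + 1, 2N, and 2(N + 1)) can be read as installed unit counts relative to the N units the load actually needs. The sketch below is illustrative only: the function names and the example value N = 2 are assumptions made for this example and do not come from the BICSI 002 or Uptime documents.

```python
def installed_units(n: int, topology: str) -> int:
    """Return the number of installed units for a load that needs n units.

    The topology labels follow the common redundancy notation:
    N (no spare), N+1 (one spare), 2N (two full systems),
    2(N+1) (two full systems, each with one spare).
    """
    if n < 1:
        raise ValueError("n must be at least 1")
    schemes = {
        "N": lambda k: k,
        "N+1": lambda k: k + 1,
        "2N": lambda k: 2 * k,
        "2(N+1)": lambda k: 2 * (k + 1),
    }
    if topology not in schemes:
        raise ValueError(f"unknown topology: {topology}")
    return schemes[topology](n)


def spare_units(n: int, topology: str) -> int:
    """Units that can be out of service while n units still carry the load."""
    return installed_units(n, topology) - n


if __name__ == "__main__":
    # Hypothetical example: the critical load needs N = 2 units.
    for label in ("N", "N+1", "2N", "2(N+1)"):
        print(label, installed_units(2, label), spare_units(2, label))
```

With N = 2, the counts are 3 units for N + 1, 4 for 2N, and 6 for 2(N + 1). The spare count is a rough indicator of how many units can be taken out for condition-based maintenance without dropping below the capacity the load requires.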


**3. Research methodology**

The power distribution system (PDS) of the data center was examined as the case study for this research. Four Uptime topology prototypes (Tier I, Tier II, Tier III, and Tier IV) and five BICSI topology prototypes (Class F0, F1, F2, F3, and F4) were used to demonstrate operations and maintenance management. Plan-Do-Check-Act (PDCA) was applied through the PPM model. This process accumulates data from earlier cycles and, at the same time, certifies the training data for fault diagnostics and prognostics. Fault diagnostics are performed through auto-discovery in DCIM software. StruxureWare software [15] was deployed for auto-discovery, that is, the ability to detect a device, model it, and measure the relevant data points of that equipment. The PPM approach was examined using a system flow diagram (SFD), as depicted in **Figure 8**.

The SFD begins with data collection from sensing devices at the condition monitoring stage, followed by data processing and data analytics, and then feature selection to form a statistical model, before passing to fault diagnostics and prognostics. The output of the prognostic process forms a data set that is used to estimate the remaining useful life (RUL), which in turn is the input for predictive maintenance [16]. Predictive maintenance and CBM are synchronized processing with the
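The prognostic step of the SFD, in which a deterioration trend is forecast and compared against a failure limit to obtain the RUL, can be sketched as follows. This is a minimal sketch under stated assumptions: the chapter does not specify the forecasting model, so an ordinary least-squares straight line is used here as a stand-in, the condition indicator is assumed to increase toward the failure limit, and the names `fit_trend` and `estimate_rul` are hypothetical.

```python
from typing import Optional, Sequence, Tuple


def fit_trend(times: Sequence[float], values: Sequence[float]) -> Tuple[float, float]:
    """Fit value = slope * time + intercept by ordinary least squares."""
    n = len(times)
    if n < 2 or n != len(values):
        raise ValueError("need at least two (time, value) pairs of equal length")
    mean_t = sum(times) / n
    mean_v = sum(values) / n
    sxx = sum((t - mean_t) ** 2 for t in times)
    if sxx == 0:
        raise ValueError("all time stamps are identical")
    sxy = sum((t - mean_t) * (v - mean_v) for t, v in zip(times, values))
    slope = sxy / sxx
    return slope, mean_v - slope * mean_t


def estimate_rul(
    times: Sequence[float], values: Sequence[float], failure_limit: float
) -> Optional[float]:
    """Estimate the time left until the fitted trend reaches the failure limit.

    Returns 0.0 if the trend is already at or above the limit at the last
    observation, and None if the trend is not rising toward the limit.
    """
    slope, intercept = fit_trend(times, values)
    now = times[-1]
    if slope * now + intercept >= failure_limit:
        return 0.0
    if slope <= 0:
        return None
    time_of_failure = (failure_limit - intercept) / slope
    return time_of_failure - now


if __name__ == "__main__":
    # Hypothetical readings of a condition indicator (arbitrary units).
    hours = [0, 1, 2, 3]
    readings = [10, 12, 14, 16]
    print(estimate_rul(hours, readings, failure_limit=20))
```

In this hypothetical data set the indicator rises by 2 units per hour and reaches the limit of 20 at hour 5, so the estimate at hour 3 is 2 hours. Maintenance would then be scheduled inside that window, ahead of the forecast point.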

**Figure 7.** *Zone preventive approach for CBM.*

Second, it is necessary to understand in depth the consequences of device and system protection in the power distribution system. The failure mitigation map illustrates, for each primary failure, the extent to which that failure is mitigated by functional redundancy (or some other design consideration) to prevent it from acting as a single point of failure [13]. The protection design for data center system reliability can be classified into three stages, which correspond to the sources of power protection, as demonstrated in **Figure 6**.

Stage 1: Under normal conditions, the data center operates with the utility power source as primary power.

Stage 2: During a utility outage, UPS systems with flywheels can pick up the critical IT loads immediately, for short durations from less than 0.5 milliseconds up to 15 seconds, while UPS systems with batteries continue to protect the critical IT equipment after the flywheel has discharged, within 30 seconds. The design capacity of the batteries depends on the needs of the critical IT applications and equipment; designers and consultants mostly design for 15 to 30 minutes. This information must be given to the IT team and the data center consultant so that the solution for the critical loads can be calculated and designed [14].

Stage 3: While Stage 2 is in operation, the generator starts within 12–15 seconds of detecting the utility outage. If utility power has still not returned to normal, then within 15 seconds of the generator control sensor detecting the outage the standby power system is ready to take over the load from Stage 2 (UPSs).
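The three stages above can be read as a timeline that answers one question: which source carries the critical load at a given time after the utility fails? The sketch below is a simplified illustration, not the design method of [14]. It assumes that the flywheel and battery discharge one after the other, that the utility does not return, and that the default figures (15 s flywheel, 15 min battery, generator ready at 15 s) are taken from the ranges quoted in the stages. The class and function names are hypothetical.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class PowerChain:
    flywheel_ride_through_s: float = 15.0  # Stage 2: flywheel UPS, up to 15 s
    battery_autonomy_s: float = 15 * 60.0  # Stage 2: battery UPS, low end of 15-30 min
    generator_ready_s: float = 15.0        # Stage 3: generator ready to take load


def active_source(chain: PowerChain, t_s: float) -> str:
    """Source carrying the critical load t_s seconds after the utility outage."""
    if t_s < 0:
        return "utility"  # Stage 1: before the outage
    if t_s >= chain.generator_ready_s:
        return "generator"  # Stage 3: standby power has taken over
    if t_s < chain.flywheel_ride_through_s:
        return "ups-flywheel"  # Stage 2: flywheel bridges the first seconds
    if t_s < chain.flywheel_ride_through_s + chain.battery_autonomy_s:
        return "ups-battery"  # Stage 2: battery continues after the flywheel
    return "unprotected"  # stored energy exhausted before the generator is ready


def ride_through_margin_s(chain: PowerChain) -> float:
    """Stored-energy time left over once the generator is ready (negative = gap)."""
    stored = chain.flywheel_ride_through_s + chain.battery_autonomy_s
    return stored - chain.generator_ready_s


if __name__ == "__main__":
    chain = PowerChain()
    for t in (-1, 5, 15, 120):
        print(t, active_source(chain, t))
    # A generator that never starts leaves only the stored energy.
    failed_start = PowerChain(generator_ready_s=math.inf)
    print(active_source(failed_start, 1000))
```

A positive margin means the UPS stage outlasts the generator start-up time. A generator that fails to start shows up as an "unprotected" interval once the flywheel and battery are exhausted, which is the case the battery sizing of 15 to 30 minutes is meant to cover.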

Last, the power distribution system of the data center is designed to isolate and divide CBM into five groups, or zones (Zone 0, Zone I, Zone II, Zone III, and Zone IV), as presented in **Figure 7**:

Zone 0: Utilities (2N) preventive approach. CBM can be performed through the utility service level agreement (SLA) and remote monitoring and control.

Zone I: Generators 2(N + 1) preventive approach. CBM can be performed through DCIM software and a main-contractor or third-party SLA.

Zone II: UPSs 2(N + 1) preventive approach. CBM can be performed through DCIM software and a main-contractor or third-party SLA.

Zone III: Dual power paths (2N) preventive approach. CBM can be performed through DCIM software and a main-contractor or third-party SLA.

Zone IV: Load shedding preventive approach. CBM can be performed through DCIM software, in-house training to handle load shedding (within 10 minutes), and a main-contractor or third-party SLA.
