Intelligent Infrastructure Maintenance

Overview: The Norwegian Public Roads Administration (NPRA) is a Norwegian government agency responsible for national and county public roads in Norway. Among its other responsibilities, NPRA maintains the country's roads and bridges. Several prominent bridges are ageing and need continuous attention. Traditionally, NPRA has based its maintenance on preventive methods and time-scheduled inspections. With maintenance needs increasing, NPRA needs a more efficient and optimized way of deciding which bridges to maintain than the traditional methods offer.

NPRA and SAP are working together on a real-time monitoring system for Stavå bridge, built around a digital twin solution that SAP introduced in collaboration with NPRA. Real-time information is gathered by several sensors placed on various parts of the bridge and sent to the SAP system for monitoring. With this information, NPRA can understand the bridge's maintenance requirements and patterns.

The challenge of measuring and monitoring bridge behaviour and analyzing structural deterioration over time was solved mainly by placing sensors at several key positions on the bridge. Their inputs are processed and analyzed in order to understand the behaviour of the bridge. The main sensors whose data is used for the tasks in this Hackathon are:

20 accelerometers placed on the bridge.
Each accelerometer provides 3 channels of data, measuring acceleration along three mutually orthogonal axes [let’s call these axes a, b and c].
One air temperature sensor located near the bridge.

Note that not all sensors have the same orientation in space; they may be rotated relative to each other in both the horizontal and vertical plane. Channels 1 (axis a), 2 (axis b) and 3 (axis c) may thus point in different directions for different sensors, possibly even opposite to each other. Additionally, the ‘Frost’ data in the dataset provided comes from the temperature sensor located near the bridge.
The accelerometers deliver data at 64 Hz, while temperature measurements are updated every 10 minutes. Please take a look at the short visualization video clip showing raw values from the accelerometers.
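Because the two streams arrive at very different rates (64 Hz against one reading every 10 minutes), each temperature reading covers 64 × 600 = 38,400 accelerometer samples. The sketch below shows one possible way to attach the most recent temperature reading to every accelerometer sample. It assumes timestamps are available in seconds and sorted; the function name, argument names and the temperature values are hypothetical and not part of the provided dataset.

```python
import numpy as np

ACCEL_HZ = 64          # accelerometer sampling rate from the text
TEMP_PERIOD_S = 600    # temperature update period (10 minutes)

def align_temperature(accel_times_s, temp_times_s, temp_values):
    """Give each accelerometer sample the most recent temperature reading.

    Assumes both time arrays are sorted and expressed in seconds.
    """
    # Index of the last temperature timestamp <= each accelerometer timestamp.
    idx = np.searchsorted(temp_times_s, accel_times_s, side="right") - 1
    # Samples before the first reading fall back to the first reading.
    idx = np.clip(idx, 0, len(temp_values) - 1)
    return np.asarray(temp_values)[idx]

# Illustrative data: 20 minutes of samples and two made-up readings.
accel_times = np.arange(0, 1200 * ACCEL_HZ) / ACCEL_HZ
temp_times = np.array([0.0, 600.0])
temps = np.array([-2.0, -1.5])

aligned = align_temperature(accel_times, temp_times, temps)
samples_per_reading = ACCEL_HZ * TEMP_PERIOD_S
```

Holding the last reading constant is only one choice; interpolating between readings would be an equally reasonable assumption.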

Tasks:

1: Data Compression

Since there are 20 accelerometers with 3 channels each, we have 60 values/dimensions recorded for each point in time [A_{1_a}, A_{1_b}, A_{1_c}, A_{2_a}, A_{2_b}, A_{2_c} … A_{20_a}, A_{20_b}, A_{20_c}]. This number of dimensions can take a toll on bandwidth as well as storage, especially as the number of sensors and the sampling frequency increase. The objective of the task is to reduce the number of dimensions from 60 while ensuring that at least 90 percent of the original data is retained. Let these new reduced dimensions be referred to as K₁, K₂ ... K_N, where N < 60.

The solution should consist of two parts: an encoder/compressor and a decoder/decompressor. Let X₁, X₂, X₃ ... X₆₀ be the values of the 60 dimensions at a point in time t. The encoder should take these 60 dimensions as input and encode them into K₁, K₂ ... K_N, where N < 60. The decoder should be able to take K₁, K₂ ... K_N as input and use them to recreate/predict the original values; let these decoded values be X_1^*, X_2^*, … X_60^*. The data loss is measured between the original values X₁, X₂ … X₆₀ and the encoded-decoded result X_1^*, X_2^*, … X_60^*.

The corresponding temperature data will also be provided and can be used if the candidate wishes. The temperature readings will not be subject to evaluation.

Some example solution strategies:

Dimensionality Reduction Techniques
Auto-Encoders
Algorithms that predict the values of some sensors (Y) using the rest of the sensors (X). In the encoding step, the dimensions of Y could then be stripped off, and in the decoding step they could be predicted.
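To make the encoder/decoder split concrete, here is a minimal sketch of the first strategy using PCA computed with NumPy's SVD. It is one possible baseline, not the organisers' method. The function names, the 90% explained-variance cut-off and the synthetic data are assumptions for illustration; note that explained variance on raw channel values is not the same quantity as the hackathon's evaluation metric on vector lengths.

```python
import numpy as np

def fit_pca(X, retain=0.90):
    """Fit a PCA codec on X (samples x 60 channels).

    Keeps the smallest number of components whose cumulative explained
    variance reaches `retain` (an assumed proxy for "data retained").
    """
    mean = X.mean(axis=0)
    # Rows of Vt are principal directions, ordered by singular value.
    _, s, Vt = np.linalg.svd(X - mean, full_matrices=False)
    explained = np.cumsum(s**2) / np.sum(s**2)
    n_components = min(int(np.searchsorted(explained, retain)) + 1, len(s))
    return mean, Vt[:n_components]

def encode(X, mean, components):
    """Map X_1..X_60 to K_1..K_N."""
    return (X - mean) @ components.T

def decode(K, mean, components):
    """Map K_1..K_N back to X*_1..X*_60."""
    return K @ components + mean

# Synthetic stand-in for the real recordings (an assumption for this demo):
# 60 channels driven by 5 hidden signals plus a little noise.
rng = np.random.default_rng(0)
latent = rng.normal(size=(1000, 5))
X = latent @ rng.normal(size=(5, 60)) + 0.01 * rng.normal(size=(1000, 60))

mean, comps = fit_pca(X)
K = encode(X, mean, comps)
X_hat = decode(K, mean, comps)
```

In this sketch the decoder needs `mean` and `comps` as well as K, so those would have to be stored or transmitted once alongside the compressed stream.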

2: Sensor Network Robustness

The objective of the task is to handle sensor fallouts by predicting the values of the failed sensors from the data of the other sensors. The task is to develop 20 models, M₁ to M₂₀, that predict the values of A₁ to A₂₀, respectively. For example, model M₁ should predict the value of A₁ using the values of A₂ to A₂₀, M₂ should predict the value of A₂ using the values of the remaining sensors, and so on.

FYI: The individual models M₁ to M₂₀ can contain further models within them if the algorithm needs it. For example, to predict the value of A₁, M₁ needs to predict the values of A_{1_a}, A_{1_b} and A_{1_c}. So M₁ can have separate component models, each dedicated to predicting one axis, i.e. M_{1_a} for A_{1_a}, M_{1_b} for A_{1_b} and M_{1_c} for A_{1_c}.

The corresponding temperature data will also be provided and can be used if the candidate wishes. Note that the sensors may be subject to (internal and individual) temperature bias, giving a constant shift in the values of each channel, and may also rotate slightly as the structure deforms when the temperature changes.
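As a baseline for one model M_n, the sketch below fits an ordinary least-squares model that predicts the three channels of sensor n from the remaining 57 channels. It is an illustration under stated assumptions rather than a recommended solution: the column layout follows the ordering [A_{1_a}, A_{1_b}, A_{1_c}, A_{2_a}, ...] from Task 1, the function names are hypothetical, the data is synthetic, and temperature effects are ignored.

```python
import numpy as np

def sensor_columns(n):
    """Column indices of sensor n (1-based) in the assumed 60-column layout."""
    start = 3 * (n - 1)
    return [start, start + 1, start + 2]

def fit_fallout_model(X, n):
    """Fit M_n: predict sensor n's three channels from all other channels."""
    target = sensor_columns(n)
    inputs = [c for c in range(X.shape[1]) if c not in target]
    # A constant column lets the model learn a per-channel offset.
    design = np.column_stack([X[:, inputs], np.ones(len(X))])
    weights, *_ = np.linalg.lstsq(design, X[:, target], rcond=None)
    return inputs, weights

def predict_sensor(X, model):
    """Apply a fitted model; the target sensor's columns in X are ignored."""
    inputs, weights = model
    design = np.column_stack([X[:, inputs], np.ones(len(X))])
    return design @ weights

# Synthetic stand-in for the real recordings (an assumption for this demo).
rng = np.random.default_rng(0)
latent = rng.normal(size=(1000, 5))
X = latent @ rng.normal(size=(5, 60)) + 0.01 * rng.normal(size=(1000, 60))

train, held_out = X[:800], X[800:]
model_1 = fit_fallout_model(train, n=1)
A1_pred = predict_sensor(held_out, model_1)
rmse = np.sqrt(np.mean((A1_pred - held_out[:, sensor_columns(1)]) ** 2))
```

Here a single weight matrix plays the role of the component models M_{1_a}, M_{1_b} and M_{1_c}, one column per axis.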

Naming Conventions:

For sensor A_n,

The values for channels a, b, c are denoted by A_{n_a}, A_{n_b}, A_{n_c}.
The vector length is calculated using Pythagoras (the vector length equation), A_n = \sqrt{A_{n_a}^2 + A_{n_b}^2 + A_{n_c}^2}, and is denoted by A_n.
The predicted a, b, c values are denoted by A_{n*_a}, A_{n*_b}, A_{n*_c}.
The predicted vector length is calculated in the same way, A_n^* = \sqrt{A_{n*_a}^2 + A_{n*_b}^2 + A_{n*_c}^2}, and is denoted by A_n^*.
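The vector length can be computed for all 20 sensors at once. This is a small sketch that assumes the samples are stored as a (samples × 60) array in the order [A_{1_a}, A_{1_b}, A_{1_c}, A_{2_a}, ...]; the function name is hypothetical.

```python
import numpy as np

def vector_lengths(X):
    """Turn (samples, 60) channel values into (samples, 20) vector lengths."""
    # Group each sensor's a, b, c channels along a trailing axis of size 3.
    per_sensor = np.asarray(X, dtype=float).reshape(len(X), -1, 3)
    # Euclidean norm over the three channels of each sensor.
    return np.linalg.norm(per_sensor, axis=2)

# One illustrative sample: sensor 1 reads (3, 4, 12), all others read zero.
sample = np.zeros((1, 60))
sample[0, :3] = [3.0, 4.0, 12.0]
lengths = vector_lengths(sample)
```

The same function applies unchanged to predicted or decoded channel values to obtain A_n^*.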

Bias Removal:

For all calculations, a pre-processing step is applied in order to avoid the issues and biases that noise or external effects such as temperature could cause. We divide the day into 5-minute intervals and calculate the average vector length within each interval [for the scope of this hackathon, these are fixed windows, not sliding windows]. For every sensor, the corrected vector length A_n at time t is then the vector length A_n at time t minus the average vector length in the 5-minute interval that contains t.

A_n (corrected) at time t = A_n at time t – Average value of A_n in the 5 minute time interval
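The correction above can be sketched as follows. The sketch assumes timestamps are given in seconds since the start of the day, so that the fixed windows are [0, 300), [300, 600) and so on; the function and argument names are hypothetical.

```python
import numpy as np

def remove_bias(lengths, timestamps_s, window_s=300):
    """Subtract each fixed 5-minute window's mean from the vector lengths.

    `lengths` may be 1-D (one sensor) or 2-D (samples x sensors).
    """
    lengths = np.asarray(lengths, dtype=float)
    # Fixed (non-sliding) windows: every sample belongs to exactly one.
    window_id = (np.asarray(timestamps_s) // window_s).astype(int)
    corrected = np.empty_like(lengths)
    for w in np.unique(window_id):
        mask = window_id == w
        corrected[mask] = lengths[mask] - lengths[mask].mean(axis=0)
    return corrected

# Illustrative values: three samples in the first window, two in the second.
timestamps = np.array([0.0, 100.0, 299.0, 300.0, 400.0])
raw = np.array([1.0, 2.0, 3.0, 10.0, 20.0])
corrected = remove_bias(raw, timestamps)
```

In this example the window means are 2 and 15, so the corrected values are -1, 0, 1, -5 and 5.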

Technical Evaluation Criteria

Challenge 1:

Metric 1: The data loss is measured between the sum of differences in the square of the original vector length (average taken out as described above) and the sum of squares of the encoded-decoded value for all sensors. This difference should be within 10% of the original sum of squares (of A_n). These metrics will be computed on the validation dataset. The greater the reduction in dimensions while retaining 90% of the data, the better.

where A_nt is the original vector length of sensor A_n at time t,
A^*_nt is the vector length of sensor A_n at time t after encoding-decoding,
n is the index of sensors from 1 to 20,
t₁ is the beginning time and t₂ is the end time.
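The formula for this metric is not reproduced in the text, so the sketch below implements only one plausible reading and should be treated as an assumption: the squared differences between A_nt and A^*_nt, summed over all sensors n and all times t from t₁ to t₂, divided by the sum of squares of A_nt. Both inputs are assumed to be bias-corrected vector lengths, and the function name and sample values are hypothetical.

```python
import numpy as np

def relative_loss(original, decoded):
    """ASSUMED reading of the metric: sum of (A_nt - A*_nt)^2 over all
    sensors and times, divided by the sum of A_nt^2."""
    original = np.asarray(original, dtype=float)
    decoded = np.asarray(decoded, dtype=float)
    return np.sum((original - decoded) ** 2) / np.sum(original ** 2)

# Illustrative bias-corrected lengths: rows are times, columns are sensors.
original = np.array([[1.0, -2.0], [2.0, -1.0]])
decoded = np.array([[1.0, -2.0], [1.5, -1.0]])

loss = relative_loss(original, decoded)   # 0.25 / 10 = 0.025
within_threshold = loss <= 0.10
```

If the organisers' formula instead compares the two sums of squares directly, only the numerator of this function would change.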

In addition to the metric, the following criteria may also be assessed to select the winner:

Data retention: of two approaches with a similar level of dimension reduction, the one with higher data retention is better.
Generalizability of the approach: can it be used for other types of sensors?
Size of the model (if used): the smaller the better.
Speed

Challenge 2:

Metric 1: RMSE across each of the sensors. RMSE will not be calculated independently for each dimension; instead, the RMSE between the original vector length A_n and the predicted vector length A_n^* will be used. Before this calculation, the bias removal strategy specified above will be applied to both A_n and A_n^*. These metrics will be computed on the validation dataset; the candidate with the lower total RMSE wins.
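A minimal sketch of this calculation follows. It assumes both inputs are (samples × sensors) arrays of vector lengths that have already been bias-corrected, and it takes "total RMSE" to mean the sum of the per-sensor RMSE values, which is an assumption since the text does not define the aggregation. The function name and sample values are hypothetical.

```python
import numpy as np

def rmse_per_sensor(actual, predicted):
    """Return per-sensor RMSE and their sum (assumed meaning of 'total')."""
    actual = np.asarray(actual, dtype=float)
    predicted = np.asarray(predicted, dtype=float)
    # Average the squared error over time (axis 0), separately per sensor.
    per_sensor = np.sqrt(np.mean((actual - predicted) ** 2, axis=0))
    return per_sensor, per_sensor.sum()

# Illustrative values: two time steps, two sensors.
actual = np.array([[0.0, 0.0], [0.0, 0.0]])
predicted = np.array([[3.0, 1.0], [4.0, 1.0]])

per_sensor, total = rmse_per_sensor(actual, predicted)
```

For these values the per-sensor RMSE is sqrt(12.5) for the first sensor and 1 for the second.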

In addition to the metric, the following criteria may also be assessed to select the winner:

Generalizability
Size of the model
Speed

Data & resources:

Note: We have reserved an additional test data set in the same format as the training data. Your data model and pipeline should be robust enough for validation testing.

Code with SAP Labs India - Discover, Design, Deliver

Winners

Smart Infrastructure & Ontology

Use-case based Interpretation of Machine Learning Models using Explainable AI

SmartStory

NOBURNOUT

Smart Infrastructure

Intelligent Infrastructure Maintenance

Overview Session Recording

Click here to view the PPT for understanding this problem statement
