Skip to main content

Values and RUL

The Values and RUL section enables you to process regression data effectively. RUL stands for Remaining Useful Life, a critical metric in predictive modeling for estimating how long a system or component will function before requiring maintenance or replacement. This section is divided into three segments:

  1. Data Sample List
  2. Distributions for [List Name]
  3. Exploration Results for [List Name]

Each segment provides unique functionalities and insights to help you analyze and refine regression data for model development.

Values and RUL

Data Sample List

The Data Sample List segment displays an interactive table with the following fields:

FieldDescription
Lock SymbolAllows you to lock or unlock specific data samples to prevent accidental edits.
List NameDisplays the name of the current data sample list.
List TypeSpecifies whether the list is for training, validation, or testing purposes.
Data ShapeIndicates the dimensions of the data in terms of features and samples.
N SamplesShows the number of samples in the data list.
Target RangeDisplays the range of target values in the data.
CreatedShows the date and time when the data list was created.
ModifiedDisplays the date and time of the last modification made to the data list.
CommentsIncludes any notes or remarks associated with the data list.
StatusIndicates the current state of the data list, such as ready, in progress, or locked.

This table helps you organize, review, and manage your regression data efficiently.

Distributions

The Distributions segment visualizes the distribution of your data samples, offering critical insights into the dataset's characteristics. It includes:

FieldDescription
# of BinsSpecifies the number of bins for the histogram. You can manually enter a value or adjust it using the up and down arrows.
Value RangeDisplays the range of values in the selected bin.
CountIndicates the number of samples within each bin.
% of ListShows the percentage of total samples that fall within each bin.
Total CountDisplays the total number of samples in the data list.

This segment allows you to analyze data distribution patterns and identify anomalies or imbalances.

Exploration Results

The Exploration Results segment provides detailed metrics and visualizations to evaluate your regression data. Key features include:

FieldDescription
FavoritesAllows you to mark preferred configurations or results for quick access.
ComplexityDisplays the computational complexity, including resource requirements like Flash and RAM for different configurations.
Kfold AccuracyShows cross-validation accuracy metrics, including R-squared and Worst% Error values.
Training SeparationProvides performance metrics for training data, including R-squared and Worst% Error values.
Create Base ToolEnables you to create a baseline tool or model configuration for further analysis.
ExplanationOffers insights into the decision structure and significance of various parameters, along with interactive plots for deeper analysis.
Regression PlotsDisplays visualizations like scatter plots or line graphs to represent regression outputs and their alignment with target values.

This segment is essential for refining models and understanding their performance before deployment.

Creating AI Models and Viewing Statistics

This procedure guides you through creating AI models using an uploaded dataset and reviewing various statistics for these models.

  1. On the left menu, go to AI Explore > Values and RUL.
  2. On the Data Sample Lists segment, choose the required segmented list from the table.
  3. In the Exploration Results tab, click Start Exploring.
  • The system begins processing the feature spaces and generating models.
  • You can monitor the progress and view the top-performing models listed on the page.
  • For more information about this stage, refer to the Exploration Results Interface section.
  1. Each row in the Exploration Results section represents a model based on features identified during the AI exploration process. To view detailed accuracy statistics, click the Regression Plots row for the desired model.
  2. A blank Regression Plot opens. At the bottom of the screen:
    • Click the first option to view the confusion matrix, which includes overall accuracy, F1 score and error distribution.
      • Use the error distribution plot to identify the concentration of errors across the numeric range. A wider distribution indicates poorer model performance.
    • Click the second option to access two plots:
      • Regression Plot: Shows actual vs. predicted values for the selected model.
      • Residual Error Plot: Displays sample-wise residual errors.
NOTE

The key metrics include**

  • Residual Error: The vertical difference between a data point and the regression line, representing the error between predicted and observed values.
  • R-squared (R²): A measure of how well the independent variable explains the variation in the dependent variable. Higher values indicate better fit.
  • Mean Absolute Error (MAE): The average of all absolute errors, representing the average error magnitude.
  1. Hover over the Complexity tab to view resource consumption for the model:
    • Feature Space: Multiplication operations for feature calculation.
    • Classifier: Multiplication operations required for the classifier (blank for regression models).
    • Total: Overall multiplication operations.
  2. Use the complexity numbers to gauge resource requirements for the model on target processors (MCU/MPUs). For example:
    • If spectral-based features are important, and the decision significance plot highlights frequencies under 50 Hz, applying a filter for frequencies < 50 Hz during feature computation can reduce resource consumption.
    • If you preselected a target processor during project creation, the top-performing models will align with the available resources for that processor.
  3. After reviewing the statistics, click the highlighted option to deploy the model for further testing or export.