Development of pipeline feature engineering for building an AutoML service

Parfenov, D.; Bolodurina, I.; Grishina, L.; Zhigalov, A.; Legashev, L.

doi:10.1088/1742-6596/2388/1/012053

Development of pipeline feature engineering for building an AutoML service

Author(s):	D. Parfenov I. Bolodurina L. Grishina A. Zhigalov L. Legashev
Medium:	journal article
Language(s):	English
Published in:	Journal of Physics: Conference Series, 1 December 2022, n. 1, v. 2388
Page(s):	012053
DOI:	10.1088/1742-6596/2388/1/012053
Abstract:	The large–scale implementation of artificial intelligence approaches in applied fields has a number of limitations, one of which is the availability of research competencies, knowledge of data analysis methods, mathematical statistics and machine learning. Automatic machine learning is designed to simplify the methodology of ML application development. Within the framework of this study, a new approach to the construction of pipeline feature engineering for AutoML service is presented, based on the sequential expansion of the feature space and the use of autoencoders to reduce the dimension of input features and reconstruct the final output features. The results of the presented approach are shown by the example of VANET network traffic data when solving the problem of classifying attacks on nodes. The data set was obtained as a result of simulating the real traffic of a certain segment of the VANET network in the OMNET++ environment and subsequent aggregation of data on network flows by means of CICFlowmeter-V4.0. Experiments have shown that machine learning models on the source data have an accuracy of 2% lower on average, which indicates the effectiveness of using the proposed Feature Engineering approach. The highest classification accuracy was demonstrated by Pipeline using the Multi–layered Model autoencoder and the XGBoost classification model – 91.2%. Thus, the presented Feature Engineering approach can be used to build the most effective feature space and improve the quality of machine learning models.

Structurae cannot make the full text of this publication available at this time. The full text can be accessed through the publisher via the DOI: 10.1088/1742-6596/2388/1/012053.

About this
data sheet
Reference-ID
10777547
Published on:
12/05/2024
Last updated on:
12/05/2024

Advertisement

Structurae cooperates with

International Association for Bridge and Structural Engineering (IABSE)

Advertisement

Development of pipeline feature engineering for building an AutoML service

You have exceeded your monthly download limit!

Structurae Plus subscribers can download 30 media files or data sets per month.
Structurae Pro users are limited to 50.

Required Data

Development of pipeline feature engineering for building an AutoML service

You have exceeded your monthly download limit! Structurae Plus subscribers can download 30 media files or data sets per month. Structurae Pro users are limited to 50.

Required Data

You have exceeded your monthly download limit!

Structurae Plus subscribers can download 30 media files or data sets per month.
Structurae Pro users are limited to 50.