Mgr. Alexander Kovalenko, Ph.D.

AI-Based Spatiotemporal Crop Monitoring by Cloud Removal in Satellite Images

Authors

Pihrt, J.; Šimánek, P.; Kovalenko, A.; Kvapil, J.; Charvat, K.

Year

2024

Published

Proceedings of the19th Conference on Computer Science and Intelligence Systems. Institute of Electrical and Electronics Engineers Inc., 2024. p. 485-492. Annals of Computer Science and Intelligence Systems. vol. 39. ISSN 2300-5963. ISBN 978-83-969601-6-0.

Type

Proceedings paper

DOI

10.15439/2024F5446

Departments

Department of Applied Mathematics

Annotation

Efficient crop monitoring and crop dynamics fore- casting leveraging diverse satellite and point data are described. Attention-based architecture architecture is adapted for mono- temporal cloud removal which overcomes an issue of crop monitoring. Combining optical (Sentinel-2) and radar (Sentinel- 1) satellite data improves the robustness and accuracy of the model in terms of satellite image reconstruction and normalized difference vegetation index prediction and forecasting. However, available soil-type geographical data and land surface analysis products, do not improve prediction accuracy significantly

Exploring sperm cell motion dynamics: Insights from genetic algorithm-based analysis

Authors

Klingner, A.; Kovalenko, A.; Magdanz, V.; Khalil, I.S.M.

Year

2024

Published

Computational and Structural Biotechnology Journal. 2024, 23 2837-2850. ISSN 2001-0370.

Type

Article

DOI

10.1016/j.csbj.2024.06.008

Departments

Department of Applied Mathematics

Annotation

Accurate analysis of sperm cell flagellar dynamics plays a crucial role in understanding sperm motility as flagella parameters determine cell behavior in the spatiotemporal domain. In this study, we introduce a novel approach by harnessing Genetic Algorithms (GA) to analyze sperm flagellar motion characteristics and compare the results with the traditional decomposition method based on Fourier analysis. Our analysis focuses on extracting key parameters of the equation approximating flagellar shape, including beating period time, bending amplitude, mean curvature, and wavelength. Additionally, we delve into the extraction of phase constants and initial swimming directions, vital for the comprehensive study of sperm cell pairs and bundling phenomena. One significant advantage of GA over Fourier analysis is its ability to integrate sperm cell motion data, enabling a more comprehensive analysis. In contrast, Fourier analysis neglects sperm cell motion by transitioning to a sperm-centered coordinate system (material system). In our comparative study, GA consistently outperform the Fourier analysis-based method, yielding a remarkable reduction in fitting error of up to 70% and on average by 45%. An in-depth exploration of the sperm cell motion becomes indispensable in a wide range of applications from complexities of reproductive biology and medicine, to developing soft flagellated microrobots.

Machine Learning Based Tool for Automated Sperm Cell Tracking and Sperm Bundle Detection

Authors

Hořenín, J.; Magdanz, V.; Khalil, I.S.M.; Klingner, A.; Kovalenko, A.; Čepek, M.

Year

2024

Published

Machine Learning and Knowledge Discovery in Databases. Applied Data Science Track. Cham: Springer, 2024. p. 19-32. Lecture Notes in Computer Science. vol. 14950. ISSN 2945-9133. ISBN 978-3-031-70380-5.

Type

Proceedings paper

DOI

10.1007/978-3-031-70381-2_2

Departments

Department of Applied Mathematics

Annotation

This study introduces a novel machine learning-based methodology for automated detection and tracking of sperm cells within microscopic video recordings, aiming to elucidate the dynamics and motion patterns of individual sperm cells as well as sperm cell bundles. At first, the method identifies sperm cells across successive frames within a video sequence, facilitating the reconstruction of each cell's trajectory over time. Subsequently, we introduce a classification algorithm that distinguishes between solitary sperm cells, clusters of adjacent cells, and cohesive sperm cell bundles, addressing a gap in existing methodologies. Finally, we employ three conventional metrics for velocity assessment: Straight Line Velocity (VSL) and Average Path Velocity (VAP) and Curvilinear velocity (VCL), to quantify the movement speed of both individual sperm cells and bundles. The approach represents a significant advancement in the automated analysis of sperm motility and aggregation phenomena, providing a robust tool for researchers to study sperm behavior with enhanced accuracy and efficiency. The integration of machine learning techniques in sperm cell detection and tracking offers promising insights into reproductive biology and fertility studies.

Overcoming Long Inference Time of Nearest Neighbors Analysis in Regression and Uncertainty Prediction

Authors

Koutenský, F.; Šimánek, P.; Čepek, M.; Kovalenko, A.

Year

2024

Published

SN Computer Science. 2024, 5(5), ISSN 2662-995X.

Type

Article

DOI

10.1007/s42979-024-02670-2

Departments

Department of Applied Mathematics

Annotation

The intuitive approach of comparing like with like, forms the basis of the so-called nearest neighbor analysis, which is central to many machine learning algorithms. Nearest neighbor analysis is easy to interpret, analyze, and reason about. It is widely used in advanced techniques such as uncertainty estimation in regression models, as well as the renowned k-nearest neighbor-based algorithms. Nevertheless, its high inference time complexity, which is dataset size dependent even in the case of its faster approximated version, restricts its applications and can considerably inflate the application cost. In this paper, we address the problem of high inference time complexity. By using gradient-boosted regression trees as a predictor of the labels obtained from nearest neighbor analysis, we demonstrate a significant increase in inference speed, improving by several orders of magnitude. We validate the effectiveness of our approach on a real-world European Car Pricing Dataset with approximately rows for both residual cost and price uncertainty prediction. Moreover, we assess our method’s performance on the most commonly used tabular benchmark datasets to demonstrate its scalability. The link is to github repository where the code is available: https://github.com/koutefra/uncertainty_experiments.

Unlocking Nature’s Design through Neural Cellular Automata

Authors

Koutenský, F.; Šimánek, P.; Kovalenko, A.

Year

2024

Published

ALIFE 2024: Proceedings of the 2024 Artificial Life Conference. Cambridge: The MIT Press, 2024.

Type

Proceedings paper

DOI

10.1162/isal_a_00831

Departments

Department of Applied Mathematics

Annotation

This study presents Dynamics Identification via Neural Cellular Automata (DINCA), an enhancement of Neural Cellular Automata (NCA) for modeling reaction-diffusion systems. The main advantage of DINCA is its ability to estimate the parameters of the reaction-diffusion equations that govern the examined system, using minimal data. We demonstrate the method’s application potential by showing its ability to model leopard pattern formation, by learning on only three images, while revealing the governing reaction-diffusion equations. This positions NCA-based methodologies as a viable tool for inferring partial differential equations.

Die level predictive modeling to reduce latent reliability defect escapes

Authors

Lenhard, P.; Kovalenko, A.; Lenhard, R.

Year

2023

Published

Microelectronics Reliability. 2023, 148 1-7. ISSN 0026-2714.

Type

Article

DOI

10.1016/j.microrel.2023.115139

Departments

Department of Applied Mathematics

Annotation

This paper presents a die-level screening method based on inline defect inspection that uses advanced predictive engines to generate die-level failure probabilities to filter dice with high-reliability risk if the predicted probability is higher than a selected limit. The method uses the relationship between ’killer’ and latent defects to identify potentially unreliable dice that have passed the Wafer Sort. A novel approach of saliency map clustering algorithms is applied to increase the level of granularity in latent defect detection beyond supervised defect classification.

Deep learning techniques for integrated circuit die performance prediction

Authors

Kovalenko, A.; Lenhard, P.; Lenhard, R.

Year

2022

Published

MRS Advances. 2022, 7(30), 683-688. ISSN 2059-8521.

Type

Article

DOI

10.1557/s43580-022-00308-0

Departments

Department of Applied Mathematics

Annotation

Predicting integrated circuit (IC) functionality based on process control monitoring (PCM) parameters without individual die testing is a major challenge for manufacturers due to the high cost of electrical die measurement on the wafer. Complex dependencies between individual PCM parameters can be used to explain certain patterns of dice failure using Deep learning (DL) algorithms. However, random failure patterns due to process defects cannot be detected by this method. Combining PCM and in-process defect inspection data can be an ultimate prediction technique. In some cases, however, the availability of defect inspection data is much lower than the availability of PCM data, so direct ensemble training is rather ambiguous. This paper shows how to efficiently utilize both defect and PCM data to train a model to predict IC functionality. Such a hybrid model outperforms PCM-only solutions, and in contrast to a defect-only model predicts also failure areas across the wafer.

Integrated Circuit Die Level Yield Prediction Using Deep Learning

Authors

Lenhard, P.; Kovalenko, A.; Lenhard, R.

Year

2022

Published

2022 33rd Annual SEMI Advanced Semiconductor Manufacturing Conference (ASMC). Piscataway (New Jersey): IEEE, 2022. ISSN 1078-8743. ISBN 978-1-6654-9487-8.

Type

Proceedings paper

DOI

10.1109/ASMC54647.2022.9792526

Departments

Department of Applied Mathematics

Annotation

Given the integrated circuits (IC) production scale, the amount of process control monitoring (PCM) data enable to develop an efficient algorithm for IC yield prediction at the die-level. Therefore, in addition to cost-effective and timeefficient yield evaluation, the proposed model is able to identify failed dice and low-yield areas on a wafer without any direct electrical die testing. Additionally, for non-parametric random dice failure detection that are untraceable by PCM input based models, an ensemble learning including both PCM and die defect inspection data are described. As Wafer Sort (WS) consumes a lot of time and resources with high associated cost a significant cost reduction can be achieved using smart product routing with selective WS by employing the aforementioned die level predictive model.

Linear Self-attention Approximation via Trainable Feedforward Kernel

Authors

Yorsh, U.; Kovalenko, A.

Year

2022

Published

Artificial Neural Networks and Machine Learning – ICANN 2022. Springer, Cham, 2022. p. 807-810. LNCS. vol. 13531. ISSN 0302-9743. ISBN 978-3-031-15933-6.

Type

Proceedings paper

DOI

10.1007/978-3-031-15934-3_67

Departments

Department of Applied Mathematics
Programming Research Lab

Annotation

Restrictive limitation of Transformers due to the quadratic complexity of self-attention mechanism motivated a new research field of efficient Transformers, which approximate the original architecture with asymptotically faster models.

CodeDJ: Reproducible queries over large-scale software repositories

Authors

Máj, P.; Siek, K.; Kovalenko, A.; Vitek, J.

Year

2021

Published

Leibniz International Proceedings in Informatics (LIPIcs). Saarbrücken: Dagstuhl Publishing,, 2021. p. 1-24. ISSN 1868-8969. ISBN 978-3-95977-190-0.

Type

Proceedings paper

DOI

10.4230/LIPIcs.ECOOP.2021.6

Departments

Department of Theoretical Computer Science
Department of Applied Mathematics
Programming Research Lab

Annotation

Analyzing massive code bases is a staple of modern software engineering research – a welcome side-effect of the advent of large-scale software repositories such as GitHub. Selecting which projects one should analyze is a labor-intensive process, and a process that can lead to biased results if the selection is not representative of the population of interest. One issue faced by researchers is that the interface exposed by software repositories only allows the most basic of queries. CodeDJ is an infrastructure for querying repositories composed of a persistent datastore, constantly updated with data acquired from GitHub, and an in-memory database with a Rust query interface. CodeDJ supports reproducibility, historical queries are answered deterministically using past states of the datastore; thus researchers can reproduce published results. To illustrate the benefits of CodeDJ, we identify biases in the data of a published study and, by repeating the analysis with new data, we demonstrate that the study’s conclusions were sensitive to the choice of projects.

Dynamic Neural Diversification: Path to Computationally Sustainable Neural Networks

Authors

Kovalenko, A.; Kordík, P.; Friedjungová, M.

Year

2021

Published

Artificial Neural Networks and Machine Learning – ICANN 2021. Cham: Springer, 2021. p. 235-247. 1. vol. 12892. ISSN 1611-3349. ISBN 978-3-030-86339-5.

Type

Proceedings paper

DOI

10.1007/978-3-030-86340-1_19

Departments

Department of Applied Mathematics

Annotation

Small neural networks with a constrained number of trainable parameters, can be suitable resource-efficient candidates for many simple tasks, where now excessively large models are used. However, such models face several problems during the learning process, mainly due to the redundancy of the individual neurons, which results in sub-optimal accuracy or the need for additional training steps. Here, we explore the diversity of the neurons within the hidden layer during the learning process, and analyze how the diversity of the neurons affects predictions of the model. As following, we introduce several techniques to dynamically reinforce diversity between neurons during the training. These decorrelation techniques improve learning at early stages and occasionally help to overcome local minima faster. Additionally, we describe novel weight initialization method to obtain decorrelated, yet stochastic weight initialization for a fast and efficient neural network training. Decorrelated weight initialization in our case shows about 40% relative increase in test accuracy during the first 5 epochs.

Mgr. Alexander Kovalenko, Ph.D.

Publications

AI-Based Spatiotemporal Crop Monitoring by Cloud Removal in Satellite Images

Exploring sperm cell motion dynamics: Insights from genetic algorithm-based analysis

Machine Learning Based Tool for Automated Sperm Cell Tracking and Sperm Bundle Detection

Overcoming Long Inference Time of Nearest Neighbors Analysis in Regression and Uncertainty Prediction

Unlocking Nature’s Design through Neural Cellular Automata

Die level predictive modeling to reduce latent reliability defect escapes

Deep learning techniques for integrated circuit die performance prediction

Integrated Circuit Die Level Yield Prediction Using Deep Learning

Linear Self-attention Approximation via Trainable Feedforward Kernel

CodeDJ: Reproducible queries over large-scale software repositories

Dynamic Neural Diversification: Path to Computationally Sustainable Neural Networks