prof. Ing. RNDr. Martin Holeňa, CSc.

Suitability of Modern Neural Networks for Active and Transfer Learning in Surrogate-Assisted Black-Box Optimization

Autoři

Holeňa, M.; Koza, J.

Rok

2024

Publikováno

Proceedings of the 8th International Workshop and Tutorial on Interactive Adaptive Learning 2024. Aachen: CEUR Workshop Proceedings, 2024. p. 47-67. vol. 3770. ISSN 1613-0073.

Typ

Stať ve sborníku

Pracoviště

Katedra aplikované matematiky

Anotace

Active learning plays a crucial role in black-box optimization, especially for objective functions that are expensive to evaluate. Continuous black-box optimization has adopted an approach called surrogate modelling, where the original black-box objective is approximated with a regression model. An active learning task in this context is to decide which points should be evaluated using the original objective to update the surrogate model. Apart from low-order polynomials, the first surrogate models were artificial neural networks of the kinds multilayer perceptron and radial basis function network. In the late 2000s, neural networks have been superseded by other kinds of surrogate models, primarily Gaussian processes. However, over the last 15 years, neural networks have seen significant and successful development, suggesting that they once again have the potential to serve as promising surrogate models. This paper reviews possible research directions concerning that potential, and recalls initial results from investigations in some of these directions. Finally, it contributes to those results by investigating the state-of-the-art black-box optimizer CMA-ES surrogate-assisted by two variants of random-activation-function neural network ensembles.

Textual embeddings with word-type-weighted word2vec

Autoři

Ladin, T.; Korel, L.; Holeňa, M.

Rok

2024

Publikováno

24th Conference Information Technologies - Applications and Theory. Aachen: CEUR Workshop Proceedings, 2024. p. 37-42. vol. 3792. ISSN 1613-0073.

Typ

Stať ve sborníku

Pracoviště

Katedra aplikované matematiky

Anotace

The increasing use of artificial neural networks for knowledge processing often lacks precise knowledge representation. To address this issue, we propose using a word-type-weighted Word2Vec model to achieve more accurate representations of individual words within sentences. Our approach incorporates weighting vector embeddings of words based on parts-of-speech predictions generated by the spaCy library. Experimental results demonstrate that, compared to simple Word2Vec, our model enhances the accuracy of recognizing the semantics of a sentence, while maintaining significantly lower computational requirements than large language models and various variants of Transformer.

Improving Optimization with Gaussian Processes in the Covariance Matrix Adaptation Evolution Strategy

Autoři

Tumpach, J.; Koza, J.; Holeňa, M.

Rok

2023

Publikováno

Proceedings of the 23rd Conference Information Technologies – Applications and Theory (ITAT 2023). Aachen: CEUR Workshop Proceedings, 2023. p. 82-88. ISSN 1613-0073.

Typ

Stať ve sborníku

Pracoviště

Katedra aplikované matematiky

Anotace

This paper explores the use of Gaussian processes (GPs) in the covariance matrix adaptation evolution strategy (CMA-ES) for black-box optimization. GPs are powerful probabilistic models that capture complex relationships, making them suitable for modeling uncertain objective functions. Integrating GPs into the CMA-ES improves exploration and adaptation in the search space, enhancing convergence speed and solution quality. The paper describes a novel implementation framework allowing to use GPs as surrogate models for the CMA-ES. That framework findings encourage further research to advance the application of GPs in black-box optimization.

Text-to-Ontology Mapping via Natural Language Processing with Application to Search for Relevant Ontologies in Catalysis

Autoři

Korel, L.; Yorsh, U.; Behr, A.S.; Holeňa, M.; Kockmann, N.

Rok

2023

Publikováno

COMPUTERS. 2023, 12(1), 14-1-14-25. ISSN 2073-431X.

Typ

Článek

DOI

10.3390/computers12010014

Pracoviště

Katedra aplikované matematiky

Anotace

The paper presents a machine-learning based approach to text-to-ontology mapping. We explore a possibility of matching texts to the relevant ontologies using a combination of artificial neural networks and classifiers. Ontologies are formal specifications of the shared conceptualizations of application domains. While describing the same domain, different ontologies might be created by different domain experts. To enhance the reasoning and data handling of concepts in scientific papers, finding the best fitting ontology regarding description of the concepts contained in a text corpus. The approach presented in this work attempts to solve this by selection of a representative text paragraph from a set of scientific papers, which are used as data set. Then, using a pre-trained and fine-tuned Transformer, the paragraph is embedded into a vector space. Finally, the embedded vector becomes classified with respect to its relevance regarding a selected target ontology. To construct representative embeddings, we experiment with different training pipelines for natural language processing models. Those embeddings in turn are later used in the task of matching text to ontology. Finally, the result is assessed by compressing and visualizing the latent space and exploring the mappings between text fragments from a database and the set of chosen ontologies. To confirm the differences in behavior of the proposed ontology mapper models, we test five statistical hypotheses about their relative performance on ontology classification. To categorize the output from the Transformer, different classifiers are considered. These classifiers are, in detail, the Support Vector Machine (SVM), k-Nearest Neighbor, Gaussian Process, Random Forest, and Multilayer Perceptron. Application of these classifiers in a domain of scientific texts concerning catalysis research and respective ontologies, the suitability of the classifiers is evaluated, where the best result was achieved by the SVM classifier.

Using Paraphrasers to Detect Duplicities in Ontologies

Autoři

Korel, L.; Behr, A.S.; Kockmann, N.; Holeňa, M.

Rok

2023

Publikováno

Proceedings of the 15th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - (Volume 2). Madeira: SciTePress, 2023. p. 40-49. KEOD. vol. 2. ISSN 2184-3228. ISBN 978-989-758-671-2.

Typ

Stať ve sborníku

DOI

10.5220/0012164500003598

Pracoviště

Katedra aplikované matematiky

Anotace

This paper contains a machine-learning-based approach to detect duplicities in ontologies. Ontologies are formal specifications of shared conceptualizations of application domains. Merging and enhancing ontologies may cause the introduction of duplicities into them. The approach to duplicities proposed in this work presents a solution that does not need manual corrections by domain experts. Source texts consist of short textual descriptions from considered ontologies, which have been extracted and automatically paraphrased to receive pairs of sentences with the same or a very close meaning. The sentences in the received dataset have been embedded into Euclidean vector space. The classification task was to determine whether a given pair of sentence embeddings is semantically equivalent or different. The results have been tested using test sets generated by paraphrases as well as on a small real-world ontology. We also compared solutions by the most similar existing approach, based on GloVe and WordNet, with solutions by our approach. According to all considered metrics, our approach yielded better results than the compared approach. From the results of both experiments, the most suitable for the detection of duplicities in ontologies is the combination of BERT with support vector machines. Finally, we performed an ablation study to validate whether all paraphrasers used to create the training set for the classification were essential.

Neural-Network-Based Estimation of Normal Distributions in Black-Box Optimization

Autoři

Tumpach, J.; Koza, J.; Pitra, Z.; Holeňa, M.

Rok

2022

Publikováno

ESANN 2022 proceedings. Louvain la Neuve: Ciaco - i6doc.com, 2022. p. 187-192. ISBN 978-2-87587-084-1.

Typ

Stať ve sborníku

DOI

10.14428/esann/2022.ES2022-113

Pracoviště

Katedra aplikované matematiky

Anotace

The paper presents a novel application of artificial neural networks (ANNs) in the context of surrogate models for black-box optimization, i.e. optimization of objective functions that are accessed through empirical evaluation. For active learning of surrogate models, a very important role plays learning of multidimensional normal distributions, for which Gaussian processes (GPs) have been traditionally used. On the other hand, the research reported in this paper evaluated the applicability of two ANN-based methods to this end: combining GPs with ANNs and learning normal distributions with evidential ANNs. After methods sketch, the paper brings their comparison on a large collection of data from surrogate-assisted black-box optimization. It shows that combining GPs using linear covariance functions with ANNs yields lower errors than the investigated methods of evidential learning.

Using Artificial Neural Networks to Determine Ontologies Most Relevant to Scientific Texts

Autoři

Korel, L.; Behr, A.S.; Holeňa, M.; Kockmann, N.

Rok

2022

Publikováno

Proceedings of the 22nd Conference Information Technologies – Applications and Theory (ITAT 2022). CEUR-WS.org, 2022. p. 44-54. CEUR Workshop Proceedings. vol. 3226. ISSN 1613-0073.

Typ

Stať ve sborníku

Pracoviště

Katedra aplikované matematiky

Anotace

This paper provides an insight into the possibility of how to find ontologies most relevant to scientific texts using artificial neural networks. The basic idea of the presented approach is to select a representative paragraph from a source text file, embed it to a vector space by a pre-trained fine-tuned transformer, and classify the embedded vector according to its relevance to a target ontology. We have considered different classifiers to categorize the output from the transformer, in particular random forest, support vector machine, multilayer perceptron, k-nearest neighbors, and Gaussian process classifiers. Their suitability has been evaluated in a use case with ontologies and scientific texts concerning catalysis research. From results we can say the worst results have random forest. The best results in this task brought support vector machine classifier.

Unsupervised Construction of Task-Specific Datasets for Object Re-identification

Autoři

Pulc, P.; Holeňa, M.

Rok

2021

Publikováno

ICCTA 2021 Conference Proceedings. New York: Association for Computing Machinery, 2021. p. 66-72. ISBN 978-1-4503-9052-1.

Typ

Stať ve sborníku vyzvaná či oceněná

DOI

10.1145/3477911.3477922

Pracoviště

Katedra aplikované matematiky

Anotace

In the last decade, we have seen a significant uprise of deep neural networks in image processing tasks and many other research areas. However, while various neural architectures have successfully solved numerous tasks, they constantly demand more and more processing time and training data. Moreover, the current trend of using existing pre-trained architectures just as backbones and attaching new processing branches on top not only increases this demand but diminishes the explainability of the whole model. Our research focuses on combinations of explainable building blocks for the image processing tasks, such as object tracking. We propose a combination of Mask R-CNN, state-of-the-art object detection and segmentation neural network, with our previously published method of sparse feature tracking. Such a combination allows us to track objects by connecting detected masks using the proposed sparse feature tracklets. However, this method cannot recover from complete object occlusions and has to be assisted by an object re-identification. To this end, this paper uses our feature tracking method for a slightly different task: an unsupervised extraction of object representations that we can directly use to fine-tune an object re-identification algorithm. As we have to use objects masks already in the object tracking, our approach utilises the additional information as an alpha channel of the object representations, which further increases the precision of the re-identification. An additional benefit is that our fine-tuning method can be employed even in a fully online scenario.

Video Scene Location Recognition with Neural Networks

Autoři

Korel, L.; Pulc, P.; Tumpach, J.; Holeňa, M.

Rok

2021

Publikováno

Proceedings of the 21st Conference Information Technologies – Applications and Theory (ITAT 2021). Aachen: CEUR Workshop Proceedings, 2021. p. 85-93. vol. 2962. ISSN 1613-0073.

Typ

Stať ve sborníku

Pracoviště

Katedra softwarového inženýrství
Katedra aplikované matematiky

Anotace

This paper provides an insight into the possibility of scene recognition from a video sequence with a small set of repeated shooting locations (such as in television series) using artiﬁcial neural networks. The basic idea of the presented approach is to select a set of frames from each scene, transform them by a pre-trained single image preprocessing convolutional network, and classify the scene location with subsequent layers of the neural network. The considered networks have been tested and compared on a dataset obtained from The Big Bang Theory television series. We have investigated different neural network layers to combine individual frames, particularly AveragePooling, MaxPooling, Product, Flatten, LSTM, and Bidirectional LSTM layers. We have observed that only some of the approaches are suitable for the task at hand.

Active Learning for LSTM-autoencoder-based Anomaly Detection in Electrocardiogram Readings

Autoři

Šabata, T.; Holeňa, M.

Rok

2020

Publikováno

Proceedings of the Workshop on Interactive Adaptive Learning co-located with European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD 2020). Aachen: CEUR Workshop Proceedings, 2020. p. 72-77. ISSN 1613-0073.

Typ

Stať ve sborníku

Pracoviště

Katedra aplikované matematiky

Anotace

Recently, the amount of generated time series data has been increasing rapidly in many areas such as healthcare, security, meteorology and others. However, it is very rare that those time series are annotated. For this reason, unsupervised machine learning techniques such as anomaly detection are often used with such data. There exist many unsupervised algorithms for anomaly detection ranging from simple statistical techniques such as moving average or ARIMA till complex deep learning algorithms such as LSTM-autoencoder. For a nice overview of the recent algorithms we refer to read. Difficulties with the unsupervised approach are: defining an anomaly score to correctly represent how anomalous is the time series, and setting a threshold for that score to distinguish between normal and anomaly data. Supervised anomaly detection, on the other hand, needs an expensive involvement of a human expert. An additional problem with supervised anomaly detection is usually the occurrence of very low ratio of anomalies, yielding highly imbalanced data. In this extended abstract, we propose an active learning extension for an anomaly detector based on a LSTM-autoencoder. It performs active learning using various classification algorithms and addresses data imbalance with oversampling and under-sampling techniques. We are currently testing it on the ECG5000 dataset from the UCR time series classification archive.

Classification Methods for Internet Applications

Autoři

Holeňa, M.; Pulc, P.; Kopp, M.

Rok

2020

Publikováno

Cham: Springer, 2020. Studies in Big Data. vol. 69. ISSN 2197-6503. ISBN 978-3-030-36961-3.

Typ

Kniha

DOI

10.1007/978-3-030-36962-0

Pracoviště

Katedra aplikované matematiky

Anotace

This book explores internet applications in which a crucial role is played by classification, such as spam filtering, recommender systems, malware detection, intrusion detection and sentiment analysis. It explains how such classification problems can be solved using various statistical and machine learning methods, including K nearest neighbours, Bayesian classifiers, the logit method, discriminant analysis, several kinds of artificial neural networks, support vector machines, classification trees and other kinds of rule-based methods, as well as random forests and other kinds of classifier ensembles. The book covers a wide range of available classification methods and their variants, not only those that have already been used in the considered kinds of applications, but also those that have the potential to be used in them in the future. The book is a valuable resource for post-graduate students and professionals alike.

Two Semi-supervised Approaches to Malware Detection with Neural Networks

Autoři

Koza, J.; Krčál, M.; Holeňa, M.

Rok

2020

Publikováno

Proceedings of the 20th Conference Information Technologies - Applications and Theory (ITAT 2020). Aachen: CEUR Workshop Proceedings, 2020. p. 176-185. ISSN 1613-0073.

Typ

Stať ve sborníku

Pracoviště

Katedra aplikované matematiky

Anotace

Semi-supervised learning is characterized by using the additional information from the unlabeled data. In this paper, we compare two semi-supervised algorithms for deep neural networks on a large real-world malware dataset. Specifically, we evaluate the performance of a rather straightforward method called Pseudo-labeling, which uses unlabeled samples, classified with high confidence, as if they were the actual labels. The second approach is based on an idea to increase the consistency of the network’s prediction under altered circumstances. We implemented such an algorithm called Π-model, which compares outputs with different data augmentation and different dropout setting. As a baseline, we also provide results of the same deep network, trained in the fully supervised mode using only the labeled data. We analyze the prediction accuracy of the algorithms in relation to the size of the labeled part of the training dataset.

Comparing rule mining approaches for classification with reasoning

Autoři

Kopp, M.; Bajer, L.; Jílek, M.; Holeňa, M.

Rok

2018

Publikováno

Proceedings of the 18th Conference Information Technologies - Applications and Theory (ITAT 2018). Aachen: CEUR Workshop Proceedings, 2018. p. 52-58. vol. 2203. ISSN 1613-0073. ISBN 9781727267198.

Typ

Stať ve sborníku

Pracoviště

Katedra aplikované matematiky

Anotace

Classification serves an important role in domains such as network security or health care. Although these domains require understanding of the classifier’s decision, there are only a few classification methods trying to justify or explain their results. Classification rules and decision trees are generally considered comprehensible. Therefore, this study compares the classification performance and comprehensibility of a random forest classifier with classification rules extracted by Frequent Item Set Mining, Logical Item Set Mining and by the Explainer algorithm, which was previously proposed by the authors.

Hierarchical Motion Tracking Using Matching of Sparse Features

Autoři

Pulc, P.; Holeňa, M.

Rok

2018

Publikováno

Proceedings of the 14th International Conference on Signal-Image Technology & Internet-Based Systems (SITIS). Los Alamitos: IEEE Computer Society, 2018. p. 449-456. ISBN 978-1-5386-9385-8.

Typ

Stať ve sborníku

DOI

10.1109/SITIS.2018.00075

Pracoviště

Katedra aplikované matematiky

Anotace

Fundamental approaches in motion tracking are based on registration of pixel patches from one frame to another. To ensure invariance to some changes in the image and improve the speed of discovering a match, a pyramidal approach is used to steer the process faster to optima. However, registration of the patches in high resolution is still computationally expensive. Because we require the algorithm to process Ultra HD video content in real time on commonly available hardware, especially on mid-tier graphics processing units, approaches using matching of pixel patches are not feasible. In this paper, we present and evaluate an approach inspired by motion tracking on an image pyramid. However, instead of comparing pixel patches one to another, we utilise binary image descriptors that are much shorter and inherently use a Hamming distance for their direct comparison. Evaluation of our implementation, which is available on GitHub, was carried out on the Multiple Object Tracking challenge dataset.

Motion Segmentation by Semi-Supervised Classification in Dynamic Scenery

Autoři

Pulc, P.; Keruľ-Kmec, O.; Šabata, T.; Holeňa, M.

Rok

2018

Publikováno

Proceedings of Poster Session of 3rd ECML/PKDD Workshop on Advanced Analytics and Learning on Temporal Data (AALTD 2018). Diblin: University College Dublin, 2018. p. 65-72.

Typ

Stať ve sborníku

Pracoviště

Katedra aplikované matematiky

Anotace

Automatic description of multimedia content heavily relies on the ability to discover a structure in such data. As our current focus is given to efficient multimedia indexing, we are mainly interested in discovery and segmentation of foreground objects from the background and their respective description or classification. Although many approaches based on Convolution Neural Networks have emerged lately, they are usually executed on all frames from the me- dia separately which is, to our belief, wasteful and poorly scalable. On the other hand, methods based on visual Simultaneous Localisation and Mapping (visual SLAM) utilise the temporal structure of the motion picture to extract at first a model of the environment and objects in the scene and later pass these models to methods for object description. In this paper, we will discuss the first two parts of the visual SLAM – motion tracking and segmentation. While many approaches impose strict restrictions in the segmentation phase to filter motion tracking outliers, we introduce restrictions to the motion tracking itself. Such approach enables us to use of-the-shelf semi-supervised classification methods in the motion segmentation phase without explicit outlier filtering.

Semi-supervised and Active Learning in Video Scene Classification from Statistical Features

Autoři

Šabata, T.; Pulc, P.; Holeňa, M.

Rok

2018

Publikováno

Proceedings of the Workshop on Interactive Adaptive Learning (IAL 2018) co-located with European Conference on Machine Learning (ECML 2018) and Principles and Practice of Knowledge Discovery in Databases (PKDD 2018). Aachen: CEUR Workshop Proceedings, 2018. p. 24-35. ISSN 1613-0073.

Typ

Stať ve sborníku

Pracoviště

Katedra aplikované matematiky

Anotace

In multimedia classification, the background is usually con- sidered an unwanted part of input data and is often modeled only to be removed in later processing. Contrary to that, we believe that a back- ground model (i.e., the scene in which the picture or video shot is taken) should be included as an essential feature for both indexing and follow- up content processing. Information about image background, however, is not usually the main target in the labeling process and the number of annotated samples is very limited. Therefore, we propose to use a combination of semi-supervised and active learning to improve the performance of our scene classifier, specifically a combination of self-training with uncertainty sampling. As a result, we utilize a combination of statistical features extractor, a feed-forward neural network and support vector machine classifier, which consistently achieves higher accuracy on less diverse data. With the proposed ap- proach, we are currently able to achieve precision over 80% on a dataset trained on a single series of a popular TV show.

Semisupervised segmentation of UHD video

Autoři

Keruľ-Kmec, O.; Pulc, P.; Holeňa, M.

Rok

2018

Publikováno

Proceedings of the 18th Conference Information Technologies - Applications and Theory (ITAT 2018). Aachen: CEUR Workshop Proceedings, 2018. p. 100-107. vol. 2203. ISSN 1613-0073. ISBN 9781727267198.

Typ

Stať ve sborníku

Pracoviště

Katedra aplikované matematiky

Anotace

One of the key preprocessing tasks in informa- tion retrieveal from video is the segmentation of the scene, primarily its segmentation into foreground objects and the background. This is actually a classification task, but with the specific property that it is very time consuming and costly to obtain human-labelled training data for classifier training. That suggests to use semisupervised classifiers to this end. The presented work in progress reports the inves- tigation of semisupervised classification methods based on cluster regularization and on fuzzy c-means in connection with the foreground / background segmentation task. To classify as many video frames as possible using only a single human-based frame, the semisupervised classifica- tion is combined with a frequently used keypoint detec- tor based on a combination of a corner detection method with a visual descriptor method. The paper experimentally compares both methods, and for the first of them, also clas- sifiers with different delays between the human-labelled video frame and classifier training.

Sentiment analysis from utterances

Autoři

Kožusznik, J.; Pulc, P.; Holeňa, M.

Rok

2018

Publikováno

Proceedings of the 18th Conference Information Technologies - Applications and Theory (ITAT 2018). Aachen: CEUR Workshop Proceedings, 2018. p. 92-99. vol. 2203. ISSN 1613-0073. ISBN 9781727267198.

Typ

Stať ve sborníku

Pracoviště

Katedra softwarového inženýrství
Katedra aplikované matematiky

Anotace

The recognition of emotional states in speech is starting to play an increasingly important role. However, it is a complicated process, which heavily relies on the extraction and selection of utterance features related to the emotional state of the speaker. In the reported research, MPEG-7 low level audio descriptors[10] serve as features for the recognition of emotional categories. To this end, a methodology combining MPEG-7 with several important kinds of classifiers is elaborated.

Breaking CAPTCHAs with Convolutional Neural Networks

Autoři

Kopp, M.; Nikl, M.; Holeňa, M.

Rok

2017

Publikováno

ITAT 2017: Information Technologies – Applications and Theory. Aachen: CEUR Workshop Proceedings, 2017. p. 93-99. vol. 1885. ISSN 1613-0073.

Typ

Stať ve sborníku

Pracoviště

Katedra teoretické informatiky
Katedra aplikované matematiky

Anotace

This paper studies reverse Turing tests to distinguish humans and computers, called CAPTCHA. Contrary to classical Turing tests, in this case the judge is not a human but a computer. The main purpose of such tests is securing user logins against the dictionary or brute force password guessing, avoiding automated usage of various services, preventing bots from spamming on forums and many others. Typical approaches to solving text-based CAPTCHA automatically are based on a scheme specific pipeline containing hand-designed pre-processing, denoising, segmentation, post processing and optical character recognition. Only the last part, optical character recognition, is usually based on some machine learning algorithm. We present an approach using neural networks and a simple clustering algorithm that consists of only two steps, character localisation and recognition. We tested our approach on 11 different schemes selected to present very diverse security features. We experimentally show that using convolutional neural networks is superior to multi-layered perceptrons.

K-best Viterbi Semi-supervized Active Learning in Sequence Labelling

Autoři

Šabata, T.; Borovička, T.; Holeňa, M.

Rok

2017

Publikováno

CEUR workshop proceedings. 2017, 2017 144-152. ISSN 1613-0073.

Typ

Článek

Pracoviště

Katedra teoretické informatiky

Anotace

In application domains where there exists a large amount of unlabelled data but obtaining labels is expensive, active learning is a useful way to select which data should be labelled. In addition to its traditional successful use in classification and regression tasks, active learning has been also applied to sequence labelling. According to the standard active learning approach, sequences for which the labelling would be the most informative should be labelled. However, labelling the entire sequence may be inefficient as for some its parts, the labels can be predicted using a model. Labelling such parts brings only a little new information. Therefore in this paper, we investigate a sequence labelling approach in which in the sequence selected for labelling, the labels of most tokens are predicted by a model and only tokens that the model can not predict with sufficient confidence are labelled. Those tokens are identified using the k-best Viterbi algorithm.

Towards Real-time Motion Estimation in High-Definition Video Based on Points of Interest

Autoři

Pulc, P.; Holeňa, M.

Rok

2017

Publikováno

Proceedings of the 2017 Federated Conference on Computer Science and Information Systems. Katowice: Polish Information Processing Society, 2017. p. 67-70. Annals of Computer Science and Information Systems. vol. 11. ISSN 2300-5963. ISBN 978-83-946253-7-5.

Typ

Stať ve sborníku

DOI

10.15439/2017F417

Pracoviště

Katedra teoretické informatiky

Anotace

Currently used motion estimation is usually based on a computation of optical flow from individual images or short sequences. As these methods do not require an extraction of the visual description in points of interest, correspondence can be deduced only by the position of such points. In this paper, we propose an alternative motion estimation method solely using a binary visual descriptor. By tuning the internal parameters, we achieve either a detection of longer time series or a higher number of shorter series in a shorter time. As our method uses the visual descriptors, their values can be directly used in more complex visual detection tasks.

Application of Meta-learning Principles in Multimedia Indexing

Autoři

Pulc, P.; Holeňa, M.

Rok

2016

Publikováno

DATESO 2016: Databases, Texts, Specifications, and Objects. Ostrava: Vysoká škola báňská - Technická univerzita Ostrava. Archiv VŠB-TUO, 2016. p. 1-11. ISBN 978-80-248-4031-4.

Typ

Stať ve sborníku

Pracoviště

Katedra teoretické informatiky
Katedra softwarového inženýrství

Anotace

Databases of video content traditionally rely on annotations and meta-data imported by a person, usually the uploader. This is supposedly due to a lack of an universal approach to the automated multimedia content annotation. As it may be hard or impossible to find a single classifier for all encountered combinations of different modalities or even a network of the classifiers, current interest of our research is to use meta-learning for multiple stages of the multimedia content classification. With this, we hope to handle correctly all modalities involved including their overlaps. Successively, the extracted classes will be used to build the index and later used for searching and discovery in the multimedia.

How to Mimic Humans, Guide for Computers

Autoři

Kopp, M.; Pištora, M.; Holeňa, M.

Rok

2016

Publikováno

ITAT 2016: Information Technologies - Applications and Theory: Conference on Theory and Practice of Information Technologies. Luxemburg: CreateSpace Independent Publishing Platform, 2016. p. 110-117. ISBN 978-1-5370-1674-0.

Typ

Stať ve sborníku

Pracoviště

Katedra teoretické informatiky

Anotace

This paper studies reverse Turing tests to tell humans and computers apart. Contrary to classical Turing tests, the judge is not a human but a computer. These tests are often called Completely Automated Public Turing tests to tell Computers and Humans Apart (CAPTCHA). The main purpose of such test is avoiding automated usage of various services, preventing bots from spamming on forums, securing user logins against dictionary or brute-force password guessing and many others. During years, a diversity of tests appeared. In this paper, we focused on the two most classical and widespread schemes, which are text-based and audio-based CAPTCHA, and on their use in the Czech internet environment. The goal of this paper is to point out flaws and weak spots of often used solutions and consequent security risks. To this end, we pipelined several relatively easy algorithms like flood fill algorithm and k-nearest neighbours, to overcome CAPTCHA challenges at several web pages, including state administration.

Image Processing in Collaborative Open Narrative Systems

Autoři

Pulc, P.; Rosenzveig, E.; Holeňa, M.

Rok

2016

Publikováno

ITAT 2016: Information Technologies - Applications and Theory: Conference on Theory and Practice of Information Technologies. Luxemburg: CreateSpace Independent Publishing Platform, 2016. p. 155-162. ISSN 1613-0073.

Typ

Stať ve sborníku

Pracoviště

Katedra teoretické informatiky

Anotace

Open narrative approach enables the creators of multimedia content to create multi-stranded, navigable narrative environments. The viewer is able to navigate such space depending on author’s predetermined constraints, or even browse the open narrative structure arbitrarily based on their interests. This philosophy is used with great advantage in the collaborative open narrative system NARRA. The platform creates a possibility for documentary makers, journalists, activists or other artists to link their own audiovisual material to clips of other authors and finally create a navigable space of individual multimedia pieces. To help authors focus on building the narratives themselves, a set of automated tools have been proposed. Most obvious ones, as speech-to-text, are already incorporated in the system. However other, more complicated authoring tools, primarily focused on creating metadata for the media objects, are yet to be developed. Most complex of them involve an object description in media (with unrestricted motion, action or other features) and detection of near-duplicates of video content, which is the focus of our current interest. In our approach, we are trying to use motion-based features and register them across the whole clip. Using GridCut algorithm to segment the image, we then try to select only parts of the motion picture, that are of our interest for further processing. For the selection of suitable description methods, we are developing a meta-learning approach. This will supposedly enable automatic annotation based not only on clip similarity per se, but rather on detected objects present in the shot.

Modeling and Clustering the Behavior of Animals Using Hidden Markov Models

Autoři

Šabata, T.; Borovička, T.; Holeňa, M.

Rok

2016

Publikováno

CEUR workshop proceedings. 2016, 2016(1649), 172-178. ISSN 1613-0073.

Typ

Článek

Pracoviště

Katedra teoretické informatiky

Anotace

The objectives of this article are to model behavior of individual animals and to cluster the resulting models in order to group animals with similar behavior patterns. Hidden Markov models are considered suitable for clustering purposes. Their clustering is well studied, however, only if the observable variables can be assumed to be Gaussian mixtures, which is not valid in our case. Therefore, we use the Kullback-Leibler divergence to cluster hidden Markov models with observable variables that have an arbitrary distribution. Hierarchical and spectral clustering is applied. To evaluate the modeling approach, an experiment was performed and an accuracy of 83.86% was reached in predicting behavioral sequences of individual animals. Results of clustering were evaluated by means of statistical descriptors of the animals and by a domain expert, both methods confirm that the results of clustering are meaningful.

Modeling and Clustering the Behavior of Animals Using Hidden Markov Models.

Autoři

Šabata, T.; Borovička, T.; Holeňa, M.

Rok

2016

Publikováno

Proceedings ITAT 2016: Information Technologies - Applications and Theory.. Luxemburg: CreateSpace Independent Publishing Platform, 2016. p. 172-178. ISBN 978-1-5370-1674-0.

Typ

Stať ve sborníku

Pracoviště

Katedra teoretické informatiky

Testing Gaussian Process Surrogates on CEC’2013 Multi-Modal Benchmark.

Autoři

Orekhov, N.; Bajer, L.; Holeňa, M.

Rok

2016

Publikováno

Proceedings ITAT 2016: Information Technologies - Applications and Theory.. Luxemburg: CreateSpace Independent Publishing Platform, 2016. p. 138-146. ISBN 978-1-5370-1674-0.

Typ

Stať ve sborníku

Pracoviště

Katedra teoretické informatiky

Evaluation of Association Rules Extracted during Anomaly Explanation.

Autoři

Kopp, M.; Holeňa, M.

Rok

2015

Publikováno

ITAT 2015 conference proceedings. Aachen: CEUR Workshop Proceedings, 2015. pp. 143-149. ISSN 1613-0073. ISBN 978-1-5151-2065-0.

Typ

Stať ve sborníku

Pracoviště

Katedra teoretické informatiky

Anotace

Autoři

Kopp, M.; Holeňa, M.

Rok

2013

Publikováno

ITAT 2013: Information Technologies—Applications and Theory Workshops, Posters, and Tutorials. Luxemburg: CreateSpace Independent Publishing Platform, 2013. pp. 92-99. ISBN 9781490952086.

Typ

Stať ve sborníku

Anotace

This paper describes the design and comparison of two rule based fuzzy classifiers, which are subsequently enhanced with respect to comprehensibility. The first approach is based on random forests. The second is a specific kind of a fuzzy decision tree which is built on information granules extracted from results of a fuzzy clustering algorithm. Membership functions of fuzzy sets are fitted to the outputs of these two tree based classifiers. This gives us logical rules, which have a better comprehensibility. These two approaches are compared to each other and with a support vector machine classifier as a representative of precise classifiers. The comparison is performed on data concerning network security.

Improving the Model Guided Sampling Optimization by Model Search and Slice Sampling

Autoři

Bajer, L.; Holeňa, M.; Charypar, V.

Rok

2013

Publikováno

ITAT 2013: Information Technologies - Applications and Theory Workshops, Posters, and Tutorials.. 2013, pp. 86-91. ISBN 978-1-4909-5208-6.

Typ

Stať ve sborníku

Pracoviště

Katedra teoretické informatiky

Using machine learning methods in a personalized reputation system.

Autoři

Pejla, J.; Holeňa, M.

Rok

2013

Publikováno

ITAT 2013: Information Technologies - Applications and Theory Workshops, Posters, and Tutorials.. 2013, pp. 104-110. ISBN 978-1-4909-5208-6.

Typ

Stať ve sborníku

Pracoviště

Katedra teoretické informatiky

Computing the correlation between catalyst composition and its performance in the catalysed process.

Autoři

Holeňa, M.; Steinfeldt, N.; Baerns, M.; Štefka, D.

Rok

2012

Publikováno

Computers and Chemical Engineering. 2012, 55-67. ISSN 0098-1354.

Typ

Článek

Pracoviště

Katedra teoretické informatiky

Conformal Sets in Neural Network Regression

Autoři

Demut, R.; Holeňa, M.

Rok

2012

Publikováno

Proceedings of Conference on Theory and Practice of information Technologies. Košice: Univerzita P. J. Šafárika, 2012. pp. 17-24. ISBN 978-80-971144-0-4.

Typ

Stať ve sborníku

Anotace

This paper is concerned with predictive regions in regression models, especially neural networks. We use the concept of conformal prediction (CP) to construct re- gions which satisfy given confidence level. Conformal pre- diction outputs regions, which are automatically valid, but their width and therefore usefulness depends on the used nonconformity measure. A nonconformity measure should tell us how different a given example is with respect to other examples. We define nonconformity measures based on some reliability estimates such as variance of a bagged model or local modeling of prediction error. We also present results of testing CP based on different nonconformity mea- sures showing their usefulness and comparing them to tra- ditional confidence intervals.

Conformal sets in neural network regression.

Autoři

Demut, R.; Holeňa, M.

Rok

2012

Publikováno

Proceedings of Conference on Theory and Practice of information Technologies. Košice: Univerzita P. J. Šafárika, 2012. p. 17-24. ISBN 978-80-971144-0-4.

Typ

Stať ve sborníku

Pracoviště

Katedra teoretické informatiky

Evolutionary optimization with active learning of surrogate models and fixed evaluation batch size

Autoři

Charypar, V.; Holeňa, M.

Rok

2012

Publikováno

Proceedings of Conference on Theory and Practice of information Technologies. Košice: Univerzita P. J. Šafárika, 2012. pp. 33-40. ISBN 978-80-971144-0-4.

Typ

Stať ve sborníku

Anotace

Evolutionary optimization is often applied to problems, where simulations or experiments used as the fit ness function are expensive to run. In such cases, surro gate models are used to reduce the number of fitness eval uations. Some of the problems also require a fixed size batch of solutions to be evaluated at a time. Traditional methods of selecting individuals for true evaluation to im prove the surrogate model either require individual points to be evaluated, or couple the batch size with the EA gener ation size. We propose a queue based method for individual selection based on active learning of a kriging model. Indi viduals are selected using the confidence intervals predicted by the model, added to a queue and evaluated once the queue length reaches the batch size. The method was tested on several standard benchmark problems. Results show that the proposed algorithm is able to achieve a solution using significantly less evaluations of the true fitness function. The effect of the batch size as well as other parameters is discussed.

Evolutionary optimization with active learning of surrogate models and fixed evaluation batch size.

Autoři

Charypar, V.; Holeňa, M.

Rok

2012

Publikováno

Proceedings of Conference on Theory and Practice of information Technologies. Košice: Univerzita P. J. Šafárika, 2012. p. 33-40. ISBN 978-80-971144-0-4.

Typ

Stať ve sborníku

Pracoviště

Katedra teoretické informatiky

Model-assisted evolutionary optimization with fixed evaluation batch size

Autoři

Charypar, V.; Holeňa, M.

Rok

2012

Publikováno

Doktorandské dny 2012. Praha: Česká technika - nakladatelství ČVUT, 2012. pp. 105-114. ISBN 978-80-01-05138-2.

Typ

Stať ve sborníku

Anotace

Some black-box optimization problems involve long-running simulations or expensive experiments as the goal function. To enable use of evolutionary algorithms, surrogate models are used to reduce the number of function evaluations. In adaptive model building strategies, some individuals are selected for true function evaluation in order to improve the model. When the experiment or simulation requires a fixed size batch of solutions to evaluate, traditional selection strategies either cannot be used or couple the batch size with the EA generation size. We propose a queue based method for model-assisted optimization using active learning of a kriging model, where individuals are selected based on the model predictor error estimate. The method was tested on standard benchmark problems and the effects of batch size was studied. Results indicate that the proposed method significantly reduces the number of true fitness evaluation compared to a traditional EA.

Assessing the Suitability of Surrogate Models in Evolutionary Optimization

Autoři

Demut, R.; Holeňa, M.

Rok

2011

Publikováno

Information Technologies - Applications and Theory. 2011, pp. 31-38. ISBN 978-80-89557-02-8.

Typ

Stať ve sborníku

Pracoviště

Katedra teoretické informatiky

Dynamic Classifier Aggregation Using Fuzzy t-conorm Integral

Autoři

Štefka, D.; Holeňa, M.

Rok

2011

Publikováno

Proceedings of the 7th International Conference on Signal Image Technology & Internet Based Systems. Los Alamitos: IEEE Computer Society. Los Alamitos: IEEE Computer Society, 2011. p. 126-133. ISBN 978-1-4673-0431-3.

Typ

Stať ve sborníku

Pracoviště

Katedra teoretické informatiky

Assessing the Usability of Predictions of Different Regression Models.

Autoři

Šťastný, J.; Holeňa, M.

Rok

2010

Publikováno

Informačné Technológie - Aplikácie a Teória. Seňa: PONT s.r.o., 2010, pp. 93-98. ISBN 978-80-970179-3-4.

Typ

Stať ve sborníku

Pracoviště

Katedra teoretické informatiky

Dynamic Classifier Aggregation using Fuzzy Integral with Interaction-Sensitive Fuzzy Measure

Autoři

Štefka, D.; Holeňa, M.

Rok

2010

Publikováno

Proceedings of the 2010 10th International Conference on Intelligent Systems Design and Applications. 2010. pp. 225-230. ISBN 978-1-4244-8135-4.

Typ

Stať ve sborníku

DOI

10.1109/ISDA.2010.5687260

Pracoviště

Katedra teoretické informatiky

Classifier Aggregation Using Local Classification Confidence.

Autoři

Štefka, D.; Holeňa, M.

Rok

2009

Publikováno

ICAART 2009. Setúbal: INSTICC Press, 2009, pp. 173-178. ISBN 978-989-8111-66-1.

Typ

Stať ve sborníku

Pracoviště

Katedra teoretické informatiky

Dynamic Classifier Systems and their Applications to Random Forest Ensembles

Autoři

Štefka, D.; Holeňa, M.

Rok

2009

Publikováno

Adaptive and Natural Computing Algorithms. Heidelberg: Springer, 2009, pp. 458-468. LNCS. ISSN 0302-9743. ISBN 978-3-642-04920-0.

Typ

Stať ve sborníku

Pracoviště

Katedra teoretické informatiky

Fuzzy Logic and Piecewise-Linear Regression.

Autoři

Fröhlich, J.; Holeňa, M.

Rok

2008

Publikováno

ITAT 2008 - Information Technologies - Applications and Theory. Košice: Univerzita P.J.Šafárika, 2008, pp. 35-38. ISBN 978-80-969184-8-5.

Typ

Stať ve sborníku

Pracoviště

Katedra teoretické informatiky

Classification of EEG Data using Fuzzy k-NN Ensembles

Autoři

Štefka, D.; Holeňa, M.

Rok

2007

Publikováno

ITAT 2007. Conference on Theory and Practice of Information Technologies. 2007, pp. 91-94. ISBN 978-80-969184-6-1.

Typ

Stať ve sborníku

Pracoviště

Katedra teoretické informatiky

The Use of Fuzzy t-conorm Integral for Combining Classifiers.

Autoři

Štefka, D.; Holeňa, M.

Rok

2007

Publikováno

Symbolic and Quantitative Approaches to Reasoning with Uncertainty. Berlin: Springer, 2007, pp. 755-766. Lecture Notes in Computer Science. ISSN 0302-9743. ISBN 978-3-540-75255-4.

Typ

Stať ve sborníku

Pracoviště

Katedra teoretické informatiky

Use of Mamdani-Assilian Fuzzy Controller for Combining Classifiers

Autoři

Štefka, D.; Holeňa, M.

Rok

2007

Publikováno

Sborník semináře MIS 2007. Praha: Matematicko Fyzikální Fakulta, UK, 2007, pp. 88-97. ISBN 978-80-7378-033-3.

Typ

Stať ve sborníku

Pracoviště

Katedra teoretické informatiky

Using Fuzzy k-NN Ensembles in EEG Data Classification

Autoři

Štefka, D.; Holeňa, M.

Rok

2007

Publikováno

Neuroinformatic Databases and Mining of Knowledge of them (Third book on Micro-sleeps). Praha: ČVUT v Praze, Fakulta dopravní, Ústav řidicí techniky a telematiky, 2007. ISBN 978-80-87136-01-0.

Typ

Kapitola v knize

Pracoviště

Katedra teoretické informatiky

Using Fuzzy k-NN Ensembles in EEG Data Classification.

Autoři

Štefka, D.; Holeňa, M.

Rok

2007

Publikováno

Neuroinformatic Databases and Mining of Knowledge of Them.. Prague: Czech Technical University, 2007. ISBN 978-80-87136-01-0.

Typ

Stať ve sborníku

Pracoviště

Katedra teoretické informatiky

The Specificity of Neural Networks in Extracting Rules from Data

Autoři

Holeňa, M.

Rok

2006

Publikováno

Applied Artificial Intelligence. London: World Scientific, 2006, ISBN 981-256-690-2.

Typ

Stať ve sborníku

Pracoviště

Katedra teoretické informatiky

prof. Ing. RNDr. Martin Holeňa, CSc.

Publikace

Suitability of Modern Neural Networks for Active and Transfer Learning in Surrogate-Assisted Black-Box Optimization

Textual embeddings with word-type-weighted word2vec

Improving Optimization with Gaussian Processes in the Covariance Matrix Adaptation Evolution Strategy

Text-to-Ontology Mapping via Natural Language Processing with Application to Search for Relevant Ontologies in Catalysis

Using Paraphrasers to Detect Duplicities in Ontologies

Neural-Network-Based Estimation of Normal Distributions in Black-Box Optimization

Using Artificial Neural Networks to Determine Ontologies Most Relevant to Scientific Texts

Unsupervised Construction of Task-Specific Datasets for Object Re-identification

Video Scene Location Recognition with Neural Networks

Active Learning for LSTM-autoencoder-based Anomaly Detection in Electrocardiogram Readings

Classification Methods for Internet Applications

Two Semi-supervised Approaches to Malware Detection with Neural Networks

Comparing rule mining approaches for classification with reasoning

Hierarchical Motion Tracking Using Matching of Sparse Features

Motion Segmentation by Semi-Supervised Classification in Dynamic Scenery

Semi-supervised and Active Learning in Video Scene Classification from Statistical Features

Semisupervised segmentation of UHD video

Sentiment analysis from utterances

Breaking CAPTCHAs with Convolutional Neural Networks

K-best Viterbi Semi-supervized Active Learning in Sequence Labelling

Towards Real-time Motion Estimation in High-Definition Video Based on Points of Interest

Application of Meta-learning Principles in Multimedia Indexing

How to Mimic Humans, Guide for Computers

Image Processing in Collaborative Open Narrative Systems

Modeling and Clustering the Behavior of Animals Using Hidden Markov Models

Modeling and Clustering the Behavior of Animals Using Hidden Markov Models.

Testing Gaussian Process Surrogates on CEC’2013 Multi-Modal Benchmark.

Evaluation of Association Rules Extracted during Anomaly Explanation.

Investigation of Gaussian Processes in the Context of Black-Box Evolutionary Optimization

Search for Structure in Audiovisual Recordings of Lectures and Conferences

Case Study in Approaches to the Classification of Audiovisual Recordings of Lectures and Conferences

Interpreting and clustering outliers with sapling random forests

Interpreting and clustering outliers with sapling random forests

Design and comparison of two rule-based fuzzy classifiers for computer security.

Improving the Model Guided Sampling Optimization by Model Search and Slice Sampling

Using machine learning methods in a personalized reputation system.

Computing the correlation between catalyst composition and its performance in the catalysed process.

Conformal Sets in Neural Network Regression

Conformal sets in neural network regression.

Evolutionary optimization with active learning of surrogate models and fixed evaluation batch size

Evolutionary optimization with active learning of surrogate models and fixed evaluation batch size.

Model-assisted evolutionary optimization with fixed evaluation batch size

Assessing the Suitability of Surrogate Models in Evolutionary Optimization

Dynamic Classifier Aggregation Using Fuzzy t-conorm Integral

Assessing the Usability of Predictions of Different Regression Models.

Dynamic Classifier Aggregation using Fuzzy Integral with Interaction-Sensitive Fuzzy Measure

Classifier Aggregation Using Local Classification Confidence.

Dynamic Classifier Systems and their Applications to Random Forest Ensembles

Fuzzy Logic and Piecewise-Linear Regression.

Classification of EEG Data using Fuzzy k-NN Ensembles

The Use of Fuzzy t-conorm Integral for Combining Classifiers.

Use of Mamdani-Assilian Fuzzy Controller for Combining Classifiers

Using Fuzzy k-NN Ensembles in EEG Data Classification

Using Fuzzy k-NN Ensembles in EEG Data Classification.

The Specificity of Neural Networks in Extracting Rules from Data