doc. Ing. Jan Janoušek, Ph.D.

Head of the Department of Theoretical Computer Science

Projects

Algorithms for Processing Tree Data Structures and Data Compression

Program
Studentská grantová soutěž ČVUT
Code
SGS15/116/OHK3/1T/18
Period
2015
Description
With the vast amount of data needed to be archived, indexed and procesed, special data structures are required. The tree is a typical data structure which is used very often for hierarchically storing data. Specialized algorithms are needed for indexing tree data structures and also accessing, extracting and analyzing data stored in them. Another goal of this project is a design and implementation of novel methods of data compression and indexing in two areas: DNA sequence compression using dictionary methods and approximate pattern matching in genomes.

Algorithms for Processing Tree Structures and Data Compression

Program
Studentská grantová soutěž ČVUT
Code
SGS14/101/OHK3/1T/18
Period
2014
Description
With the vast amount of data needed to be archived, indexed and procesed, special data structures are required. The tree is a typical data structure which is used very often for hierarchically storing data. Specialized algorithms are needed for indexing tree data structures and also accessing, extracting and analyzing data stored in them. The aim of this research is to design efficient yet simple to understand algorithms dealing with tree pattern matching (both exact and approximate) and tree indexing, and provide a toolkit implementation. Another goal of this project is a design and implementation of novel methods of data compression in two areas: DNA sequence compression and difference compression of files.

Alogirthms for Processing Tree Data Structures, Implementing Programming Languges and Data Compression

Program
Studentská grantová soutěž ČVUT
Code
SGS17/209/OHK3/3T/18
Period
2017 - 2019
Description
With the vast amount of data needed to be archived, indexed and procesed, efficient data structures and algorithms are required. The tree is a typical data structure which is used very often for hierarchically storing data. Another goal of this project is to design and implement novel methods for data indexing combined with data compression and methods for various approximate pattern matching over the indexes. The indexes and pattern matching find applications in searching in DNA and RNA sequences. Another topic of our research is the area of algorithms for implementing dynamic programming languages.

Governance support tools for dynamic aspects of Big Data environments

Program
Programme of applied research and experimental development EPSILON
Provider
Technology Agency of the Czech Republic
Code
TH02010287
Period
2017 - 2019
Description
The key trend in management has been to attempt to make all decisions data driven. A critical element in this effort has been the data warehouse or, more broadly, business intelligence (BI). Fast changes are forcing companies to develop faster, and fast development requires effective BI management, so-called data governance (DG). Without solid DG processes and rapid development in place, companies are not able to use their own data successfully. The ongoing big data revolution brings more issues to an already complex field and shows that current DG technologies are not ready for the future. The goal of this project is to create tools which will ensure effective DG in big data environments and thereby protect the billions of dollars invested into BI development worldwide.

Processing Tree Data Structures and Data Compression

Program
Studentská grantová soutěž ČVUT
Code
SGS16/118/OHK3/1T/18
Period
2016
Description
With the vast amount of data needed to be archived, indexed and procesed, efficient data structures and algorithms are required. The tree is a typical data structure which is used very often for hierarchically storing data. Another goal of this project is to design and implement novel methods for data indexing combined with data compression and methods for various approximate pattern matching over the indexes. The indexes and pattern matching find applications in searching in DNA and RNA sequences.

Processing Tree Structures and Data Compression

Program
Studentská grantová soutěž ČVUT
Code
SGS13/097/OHK3/1T/18
Period
2013
Description
With the vast amount of data needed to be archived, indexed and procesed, special data structures are required. The tree is a typical data structure which is used very often for hierarchically storing data. Specialized algorithms are needed for indexing tree data structures and also accessing, extracting and analyzing data stored in them. The aim of this research is to design efficient yet simple to understand algorithms dealing with tree pattern matching (both exact and approximate) and tree indexing, and provide a toolkit implementation. Another goal of this project is design and implementation of novel methods of data compression in two areas: first, music score compression; second, natural language compression.

Searching in trees

Program
Studentská grantová soutěž ČVUT
Code
SGS10/225/OHK3/2T/18
Period
2010 - 2011
Description
With the vast amount of data needed to be archived, indexed and procesed, special data structures are required. The tree is a typical data structure which is used very often for hierarchically storing data. Specialized algorithms are needed for indexing tree data structures and also accessing, extracting and analyzing data stored in them. Examples of applications can be the optimization of abstract syntax trees in the process of compiling, term-rewriting, mechanical theorem proving, searching in phylogenic trees, indexing the secondary structure of RNA, searching in XML documents and evaluating the source code of functional programming languages. For this purpose we have introduced a new research discipline called Arbology, a generalization of Stringology, that deals with tree structures. We use the pushdown automaton as our computational model since the linear notions of trees, being generated by context-free grammars, are in fact context-free languages. The aim of this research is to

Tools for Automatizing the Quality Assurance in Large Business Intelligence Systems and Data Warehouses

Program
Programme of applied research and experimental development ALFA
Provider
Technology Agency of the Czech Republic
Code
TA03010964
Period
2013 - 2016
Description
Environments of BI Systems and DWHs consist of thousands of program artefacts. An automatized management of a large amount of the artefacts requires highly powerful tools, which have been lacking in the market. Using recent theoretical results and long-term practical experiences with a maintenance of environments based on SQL and XML standards, we will create tools that will represent a revolutionary change in the area of the current tools of Quallity Assurance, wherein large financial means have been invested.

Tree pattern matching and indexing trees

Program
Studentská grantová soutěž ČVUT
Code
SGS12/092/OHK3/1T/18
Period
2012
Description
With the vast amount of data needed to be archived, indexed and procesed, special data structures are required. The tree is a typical data structure which is used very often for hierarchically storing data. Specialized algorithms are needed for indexing tree data structures and also accessing, extracting and analyzing data stored in them. Examples of applications can be the optimization of abstract syntax trees in the process of compiling, term-rewriting, mechanical theorem proving, searching in phylogenic trees, indexing the secondary structure of RNA, searching in XML documents and evaluating the source code of functional programming languages. For this purpose we have introduced a new research discipline called Arbology, a generalization of Stringology, that deals with tree structures. We use the pushdown automaton as our computational model since the linear notions of trees, being generated by context-free grammars, are in fact context-free languages. The aim of this research is to

User interface generation through a code-inspection driven development

Program
Studentská grantová soutěž ČVUT
Code
SGS12/147/OHK3/2T/13
Period
2012 - 2013
Description
User interface part of software application development is considered time consuming (according to recent research up 50% of the total time is devoted to user interface). Code fragment of the interface part are often complex and hard to read. This is because the interface combines multiple cross-cutting concerns such as security, presentation, layout, validation or contextual help. Besides the presentation aspect itself are here captured information, that already exist elsewhere in the application. This makes the development and maintenance hard. The goal of our project is to minimize manual work that is related to user interface development. We suggest that machine driven code inspection is applied to transform the information to user interface. This allows us to separate cross-cutting interface concerns and manage these individually. All concerns can be combined and weaved together in the transformation process. Besides optimization of the interface code the interface can be generat