An Image-Based Approach to Visual Feature Space Analysis
2008, Schreck, Tobias, Schneidewind, Jörn, Keim, Daniel A.
Methods for management and analysis of non-standard data often rely on the so-called feature vector approach. The technique describes complex data instances by vectors of characteristic numeric values which allow to index the data and to calculate similarity scores between the data elements. Thereby, feature vectors often are a key ingredient to intelligent data analysis algorithms including instances of clustering, classification, and similarity search algorithms. However, identification of appropriate feature vectors for a given database of a given data type is a challenging task. Determining good feature vector extractors usually involves benchmarks relying on supervised information, which makes it an expensive and data dependent process. In this paper, we address the feature selection problem by a novel approach based on analysis of certain feature space images. We develop two image-based analysis techniques for the automatic discrimination power analysis of feature spaces. We evaluate the techniques on a comprehensive feature selection benchmark, demonstrating the effectiveness of our analysis and its potential toward automatically addressing the feature selection problem.
Semiautomatic benchmarking of feature vectors for multimedia retrieval
2007, Schreck, Tobias, Schneidewind, Jörn, Keim, Daniel A., Ward, Matthew O., Tatu, Andrada
Modern Digital Library applications store and process massive amounts of information. Usually, this data is not limited to raw textual or numeric data - typical applications also deal with multimedia data such as images, audio, video, or 3D geometric models. For providing effective retrieval functionality, appropriate meta data descriptors that allow calculation of similarity scores between data instances are requires. Feature vectors are a generic way for describing multimedia data by vectors formed from numerically captured object features. They are used in similarity search, but also, can be used for clustering and wider multimedia analysis applications. Extracting effective feature vectors for a given data type is a challenging task. Determining good feature vector extractors usually involves experimentation and application of supervised information. However, such experimentation usually is expensive, and supervised information often is data dependent. We address the feature selection problem by a novel approach based on analysis of certain feature space images. We develop two image-based analysis techniques for the automatic discrimination power analysis of feature spaces. We evaluate the techniques on a comprehensive feature selection benchmark, demonstrating the effectiveness of our analysis and its potential toward automatically addressing the feature selection problem.
Trajectory-based visual analysis of large financial time series data
2007, Schreck, Tobias, Tekusova, Tatiana, Kohlhammer, Jörn, Fellner, Dieter
Visual Analytics seeks to combine automatic data analysis with visualization and human-computer interaction facilities to solve analysis problems in applications characterized by occurrence of large amounts of complex data. The financial data analysis domain is a promising field for research and application of Visual Analytics technology, as it prototypically involves the analysis of large data volumes in solving complex analysis tasks.
We introduce a Visual Analytics system for supporting the analysis of large amounts of financial time-varying indicator data. A system, driven by the idea of extending standard technical chart analysis from one to two-dimensional indicator space, is developed. The system relies on an unsupervised clustering algorithm combined with an appropriately designed movement data visualization technique. Several analytical views on the full market and specific assets are offered for the user to navigate, to explore, and to analyze. The system includes automatic screening of the potentially large visualization space, preselecting possibly interesting candidate data views for presentation to the user. The system is applied to a large data set of time varying 2-D stock market data, demonstrating its effectiveness for visual analysis of financial data. We expect the proposed techniques to be beneficial in other application areas as well.