Research Projects (2010)
- Show Keywords
- 3D Computer Vision 3D reconstruction Aerial Vision Augmented Reality Augmented Video Best Paper Award Biometrics Caleydo Computational Photography Computer Graphics Computer Vision Convex Optimization Coordinate transformations detection face Fingerprint Georeferencing GPU GUI HOG Human Computer Interaction Image Labelling Industrial Applications Information Visualization integral imaging Interaction Interaction Design Machine Learning Medical computer vision Medical Visualization Mixed Reality Mobile computing Mobile phone Model Multi-Display Environments Multiple Perspectives Object detection Object recognition Object reconstruction Object Tracking On-Line Learning Robotics Segmentation Shape analysis shape from focus SLAM Software Projects Structure from Motion Surveillance SVM Symmetry Tracking Fusion Tracking, Action Recognition User Interfaces Variational Methods Virtual reality and augmented reality Visual Tracking Visualization
| Title |
Abstract
|
Start | End |
|---|---|---|---|
|
AR4DOC - Augmented Reality for Document Inspection
(details) |
Smartphones have evolved considerably in processing power over the last years. They now feature multi-core CPUs as well as GPUs and consumer-quality cameras up to HD resolution. This makes them an interesting platform for graphics and vision and opens new opportunities for research. The aim of AR4DOC is to facilitate the task of document inspection by a human operator. This requires the person to have detailed knowledge about the nature of a document, which may be outdated or even unavailable at the time of inspection. We seek to provide this information in an interactive way using Mobile Augmented Reality (AR), so that a well-grounded decision on the vailidity of a document is possible. This involves several tasks such as document localization, recognition, tracking, presentation as well as interaction.
|
2010 | 2013 |
|
HD-VIP: High Definition Video Processing
(details) |
The growth of information is nowadays enormous and at a level which had never been reached before. We currently produce almost more data in one year than was produced in the entire history of mankind so far. In particular the trend to a full digitization of audiovisual content is contributing to this explosion of available material. The exponential growth of online video, most notably YouTube among the many prominent video portals is just one example for that. Even if international studies are not arriving at exactly the same results, the figures are impressive: digital production in 2006 was approximately 160 Exabyte, and is predicted to rise to 990 Exabyte in 2010. Any video processing /editing software has to keep pace with these extraordinary data rates which requires special efforts from the hardware and the software. Fortunately we see also an extraordinary increase in processing power, especially when looking at recent developments of graphics cards (GPUs). These cards offer massive parallelism (ideally suited for video processing) at a rather modest price. All these facts make this hardware an ideal candidate for video processing. But in order to make full use of the hardware the algorithms have to be highly parallel. Typical tasks encountered in video processing (which will also be tackled by the proposed project are): Superresolution: With the advent of HDTVs in many homes there is an increasing need to produce also HDTV content. In order to make use of existing (low-resolution) material one can use so called superresolution algorithms. These methods generate from a sequence of low resolution frames a high resolution image by exploiting the high interframe redundancy. Denoising: There are many sources of noise in a video, either the material is historic or during production/compression etc. noise is added to the video. A basic task is to remove the noise but still preserve all fine scale details. Interactive video editing: For post production purposes one wants to mark objects in a video (of course the object should only be marked in a single frame and then segmented automatically in all subsequent frames) and either remove them (which requires inpainting methods to fill the holes with meaningful content), place them somewhere else in the video or replace them with different objects. Since these tasks are done interactively this requires interactive framerates. Fortunately all of these tasks can be addressed by so called variational methods. The basic idea is to formulate the task as a minimization problem of a suitable energy functional. Besides other desirable properties these methods can be implemented in a highly parallel fashion which makes them ideal candidates for implementation on modern GPUs. |
2010 | 2012 |
|
Higher Order Variational Methods
(details) |
This research project is devoted to the study of higher order convex variational methods for problems in computer vision. First order methods, i.e. methods which take into account first order derivatives have shown a great success for a variety of inverse computer vision problems. This success is mostly due to the introduction of total variation methods by Rudin, Osher and Fatemi in 1992. Total variation methods exhibit the important property to preserve sharp discontinuities in the solution while the associated optimization problem is still convex. This leads to robust problem solutions, independent of any initialization. Besides this, total variation methods also exhibit some disadvantages. First, total variation methods favor piecewise constant solutions which leads to staircaising artifacts in image restoration problems and to the preference of fronto‐parallel structures in stereo problems. Second, total variation methods introduce a shrinking bias in shape optimization problems. The aim of this project is therefore to study higher order convex variational methods in order to improve the shortcomings of first order methods. We therefore propose to investigate two approaches. The first approach is based on the so‐called generalized total variation method, recently introduced by Bredies, Kunisch and Pock. It provides a framework to recover piecewise polynomial functions based on a convex functional. We expect that this method leads to significant improvements of stereo and motion estimation problems. The second approach is based on the so‐called roto‐translation space introduced by Citti and Sarti in 2006. It allows to rewrite functionals incorporating curvature regularity by means of a convex first order functional in higher dimensions. We expect that this approach will significantly improve the performance of various shape optimization problems. |
2010 | 2013 |
|
Highly accurate range computation in driver assistence systems
(details) |
In this project we study variational methods for computing highly accurate range data in driver assistance systems. |
2010 | 2011 |
|
Mobi-Trick
(details) |
The focus of the project is outdoor mobile computer vision with all of its challenges. Mobile systems need to be compact and energy efficient and are frequently changing locations. Therefore they must be autonomous and perform processing locally. A number of challenges arise from these requirements for which the project aims to provide solutions: Being compact, there is not much space for a large number of sensors such as laser scanners, radar antennas and the like. The work in this project will focus on stereo vision but with two different types of cameras. Often a second camera is already available and stereo information increases detection accuracies. Each time the system moves it needs to adapt to the changing situation. This requires adaptive calibration and online learning. Mobile systems often work from batteries. In addition, there is not much space to include intricate cooling systems. Thus, the system must be designed to be very energy efficient. New approaches for dynamic power management will be explored in the project. To put the work into context, several applications from the area of traffic surveillance/toll enforcement will be implemented and tested in an application oriented setting. Current traffic enforcement solutions are either very large and costly (section control, toll enforcement) or do not offer much in terms of image processing (radar speed control). The technological output of Mobi Trick makes it possible to design mobile solutions for traffic monitoring, vehicle identification and classification, intelligent incident detection and observation of driver behavior. Mobile devices are also more efficient in enforcement. Their transient nature makes them less predictable. Mobile systems can also react more flexibly to changing road situations such as construction sites. |
2010 | 2013 |
|
PEGASUS: Autonomous Inspection of Overhead Power Lines using an Unmanned Aerial Vehicle
(details) |
The aim of the PEGASUS project is to develop a mobile vision system for overhead power line inspection to be mounted on an unmanned aerial vehicle (UAV). The long term goal is to develop a fully autonomous aerial vehicle which is able to perform power line inspection in an automated manner. This goal requires innovative solutions to a number of problems such as visual navigation, visual tracking and obstacle detection, model-based inspection under harsh conditions etc. In addition, due to the use of a small scale UAV (e.g. a quad-rotor helicopter) we have restricted computational resources for algorithms that need to be executed on the UAV (especially for navigation and tracking). Within PEGASUS we want to make significant progress towards this long term goal. In particular, PEGASUS will provide a set of tools for the inspector. The project is organized in four phases: First, an inspection system for a single power tower is developed. Used in ground-based inspection, the UAV provides close-up views of all points of interest from an optimal viewpoint. Second, we want to implement an automatic visual inspection system which highlights possible faulty components. In a third step, the system is extended towards multiple towers (still in the sight of the operator). Finally, the system will be used as a handheld system in manned helicopters by power line inspectors, where it should dramatically reduce the time needed for inspection. From a research perspective we will develop novel solutions for model-based recognition and pose estimation, visual navigation including obstacle avoidance and automated model-based visual inspection. All of these problems are extremely challenging because of the uncontrolled conditions (illumination etc.) and the real-time requirements. If successful, the methods developed in PEGASUS will be usable beyond the task of power line inspection. |
2010 | 2013 |
|
Smart Reality - Innovation Network for Smart Applications and Media
(details) |
The market for mobile media services will expand significantly in the next years. The explosion in the usage of smartphones and the growth of the application store model to sell individual services to smartphone users opens a new and attractive market for developers of simple, useful applications. New revenue streams can be created by in-application one-click purchasing. Aggregation of a camera on Internet-connected smartphones leads to the possibility of having a live video stream of the user's reality augmented by content and services from the Web. Location-based services and augmented reality are seen as potential killer applications of the mobile Internet because users are enabled to access additional information related to where they are, what they are seeing, or what they are doing, as well as instantly purchase related services and content.
For example, instead of just seeing a street poster for a club night and passing by, this new paradigm opens up instant access via the Internet-enabled smartphone to the club‘s location, purchasing an entrance ticket or listening to/buying the DJ mixes. A new co-operation net-work – the Innovation Network for Smart Applications and Media - will bring key Austrian R&D and innovative SMEs together to make real this new paradigm for smart mobile and media applications which we call Smart Reality, and be the first to benefit commercially from it. |
2010 | 2012 |
