Research Projects
- Show Keywords
- 3D Computer Vision 3D reconstruction Aerial Vision Augmented Reality Augmented Video Best Paper Award Biometrics Caleydo Computer Graphics Computer Vision Convex Optimization Coordinate transformations detection face Fingerprint Georeferencing GPU GUI HOG Human Computer Interaction Image Labelling Industrial Applications Information Visualization integral imaging Interaction Interaction Design Machine Learning Medical computer vision Medical Visualization Mixed Reality Mobile computing Mobile phone Model Multi-Display Environments Multiple Perspectives Object detection Object recognition Object reconstruction Object Tracking On-Line Learning Robotics Segmentation Shape analysis shape from focus SLAM Software Projects Structure from Motion Surveillance SVM Symmetry Tracking Fusion Tracking, Action Recognition User Interfaces Variational Methods Virtual reality and augmented reality Visual Tracking Visualization
| Title | Abstract |
Start
|
End |
|---|---|---|---|
|
AUGUR: portable AR visualization of structure within structure using high precission detection
(details) |
This project aims to develop portable measurement tools with in-situ visualization for the construction industry. A future measurement tool will provide a direct augmented reality view of measured properties over the real environment together with instructions as to where and how a certain task can be completed. For example, a metal detection tool should be able to provide direct visual feedback on the location of hidden metallic structures over a live video view of the inspected wall area. Furthermore it can guide a construction engineer to the optimal position for drilling a hole, avoiding any damage to existing structures. Thus the tools should combine information from several sources to provide interactive and contextaware guidance: Measurements from built-in sensors; location-aware through online tracking and registration; spatial, semantic information retrieved from a building information system (BIM). At the same time, future tools need to be simple to be used by non-expert users; therefore the system needs to be intuitive and guide users in the correct operation to fulfil their tasks. To accomplish this goal, The project addresses the following challenges:
|
2012 | 2013 |
|
CONSTRUCT: Construction Site Monitoring and Change Detection using UAVs
(details) |
The goal of the project is to develop methods for modeling and surveying large construction sites. The project will make use of unmanned aerial vehicles and existing stationary or pan-tilt zoom cameras at the construction site. The goal is to provide accurate 3D models on a regular basis of the whole site. This will generate a 4D data set (3D+time). This data can then be used for documentation, visualization (we will use a mobile augmented reality system to overlay e.g. the plan or a model of the building) as well as measurement (e.g., how much material has been transported). From a scientific point of view we will have to solve following tasks:
|
2011 | 2014 |
|
HOLISTIC: Holistic Aerial Scene Understanding Using Highly Redundant Data
(details) |
The aim of this research project is holistic scene understanding in large aerial datasets, consisting of thousands of massively redundant high-resolution images. Holistic scene understanding is one of the major problems in computer vision and photogrammetry and has recently got a lot of attention. The problem of holistic image understanding includes two fundamental tasks: 3D scene reconstruction and semantic interpretation of the imaged content at the level of pixels. The tight interaction between semantic classification and 3D reconstruction is often ignored by state of the art aerial image processing workflows, due to the lack of computational power, the absence of efficient algorithms or the enormous effort of manual intervention. However, these tasks are mutually informative and should be solved jointly as a correct class labelling is a valuable source of information for reconstruction, and 3D information can help to improve the semantic interpretation. For instance, a correct classification is a valuable source of information for reconstruction in regions where dense matching methods fail (e.g. sheets of water and reflecting windows / facades), and 3D information can be used as a prior to improve classification (e.g. building and road detection). The high resolution and redundancy due to large overlaps of aerial images requires massive processing power which will be handled by taking advantage of graphic processing units that have proved to give a significant speedup compared to single core machines. In particular, we will focus on algorithms based on variational methods, which provide a high degree of parallelization capability. In order to reduce cost-intensive manual interaction, we further will exploit publicly available user-data from the Internet to improve both interpretation and 3D reconstruction. In the HOLISTIC project we will provide a flexible framework for scene classification and 3D reconstruction from aerial images that outperforms current state-of-the art and delivers interpretable models at highest possible accuracy. To achieve this goal, we will focus our attention on the following two research subjects: (i) the joint optimization of geometry and semantic classification from aerial images in a unified framework, and (ii) the exploitation of existing geographic information systems and web data to support these two sub-tasks. In addition, we will use web-based standard to efficiently represent the obtained results for fast modeling and data parsing. |
2011 | 2014 |
|
PEGASUS: Autonomous Inspection of Overhead Power Lines using an Unmanned Aerial Vehicle
(details) |
The aim of the PEGASUS project is to develop a mobile vision system for overhead power line inspection to be mounted on an unmanned aerial vehicle (UAV). The long term goal is to develop a fully autonomous aerial vehicle which is able to perform power line inspection in an automated manner. This goal requires innovative solutions to a number of problems such as visual navigation, visual tracking and obstacle detection, model-based inspection under harsh conditions etc. In addition, due to the use of a small scale UAV (e.g. a quad-rotor helicopter) we have restricted computational resources for algorithms that need to be executed on the UAV (especially for navigation and tracking). Within PEGASUS we want to make significant progress towards this long term goal. In particular, PEGASUS will provide a set of tools for the inspector. The project is organized in four phases: First, an inspection system for a single power tower is developed. Used in ground-based inspection, the UAV provides close-up views of all points of interest from an optimal viewpoint. Second, we want to implement an automatic visual inspection system which highlights possible faulty components. In a third step, the system is extended towards multiple towers (still in the sight of the operator). Finally, the system will be used as a handheld system in manned helicopters by power line inspectors, where it should dramatically reduce the time needed for inspection. From a research perspective we will develop novel solutions for model-based recognition and pose estimation, visual navigation including obstacle avoidance and automated model-based visual inspection. All of these problems are extremely challenging because of the uncontrolled conditions (illumination etc.) and the real-time requirements. If successful, the methods developed in PEGASUS will be usable beyond the task of power line inspection. |
2010 | 2013 |
|
Highly accurate range computation in driver assistence systems
(details) |
In this project we study variational methods for computing highly accurate range data in driver assistance systems. |
2010 | 2011 |
|
Narkissos - Virtual Dressing Room
(details) |
The main goal of NARKISSOS is to develop the next generation “magic mirror“ to be installed in a dressing room of a fashion store. The magic mirror is a technical multimedia system, where the consumer can watch himself on a video wall dressed by the clothes which are chosen by touch board or which he did register per RFID tag (embedded in the clothing) at a RFID reader stationed near the video wall of the virtual dressing room. Users can interactively change shape and appearance of the clothing in the mirror image without actually having to change cloths. Customers can also observe themselves (i.e., their avatar) from every side instantaneously. | 2009 | 2012 |
|
CityFit: High-Quality Urban Reconstructions by Fitting Shape Grammars to Images and derived Textured Point Clouds
(details) |
The generation of realistic 3D models of whole cities has become a vibrant and highly competitive market through the recent activities of, most notably, Goggle Earth and Microsoft Virtual Earth. While the first generation of these systems only delivered high-quality zoomable images of the ground, the current trend is heavily geared towards 3D – that is, users can access three-dimensional height- fields of the terrain, and even 3D models of individual buildings. Simple building models, basically extruded polygons with different types of roofs, can be generated today from aerial images completely automatically. This is a solved problem. Far from solved, however, is the problem of generating automatically detailed buildings with façades. Input data for this problem are registered range maps obtained by stereo matching and sequences of highly overlapping thus redundant images (taken from a car driving in the road) where each pixel has not only a color but also a depth, a z-value. Although range maps can be directly rendered in principle, the data size is huge and, more importantly, the pixels have no semantics: A priori there is no difference between a pixel on the floor, on the wall, or on a door. But these shape semantics are required by all downstream applications using the city model. Shape grammars, on the other hand, have recently become (again) a popular method in research for representing 3D buildings. Their great advantage is that they allow to parameterize buildings, which can be used for populating virtual cities with believable architectural buildings, e.g., for 3D games. The goal of the CITYFIT project is, given highly redundant input imagery and range maps from an arbitrary building in Graz, to synthesize a shape grammar that, when evaluated, creates a clean, CAD- quality reconstruction of that building that fits the original data very closely and makes the semantics of all major architectural features explicit. These shape semantics can even be transferred back to inform the original data, so each of these “semantically enriched” data points can tell whether it belongs to ground, wall, or door. |
2008 | 2010 |
|
Deskotheque - Collaborative Interaction in Multi Display Environments
(details) |
Office space usually consists of private single-user workstations. Team work takes place on separate locations, usually supported by analogue media like printed paper. Digital data exchanges is accomplished through designated channels like e-mail or instant messengers. Deskotheque is an ongoing project aiming to extend personal workspaces to enhance team work. It represents a flexible, interactive environment for team work, conference and meeting rooms. Unused surfaces in the room, such as empty wall space and table surfaces, can be turned into interactive, digital displays to be used for multi-user co-located teamwork. |
2007 | 2011 |
|
APAFA: Automated Photogrammetric Aerial Feature Analysis
(details) |
The systematic creation of models of the real world to support the locational awareness on the Internet can be achieved if previously required massive manual labor gets replaced by automated procedures. A particular challenge exists in the automation of the extraction of the 4 classical map features buildings, circulation spaces (e.g. road networks), vegetation and water bodies, as well as their interaction. Decennia of research have been unable to automate the extraction of these features from classical aerial photography towards an economically viable result. However, we believe that we can succeed in the proposed project to develop automated procedures to create feature data for three reasons. First is the recent advent of digital aerial sensors producing highly redundant digital large format aerial photography. Redundancy will be obtained by using high forward and side overlaps, say at 80% and 60%, so that every point in the terrain is imaged at least 10 times, and any algorithm can rely on multiple analysis results that then can either reinforce or cancel one another. Second, the geometric redundancy gets augmented by a radiometric redundancy using 4 spectral bands, adding an infrared band to the classical red, green and blue color channels. Third, we will combine the classical "object reconstruction" approach available from stereo procedures, by new recognition methods. While classically a "car" on a street may have been seen via a "point cloud" and would have to get recognized simply by a representation of local height anomaly on an otherwise flat reference surface, recognition includes the use of stored images of cars in a data base to actually recognize a car as a human would do when inspecting an aerial image. The project is split up into five work packages which will focus on how reconstruction and recognition techniques can help each other and how additional information either from a previous mission or GIS can be integrated in the 3D modeling framework. One work package will address the assessment of the obtained quality, another will address project management and dissemination activities. Within the project we will develop an extensive library of combined recognition/reconstruction methods, and apply them to a range of test data sets. Test data will vary in geometric resolution (pixel size), overlaps, and types of terrain scenarios. |
2007 | 2010 |
|
VM-GPU: Variational Methods on the GPU for Industrial Problems
(details) |
The project VM-GPU fits exactly to the FIT-IT Visual Computing call. It is a combination of computer vision and graphics methods to offer solutions to a problem of great relevance for industry. In particular,
The goal of VM-GPU is to make variational methods available for industrial problems by using modern graphics hardware. If successful this project will have a large impact on the machine vision industry, it will allow for the first time to use variational methods in an industrial setting, in addition having graphics cards available as computing platforms will offer completely new ways of addressing industrial vision problems (e.g., it is very easy to scale up by just using a second graphics card). |
2007 | 2009 |
|
3D Reconstruction of Electrical Impulse Discharges
(details) |
Electrical impulse discharges in nature are visible as lightning. Their impact point can be electro magnetically located up to a precision of several hundred meters. In some restricted areas such as industrial plants, airports etc. it desirable to know the the impact region and path of the lightning up to a precision of a few meters. If visibility is not too restricted by weather conditions, a multi-camera setup would be a viable option to locate path and impact area of the discharge. In this project impulse discharges of a few meters are synthetically generated under laboratory conditions and reconstructed using a multi camera setup. |
2007 | 2008 |
|
A Low-Cost System for Automatic People Tracking in a Labyrinth
(details) |
After medical treatment of visually handicapped people it is desirable to evaluate the benefit of the treatment for the patient. Especially the capability of the patient to orient himself in a three-dimensional environment, to navigate and recognize obstacles is of interest. For a clinical evaluation under controlled circumstances a labyrinth has been built through which the patient ha to navigate. Obstacles may be randomly placed in the labyrinth. A multi-camera system keeps track of the patients movements and extracts parameters such as position, speed, head rotation etc. |
2006 | 2007 |
|
Measuring Material Displacement on Steel Specimens During Strain Tests
(details) |
Deformation analysis of material surfaces is a crucial part of material testing and quality control. In this project a stereoscopic surface measurement system has been developed which allows to measure surface deformation during stress/strain tests over a wide range of fields of view. Stereo image pairs are acquired at specified time instances which makes it possible to synchronize the acquisition with the amount of stress applied. |
2005 | 2006 |
|
Vision Based Kinematic Calibration and Error Compensation of Articulated Robot Arms
(details) |
Development of a vision system for accurate calibration of the kinematic chain of an articulated robot arm. The absolute positioning error of articulated robot arms is typically by an order of ten higher than their repeatability error. Inaccurate blueprint kinematic models typically account for 90% of this discrepancy. In this work a calibration procedure is developed which calibrates the kinematic model of a robot arm using fixed stereo rig and a calibration target mounted on the robot hand. In a single calibration framework the following parameters are automatically determined:
The procedure is fully automatic and does not require expensive, precalibrated equipment. |
2005 | 2006 |
|
Vision Based Detection of Wrong-Way Drivers
(details) |
Vehicles driving the wrong way down a motorway represent a serious source of danger. In the year 2005 in Austria, 521 vehicles were counted driving in the wrong direction on motorways. 8 people died due to wrong-way driver accidents. Immediate detection of a vehicle driving in the wrong direction could help preventing serious accidents by warning the oncoming vehicles (via traffic telematic systems or radio announcements) and by alarming the police. To guarantee immediate detection, one would have to observe every access ramp and every place where a car could turn. Thus a low-cost monitoring system capable of detecting wrong-way drivers and forwarding an alarm to a central traffic control station is needed. Image based detection suffers from one main problem: In case a large vehicle (truck, bus etc.) throws a shadow to the opposite lane of a motorway, that shadow moves against the direction of traffic and may cause a false detection. With a monocular camera system (one single camera), this problem can only be eliminated by large computational costs. |
2005 | 2007 |
|
Dust Filter Surface Analysis
(details) |
When a dust laden gas is sucked through a filter, the dust remains on its surface and forms a compact dust layer called filter cake. Periodically the filter cake is at least partially removed by inverse high pressure air pulses to allow continuous operation of the filter. Knowledge of the distribution of the filter cake on the filter surface at different stages of operation is decisive for filter operation. It is shown that the principle of calibrated shape from stereo with pattern projection for generating texture on the surface gives a robust 3D reconstruction of the filter surface when accessing through a glass window. Rigid registration of surface patches using landmark points, combined with an Iterative Closest Point Algorithm as a refinement procedure gives a continuous 3D model of the entire visible filter surface. Cake thickness is calculated by taking the height difference of two surface models, acquired before and after dust deposition. The challenging problem is to account for the non-rigid deformation of the filter cloth. A global deformation model is estimated using Thin Plate Spline Interpolation based on landmark points on the filter surface. |
2004 | 2005 |
|
Uncalibrated Euclidean Scene Reconstruction in Scanning Electron Microscopy Using the Trifocal Tensor
(details) |
The scanning electron microscope (SEM) is an important tool to examine very small structures. Its large magnification combined with good contrast and large depth of view make it possible to view and characterize microscopic structures in the sub-micron scale. In the recent years, the problem of dense surface reconstruction from multiple SEM images was a research topic on this institute. Reconstruction approaches like shape from stereo and shape from photometric stereo have been evaluated. This work presents a framework for automatic scene reconstruction from three images acquired by a scanning electron microscope. The basic assumption is that the specimen is tilted eucentrically in front of the camera, camera geometry is assumed to be unknown but constant over all views. It is shown that methods for estimating the trifocal tensor as well as modern auto-calibration approaches can be adapted to the imaging conditions in the SEM, and Euclidean scene structure can be retrieved from three uncalibrated views. The performance of the proposed framework is evaluated on synthetic data as well as real images. It is shown that Euclidean scene structure can be retrieved robustly under varying image noise and inaccurate initialization. |
2002 | 2003 |
|
Large Format Digital Camera
(details) |
Vexcel Imaging Austria has invented a novel concept for large format digital cameras and has submitted the innovation for patent protection. An instantaneous exposure with for example 10K x 10K color pixels can be obtained with the new technology at data rates of 2 images per second (producing a data rate in excess of 2 gigabits per second). A number of scientifically interesting topics exists, for example in cases where such a camera is put into a moving platform such as a satellite, an airplane or a car. Vexcel Imaging is sponsoring a research project at the Institute to study some of these questions, addressing high precision image matching and mosaicking, the creation of color images from high-resolution black&white images in combination with lower resolution rgb-images, removal of motion blur, strategies for on-line data compression to reduce data rates, imaging strategies, the study of sample applications for such on-line large image data sets etc. Partner: |
2001 | 2003 |
|
Mobile Robotics
(details) |
Research topics on mobile robotics: -Visual localisation and map building -Usage of local visual landmarks -Localisation using omnidirectional camera |
2001 | 2005 |
|
3-Dimensional Measurement and Virtual Reconstruction of Ancient Lost Worlds of Europe (MURALE)
(details) |
Tools will be developed for the reconstruction of a currently excavated archaeological site for different periods throughout the time of its occupation. This includes the development of 3D acquisition systems that can measure a range of objects of different dimensions to produce accurate and realistic looking 3D models and of tools for virtual reconstruction, documentation and classification of objects ranging from pottery sherds to entire urban building complexes. The project will provide archaeologists with the tools to support the analysis and restoration of their finds. The virtual artefacts and buildings together with the related information will be stored in a multimedia database with appropriate searching and querying functionality. 3D visualisation technologies will be developed for the interactive presentation of archaeological results in museums and via the internet, to the broad public as well as to experts. To provide Europe with an archaeological lead. Partner: Departement Elektrotechniek ESAT-PSI, Katholieke Universiteit Leuven |
2000 | 2003 |
