Research Projects
- Show Keywords
- 3D Computer Vision 3D reconstruction Aerial Vision Augmented Reality Augmented Video Best Paper Award Biometrics Caleydo Computer Graphics Computer Vision Convex Optimization Coordinate transformations detection face Fingerprint Georeferencing GPU GUI HOG Human Computer Interaction Image Labelling Industrial Applications Information Visualization integral imaging Interaction Interaction Design Machine Learning Medical computer vision Medical Visualization Mixed Reality Mobile computing Mobile phone Model Multi-Display Environments Multiple Perspectives Object detection Object recognition Object reconstruction Object Tracking On-Line Learning Robotics Segmentation Shape analysis shape from focus SLAM Software Projects Structure from Motion Surveillance SVM Symmetry Tracking Fusion Tracking, Action Recognition User Interfaces Variational Methods Virtual reality and augmented reality Visual Tracking Visualization
| Title | Abstract |
Start
|
End |
|---|---|---|---|
|
Diagnostik der Tumorheterogenität – ein neuer Steuerfaktor für die Therapie des Dickdarmkarzinoms?
(details) |
Das Kolonkarzinom ist weltweit eine der häufigsten Krebserkrankungen, die trotz Fortschritte in der Behandlung nach Ausbildung von Metastasen fast immer zum Tod führt. Gemäß internationalen Standards ist derzeit die pathohistologische Untersuchung entscheidend für das therapeutische Vorgehen. Für Patienten in fortgeschrittenen Tumorstadien wurden kürzlich Therapien verfügbar, die auf den Mutationsstatus des Tumors ausgerichtet sind, jedoch eine mögliche Tumorheterogenität nicht berücksichtigen. Derzeit nicht detektierte Tumorklone werden für das oft fehlende Therapieansprechen und die Tumorprogression verantwortlich gemacht. Das beantragte Projekt soll durch Anwendung neuer sensorischer Verfahren zur kosteneffizienten und verlässlichen Bestimmung der genetischen Diversität von Dickdarmkarzinomen beitragen. Mittels statistischer Verfahren und bioinformatischer Analyse der genetischen Profile werden die Häufigkeit sowie die prognostische Bedeutung der Tumorheterogenität für das biologische Verhalten der Tumore sowie ihr Ansprechen auf spezifische onkologische Therapien ermittelt. Durch spezielle, an der TUG entwickelte Visualisierungstechniken wird die erhobene Datenfülle für Pathologen und klinische Onkologen verständlich und verwertbar gemacht. Eine umfassende genetische Tumoranalyse setzt das vollinhaltliche Einverständnis des Patienten voraus, welches untrennbar mit dem Verständnis und der Zustimmung zu den hierzu verwendeten Methoden verbunden ist. Ein weiteres Projektziel ist daher die Untersuchung von Erwartungen und Hoffnungen aber auch von Vorbehalten bzw. Befürchtungen, die in die Beratung und Aufklärung des Patienten Eingang finden sollen und die die unterschiedlichen Einstellungen der Patienten zu den diagnostischen Verfahren berücksichtigen. Diese neuen Diagnoseverfahren werden ein Ansprechen auf eine Therapie wesentlich gezielter voraussagen können als die derzeitigen Methoden, den Patienten Nebenwirkungen unwirksamer Medikamente ersparen und damit nicht zuletzt zu einer Kostenreduktion im Gesundheitssystem beitragen. | 2012 | 2014 |
|
AUGUR: portable AR visualization of structure within structure using high precission detection
(details) |
This project aims to develop portable measurement tools with in-situ visualization for the construction industry. A future measurement tool will provide a direct augmented reality view of measured properties over the real environment together with instructions as to where and how a certain task can be completed. For example, a metal detection tool should be able to provide direct visual feedback on the location of hidden metallic structures over a live video view of the inspected wall area. Furthermore it can guide a construction engineer to the optimal position for drilling a hole, avoiding any damage to existing structures. Thus the tools should combine information from several sources to provide interactive and contextaware guidance: Measurements from built-in sensors; location-aware through online tracking and registration; spatial, semantic information retrieved from a building information system (BIM). At the same time, future tools need to be simple to be used by non-expert users; therefore the system needs to be intuitive and guide users in the correct operation to fulfil their tasks. To accomplish this goal, The project addresses the following challenges:
|
2012 | 2013 |
|
CONSTRUCT: Construction Site Monitoring and Change Detection using UAVs
(details) |
The goal of the project is to develop methods for modeling and surveying large construction sites. The project will make use of unmanned aerial vehicles and existing stationary or pan-tilt zoom cameras at the construction site. The goal is to provide accurate 3D models on a regular basis of the whole site. This will generate a 4D data set (3D+time). This data can then be used for documentation, visualization (we will use a mobile augmented reality system to overlay e.g. the plan or a model of the building) as well as measurement (e.g., how much material has been transported). From a scientific point of view we will have to solve following tasks:
|
2011 | 2014 |
|
HOLISTIC: Holistic Aerial Scene Understanding Using Highly Redundant Data
(details) |
The aim of this research project is holistic scene understanding in large aerial datasets, consisting of thousands of massively redundant high-resolution images. Holistic scene understanding is one of the major problems in computer vision and photogrammetry and has recently got a lot of attention. The problem of holistic image understanding includes two fundamental tasks: 3D scene reconstruction and semantic interpretation of the imaged content at the level of pixels. The tight interaction between semantic classification and 3D reconstruction is often ignored by state of the art aerial image processing workflows, due to the lack of computational power, the absence of efficient algorithms or the enormous effort of manual intervention. However, these tasks are mutually informative and should be solved jointly as a correct class labelling is a valuable source of information for reconstruction, and 3D information can help to improve the semantic interpretation. For instance, a correct classification is a valuable source of information for reconstruction in regions where dense matching methods fail (e.g. sheets of water and reflecting windows / facades), and 3D information can be used as a prior to improve classification (e.g. building and road detection). The high resolution and redundancy due to large overlaps of aerial images requires massive processing power which will be handled by taking advantage of graphic processing units that have proved to give a significant speedup compared to single core machines. In particular, we will focus on algorithms based on variational methods, which provide a high degree of parallelization capability. In order to reduce cost-intensive manual interaction, we further will exploit publicly available user-data from the Internet to improve both interpretation and 3D reconstruction. In the HOLISTIC project we will provide a flexible framework for scene classification and 3D reconstruction from aerial images that outperforms current state-of-the art and delivers interpretable models at highest possible accuracy. To achieve this goal, we will focus our attention on the following two research subjects: (i) the joint optimization of geometry and semantic classification from aerial images in a unified framework, and (ii) the exploitation of existing geographic information systems and web data to support these two sub-tasks. In addition, we will use web-based standard to efficiently represent the obtained results for fast modeling and data parsing. |
2011 | 2014 |
|
Caleydoplex- Information Exploration in Teams
(details) |
Critical decisions involving a lot of data are rarely made by a single person, but are rather discussed and evaluated by a team of experts. Examples are doctors deciding for treatment of severe illness, emergency services having to react to ongoing crises, or engineers collaborating to make technical decisions concerning expensive products. These activities can be assisted by information visualization tools. However, traditional information visualization rarely considers the collaborative nature of data analysis tasks. The foundation of our research proposal is the extension of a multiple view visualization system to a multi-display environment. Multiple view visualization shows data in different representations and thereby accommodates for different knowledge backgrounds and user preferences. Multi-display environments turn unused wall and table spaces into interactive surfaces using off-the-shelf projection technology and integrate private workstations smoothly into this shared interactive workspace. Our research aim is the design and creation of a co-located collaborative information visualization workspace dealing with two principal challenges: display space management and collaborative interaction techniques. Intelligent display space management adopts information visualizations and placement of views automatically to the physical display properties and supports the users interacting with the environment. Combined with visual linking of related data entities distributed across the environment, it will help to establish a common knowledge ground. Collaborative interaction techniques are required to organize such a rich, but potentially complex environment. We will investigate high-level activity support for typical tasks in shared information workspaces and how users can maintain awareness of each other’s activities. The proposed research benefits from two ongoing projects at Graz University of Technology: Deskotheque delivers the basic technology necessary for collaborative work in multi-display environments, while Caleydo, a visualization project from the biomedical domain, provides an excellent use case, including the necessary experts willing to collaborate in studies. Using these frameworks, we plan to conduct several usability studies, with prototypes of different levels of sophistication. This research is part of the project Caleydo. |
2011 | 2014 |
|
Managed Volume Processing (MVP)
(details) |
Volumetric data is very common in medicine, geology or engineering, but the high complexity in data and algorithms has prevented widespread use of volume graphics. Recently, however, 3D image processing and visualization algorithms have been parallelized and ported to graphics processing units (GPUs). This proposal is concerned with new ways of designing volume graphics algorithms for the GPU that can interactively cope with these huge problems by better utilization of GPU capacity. Unfortunately, only certain parts of common image or volume processing algorithms can be mapped to the standard GPU stream processing model. For most real-world problems, writing programs for this architecture is a tedious task. As a result, most algorithms use the available processing power only for small subtasks -- the number crunching in inner loops. For example, direct volume rendering (DVR) methods send rays into a volumetric object, accumulate intensities, divide rays into sub-rays, scatter rays in materials and/or extract certain features. All GPU implementations of DVR use one processing unit for one pixel, regardless of whether the pixel will require very complex calculations or not. This strategy frequently leads to strong load imbalances. A particular problem of interactive applications such as volume graphics is that they are not traditional number crunching tasks, which only require optimal computational throughput, while having relaxed or no constraints concerning latency. On the contrary, interactive applications demand meeting real-time deadlines to ensure interactive response. This is a classical real-time resource scheduling problem. It can only be achieved by adaptive algorithms that rely on complex flow control and memory management decisions during the parallel execution. Both is currently only available on the CPU, which allows access to privileged mode through the operating system. On the GPU, components for high level scheduling involving latency hiding and memory management are missing or inaccessible. The desired full utilization of the GPU is very difficult to achieve for complex graphics algorithms with real-time demands. Building a toolset that allows harvesting the full GPU power for a general class of real-time volume graphics algorithms is the main goal of this proposal. We propose a managed volume processing system that incorporates the missing components. Its key modules are a task model, a workload scheduler with real-time capabilities and a virtual memory management system executed in tandem on the GPU and CPU. We will rely on the most recent hardware developments and use OpenCL as the standardized interface to access them. | 2011 | 2014 |
|
Smart Reality - Innovation Network for Smart Applications and Media
(details) |
The market for mobile media services will expand significantly in the next years. The explosion in the usage of smartphones and the growth of the application store model to sell individual services to smartphone users opens a new and attractive market for developers of simple, useful applications. New revenue streams can be created by in-application one-click purchasing. Aggregation of a camera on Internet-connected smartphones leads to the possibility of having a live video stream of the user's reality augmented by content and services from the Web. Location-based services and augmented reality are seen as potential killer applications of the mobile Internet because users are enabled to access additional information related to where they are, what they are seeing, or what they are doing, as well as instantly purchase related services and content.
For example, instead of just seeing a street poster for a club night and passing by, this new paradigm opens up instant access via the Internet-enabled smartphone to the club‘s location, purchasing an entrance ticket or listening to/buying the DJ mixes. A new co-operation net-work – the Innovation Network for Smart Applications and Media - will bring key Austrian R&D and innovative SMEs together to make real this new paradigm for smart mobile and media applications which we call Smart Reality, and be the first to benefit commercially from it. |
2010 | 2012 |
|
AR4DOC - Augmented Reality for Document Inspection
(details) |
Smartphones have evolved considerably in processing power over the last years. They now feature multi-core CPUs as well as GPUs and consumer-quality cameras up to HD resolution. This makes them an interesting platform for graphics and vision and opens new opportunities for research. The aim of AR4DOC is to facilitate the task of document inspection by a human operator. This requires the person to have detailed knowledge about the nature of a document, which may be outdated or even unavailable at the time of inspection. We seek to provide this information in an interactive way using Mobile Augmented Reality (AR), so that a well-grounded decision on the vailidity of a document is possible. This involves several tasks such as document localization, recognition, tracking, presentation as well as interaction.
|
2010 | 2013 |
|
PEGASUS: Autonomous Inspection of Overhead Power Lines using an Unmanned Aerial Vehicle
(details) |
The aim of the PEGASUS project is to develop a mobile vision system for overhead power line inspection to be mounted on an unmanned aerial vehicle (UAV). The long term goal is to develop a fully autonomous aerial vehicle which is able to perform power line inspection in an automated manner. This goal requires innovative solutions to a number of problems such as visual navigation, visual tracking and obstacle detection, model-based inspection under harsh conditions etc. In addition, due to the use of a small scale UAV (e.g. a quad-rotor helicopter) we have restricted computational resources for algorithms that need to be executed on the UAV (especially for navigation and tracking). Within PEGASUS we want to make significant progress towards this long term goal. In particular, PEGASUS will provide a set of tools for the inspector. The project is organized in four phases: First, an inspection system for a single power tower is developed. Used in ground-based inspection, the UAV provides close-up views of all points of interest from an optimal viewpoint. Second, we want to implement an automatic visual inspection system which highlights possible faulty components. In a third step, the system is extended towards multiple towers (still in the sight of the operator). Finally, the system will be used as a handheld system in manned helicopters by power line inspectors, where it should dramatically reduce the time needed for inspection. From a research perspective we will develop novel solutions for model-based recognition and pose estimation, visual navigation including obstacle avoidance and automated model-based visual inspection. All of these problems are extremely challenging because of the uncontrolled conditions (illumination etc.) and the real-time requirements. If successful, the methods developed in PEGASUS will be usable beyond the task of power line inspection. |
2010 | 2013 |
|
Mobi-Trick
(details) |
The focus of the project is outdoor mobile computer vision with all of its challenges. Mobile systems need to be compact and energy efficient and are frequently changing locations. Therefore they must be autonomous and perform processing locally. A number of challenges arise from these requirements for which the project aims to provide solutions: Being compact, there is not much space for a large number of sensors such as laser scanners, radar antennas and the like. The work in this project will focus on stereo vision but with two different types of cameras. Often a second camera is already available and stereo information increases detection accuracies. Each time the system moves it needs to adapt to the changing situation. This requires adaptive calibration and online learning. Mobile systems often work from batteries. In addition, there is not much space to include intricate cooling systems. Thus, the system must be designed to be very energy efficient. New approaches for dynamic power management will be explored in the project. To put the work into context, several applications from the area of traffic surveillance/toll enforcement will be implemented and tested in an application oriented setting. Current traffic enforcement solutions are either very large and costly (section control, toll enforcement) or do not offer much in terms of image processing (radar speed control). The technological output of Mobi Trick makes it possible to design mobile solutions for traffic monitoring, vehicle identification and classification, intelligent incident detection and observation of driver behavior. Mobile devices are also more efficient in enforcement. Their transient nature makes them less predictable. Mobile systems can also react more flexibly to changing road situations such as construction sites. |
2010 | 2013 |
|
HD-VIP: High Definition Video Processing
(details) |
The growth of information is nowadays enormous and at a level which had never been reached before. We currently produce almost more data in one year than was produced in the entire history of mankind so far. In particular the trend to a full digitization of audiovisual content is contributing to this explosion of available material. The exponential growth of online video, most notably YouTube among the many prominent video portals is just one example for that. Even if international studies are not arriving at exactly the same results, the figures are impressive: digital production in 2006 was approximately 160 Exabyte, and is predicted to rise to 990 Exabyte in 2010. Any video processing /editing software has to keep pace with these extraordinary data rates which requires special efforts from the hardware and the software. Fortunately we see also an extraordinary increase in processing power, especially when looking at recent developments of graphics cards (GPUs). These cards offer massive parallelism (ideally suited for video processing) at a rather modest price. All these facts make this hardware an ideal candidate for video processing. But in order to make full use of the hardware the algorithms have to be highly parallel. Typical tasks encountered in video processing (which will also be tackled by the proposed project are): Superresolution: With the advent of HDTVs in many homes there is an increasing need to produce also HDTV content. In order to make use of existing (low-resolution) material one can use so called superresolution algorithms. These methods generate from a sequence of low resolution frames a high resolution image by exploiting the high interframe redundancy. Denoising: There are many sources of noise in a video, either the material is historic or during production/compression etc. noise is added to the video. A basic task is to remove the noise but still preserve all fine scale details. Interactive video editing: For post production purposes one wants to mark objects in a video (of course the object should only be marked in a single frame and then segmented automatically in all subsequent frames) and either remove them (which requires inpainting methods to fill the holes with meaningful content), place them somewhere else in the video or replace them with different objects. Since these tasks are done interactively this requires interactive framerates. Fortunately all of these tasks can be addressed by so called variational methods. The basic idea is to formulate the task as a minimization problem of a suitable energy functional. Besides other desirable properties these methods can be implemented in a highly parallel fashion which makes them ideal candidates for implementation on modern GPUs. |
2010 | 2012 |
|
Higher Order Variational Methods
(details) |
This research project is devoted to the study of higher order convex variational methods for problems in computer vision. First order methods, i.e. methods which take into account first order derivatives have shown a great success for a variety of inverse computer vision problems. This success is mostly due to the introduction of total variation methods by Rudin, Osher and Fatemi in 1992. Total variation methods exhibit the important property to preserve sharp discontinuities in the solution while the associated optimization problem is still convex. This leads to robust problem solutions, independent of any initialization. Besides this, total variation methods also exhibit some disadvantages. First, total variation methods favor piecewise constant solutions which leads to staircaising artifacts in image restoration problems and to the preference of fronto‐parallel structures in stereo problems. Second, total variation methods introduce a shrinking bias in shape optimization problems. The aim of this project is therefore to study higher order convex variational methods in order to improve the shortcomings of first order methods. We therefore propose to investigate two approaches. The first approach is based on the so‐called generalized total variation method, recently introduced by Bredies, Kunisch and Pock. It provides a framework to recover piecewise polynomial functions based on a convex functional. We expect that this method leads to significant improvements of stereo and motion estimation problems. The second approach is based on the so‐called roto‐translation space introduced by Citti and Sarti in 2006. It allows to rewrite functionals incorporating curvature regularity by means of a convex first order functional in higher dimensions. We expect that this approach will significantly improve the performance of various shape optimization problems. |
2010 | 2013 |
|
Highly accurate range computation in driver assistence systems
(details) |
In this project we study variational methods for computing highly accurate range data in driver assistance systems. |
2010 | 2011 |
|
Image Processing and Statistical Learning
(details) |
The goal of this project is to study statistical learning methods in particular boosting and random forest for computer vision tasks. We are especially interested in on-line learning. |
2009 | 2010 |
|
KIRAS - SECRET
(details) |
Different authorities like such as the Ministry of the Interior often require to find certain event or behavior patterns in recordings in large video archives. This "forensic" search is computationally extremely expensive and due to restricted storage permissions often even not possible. Thus, security-critical events can often not prevented or being postpursued. To overcome these problems, the aim of the OUTLIER project is the investigation of algorithms, methods, and processes to alleviate the work of security staff in searching and pursuiting of events in video archives. Furthermore these tasks should be performed more efficiently. Based on the requirements of the Ministry of the Interior as well as the possibilities of an infrastructure operator these issues should be examined and a research prototype should be created. This should occur in cooperation of AIT and ICG (University of Technology Graz) as research partners and ASE as an industrial partner. Essential research subjects are: (i) detection and segmentation of people, (ii) comparisons and finding of events in different video streams, and (iii) analyses and learning of behavior patterns. In addition, a social-scientific acceptance research will be established by the research institute of the Red Cross (FRK). Based on these results recommendations are compiled for the optimization by use and minimization of problem potentials from social-scientific view. |
2009 | 2012 |
|
Narkissos - Virtual Dressing Room
(details) |
The main goal of NARKISSOS is to develop the next generation “magic mirror“ to be installed in a dressing room of a fashion store. The magic mirror is a technical multimedia system, where the consumer can watch himself on a video wall dressed by the clothes which are chosen by touch board or which he did register per RFID tag (embedded in the clothing) at a RFID reader stationed near the video wall of the virtual dressing room. Users can interactively change shape and appearance of the clothing in the mirror image without actually having to change cloths. Customers can also observe themselves (i.e., their avatar) from every side instantaneously. | 2009 | 2012 |
|
OUTLIER
(details) |
The ever increasing number of cameras in surveillance system requires automatic video analysis in order to spot critical situations and to alert the monitoring personnel in a timely manner. While most current approaches in this area aim for detecting a large number of specific events on a large set of complex application scenarios, the goal of this project is to go far beyond state of the art by developing novel online learning methods to detect unusual situations in a camera specific scenario. We will exploit the huge amount of data available for a specific camera to reliably learn usual and unusual situations. In particular the OUTLIER project will carry out basic research in the following areas:
These generic learning algorithms will be applied for the detection of unusual situations in public places and traffic scenarios. Examples are the detection of unusual crowd behavior (upcoming panic, barred emergency exits, or toppled persons), suspicious behavior of pedestrians (e.g. going from one car to another, loitering), vehicles or persons moving on unusual locations, the detection of unusual types of moving objects and detection of unusual situations like accidents, clashes and collisions. Unlike other approaches we do not want to model these situations explicitly and individually, but we will resort to learning to discriminate the usual situation from the unusual one. Research partners in the project are JRS, TUG for basic and applied research and Siemens for industrial exploitation of project results. |
2009 | 2011 |
|
Multimedia Documentation Lab
(details) |
The potential for integration of multimedia content into the analysis of security relevant affairs is researched for the first time within the scope of Austrian security research efforts. The project’s goal is to harvest audio-visual information from specified open multimedia sources such as TV broadcasts and allow for integration into existing environments at user sites. The intended use of the system is to allow experts to efficiently generate more realistic and high-quality situation reports in the face of critical situations. Subsequently, these can be employed for communication with the population of Austria and to increase its security and sense of security - target goals of the KIRAS framework. An exemplary implementation of a prototype will be installed at the Zentraldokumentation of the Austrian Armed Forces. In terms of audio-processing the project builds upon existing technologies of the industrial partner, while the visual processing is investigated by ICG as academic partner and will mainly deal with person/face detection, tracking and recognition methods. |
2009 | 2011 |
|
inGeneious - Holistic Visualization of Biomolecular and Clinical Data
(details) |
Ziel des Projekts inGeneious ist es, Visualisierungsmethoden und Work-Flows zu entwickeln, die Biologen und Medizinern bei der Analyse biomolekulare Daten im Kontext von klinischen Faktoren sowie biologischen Prozesse unterstützen. Die Berücksichtigung dieser Faktoren bei der Analyse von zum Beispiel Genexpressionsdaten ist entscheidend, da auf diese Weise Rückschlüsse über Zusammenhänge von genetischer Predisposition und Krankheitsverlauf gewonnen werden können. Zwei zentrale Forschungsfragen sind Gegenstand des inGeneious-Projektes. Zunächst soll eine ganzheitliche Betrachtungsweise der drei Datenräume durch Multiple-View-Verfahren und effizientes visuelles Verbinden von Informationen ermöglicht werden. Darauf aufbauend soll eine vergleichende Analyse divergierender Gruppen durch neue, vergleichende Visualisierungsmethoden ermöglicht werden. Experten erhalten damit ein Werkzeug um die immer größer werdende Menge biomolekularer Daten effizient verwenden zu können. Diese Forschungsarbeit wird innerhalb des Projekts Caleydo durchgeführt. |
2009 | 2011 |
|
SMART Vidente - Subsurface Mobile Augmented Reality Technology for Outdoor Infrastructure Workers
(details) |
SMART VIDENTE focuses on research on the next-generation field information system for utility companies, providing mobile workforces with capabilities for on-site inspection and planning, data capture and as-built surveying. For achieving this aim, handheld Augmented Reality technology is used for on-site modification and surveying of geometric and semantic attributes of geospatial 3D models on the user’s handheld device. The project aims at providing a fully functional handheld Augmented Reality device for utility field workers. To achieve this goal, we require a software solution that can visualize registered three-dimensional underground models in real time. Registration in 3D requires being able to perform accurate global localization and posing tracking in real time without relying on unrealistic assumptions concerning prior scene knowledge. We will address this issue through fusion of vision, inertial and GPS sensors. Visualization requires the rendering of complex 3D models of underground infrastructure in a way that is easily comprehensible and useful to the mobile worker. This requires visualization techniques for geometric as well as non-geometric information from the geo-database, in particular of hidden objects through so-called “X-ray vision”. These visualization techniques need to be adaptive to scene complexity and environmental conditions. The three-dimensional geometry to be shown is not available per default, but must be extracted from a conventional database system and interpreted on-the-fly as a 3D visualization using procedural modeling techniques. We want to support annotation and even surveying tasks in the field, so the system must also allow to write information back to the geo-database. Finally, we will work with three large Austrian infrastructure companies to assess the usability of our solutions. |
2009 | 2011 |
|
Person Re-Identification
(details) |
The goal of this project is to develop an interactive visual search method that finds a given pedestrian in a large archive of other camera views efficiently. A user-selected pedestrian image or sequence is used to obtain initial discriminative features and an initial ranked list of hypothetical matches. A discriminative pedestrian recognition model is learned in an on-line manner by user interaction assigning positive and negative labels to the initially retrieved results and on-line boosting for feature selection. This enables that the best discriminative features for the current query are selected. |
2008 | 2010 |
|
Semi-Supervised Learning for the Analysis of Unstructured Documents
(details) |
The goal of this project is to develop and analyze methods for analyzing textual information. This should be realized by using semi-supervised learning methods, which use labeled as well as unlabeled data. In particular, existing methods which are already applied for pattern recognition should be adapted such that those can also be applied for textual data. For a practical analysis comparisons to SVM and k-NN classifier using a boosting algorithm should be performed, the influence of the amount of labeled/unlabeled data and the convergence should be analyzed. Moreover, a fair comparative study between batch and on-line methods is performed. |
2008 | 2011 |
|
MARCUS - Mobile Augmented Reality and Context in Urban Scenarios
(details) |
MARCUS is an exchange program with the Human Interface Technologies Laboratory (Christchurch, NZ) and the University of Otago (Otago, NZ). Its aim is to extend the scope of the research work performed in the EU Integrated Project "IPCity" with researchers in New Zealand. The focus of research will be on how mobile devices can create new types of interactive urban experiences. For example, location specific information overlaid on the real world can be used to aid navigation through cities, in outdoor game play, or for providing user supplied comments at certain sites. |
2008 | 2010 |
|
Ludwig Boltzmann Institut für Klinisch-Forensische Bildgebung
(details) |
Die klinische Rechtsmedizin gewann in den letzten Jahren aufgrund einer Sensibilisierung der Öffentlichkeit gegenüber häuslicher und sexueller Gewalt, Gewalt gegenüber Kindern und Verdachtsfällen von medizinischen Behandlungsfehlern stark an Bedeutung. Die forensische Untersuchung von Lebenden ist bis heute jedoch auf eine äussere Besichtigung des Körpers beschränkt. Das neue Ludwig-Boltzmann-Institut (LBI) für klinisch-forensische Bildgebung hat zum Ziel, Verfahren zur Erfassung von inneren Verletzungsbefunden als Grundlage für forensische Gutachten zu entwickeln. Mittels Computertomographie (CT) und Magnetresonanztomographie (MRT), welche in der Klinik etabliert sind, können zusätzliche, objektiv nachweisbare innere Verletzungsbefunde erhoben werden, die eine verbesserte Einschätzung der ausgeübten Gewalt gegen die untersuchte Person ermöglichen. Die Methoden sind jedoch auf klinische Diagnostik ausgerichtet, während forensisch wichtige Befunde nicht oder nicht optimal dargestellt werden. Das Institut fuer Maschinelles Sehen und Darstellen kooperiert mit dem LBI zur Entwicklung neuer Methoden der Bildverarbeitung und Computergrafik zum Zwecke der Bildgebung. |
2008 | 2015 |
|
IMPPACT - Image-based Multi-scale Physiological Planning for Ablation Cancer Treatment
(details) |
IMPPACT is a European research project, which develops an intervention planning system for Radiofrequency Ablation of malignant liver tumours. TU Graz is dealing with medical visualization and augmented reality in the project. Problem or Context Radiofrequency Ablation (RFA) is a minimally invasive form to treat cancer without open surgery, by placing a needle inside the malignancy and destroying it through intensive heating. Though the advantages of this approach are obvious, the intervention is currently hard to plan, almost impossible to monitor or assess, and therefore is not the first choice for treatment. Project IMPPACT will develop a physiological model of the liver and simulate the RFA intervention result, accounting for patient specific physiological factors.
Mathematical modelling together with experimental validation lead to a patient specific intervention planning system. read more Expected Results & Impacts IMPPACT will be modelling a physiological organ including the metabolism and patient specific tissue properties. This alone is a huge step forward as compared to the state-of-the-art intervention planning systems that do not address this issue. The IPS will allow prediction of treatment results on a patient specific base. It will therefore bring down the risk of local recurrences and eliminate the nowadays so common repeated treatments of the same tumour, making RFA an as effective treatment as resection. |
2008 | 2011 |
|
HydroSys - Advanced spatial analysis tools for on-site environmental monitoring and management
(details) |
The research aim of the project is to provide a system infrastructure to support teams of users in the on-site monitoring of events and analysis of natural resources. The project will introduce the innovative concept of event-driven campaigns using handheld devices, potentially supported by an unmanned aerial vehicle (UAV). Event-driven campaigns provide users the capacity to analyse and predict environmental changes on-site, supporting the process of taking appropriate countermeasures to avoid environmental degradation. During these campaigns, users will be able to setup and retrieve data from mobile sensorstations, the UAV and external sources (such as permanent sensor networks) in order to generate dense information on a small area. The whole sensor network system will gather and store sensor data, and process simulations based on physical process models. Hence, a shared information system fusing heterogeneous data sources will be provided that supports teams of stakeholders to monitor environmental processes on-site, complementing remote monitoring and management. To enrich the data sets from a specific location, additional remotely controlled cameras will be deployed, mounted on sensorstations and below the UAV. Users will be able to analyse the environment using mobile phones and handheld computers, supported by advanced user interface techniques. The project will improve monitoring and management for environmental scientists, institutions, service providers, engineering companies and municipalities through its strong integration of handhelds and sensor networks. The project will progress well beyond the current state in the art, by dealing with short-term events and detailed analysis of small sites. The analysis of such events is hardly supported by current methods, but has a large impact on environmental degradation. Furthermore, information is made available to citizens by providing mechanisms to access top-level environmental data. Within the project, cutting edge inter-disciplinary research will be performed to develop user-centered solutions. When the data is integrated with analytical tools in a shared information space it will also aid a wide range of managers and planners pursuing more environmentally sensitive solutions to engineering problems. To aid the process, the research is steered by considerable end-user involvement in all its phases. |
2008 | 2011 |
|
Christian Doppler Laboratory for Handheld Augmented Reality
(details) |
Augmented Reality (AR) combines real and virtual in a single view, putting information right were it belongs - into the real world. AR is still a young research field and hence strongly driven by basic research and experimental methods, while only few successful commercial applications have been deployed. One of the reasons is that past hardware (such as head-mounted displays and Tablet PCs) have not been sufficiently inexpensive and ergonomically satisfactory. Therefore, recent AR research shows a trends towards deploying AR on advanced mobile phones, using the phone camera as video see-through interface for a “magic lens” style of AR. Recent research in the proposer’s group has first the first time established a baseline technology for achieving real-time performance AR on mobile phones, and this development has been meet with great interest from industry. This proposal the logical consequence of this development. It is concerned with extending this research in several directions, in particular making techniques more scalable (sometimes several orders of magnitude), so that realistic real world scenarios interesting for commercial applications can be attacked by industry. Firstly, we want to expand our real-time computer-vision based pose tracking and object recognition techniques. Secondly, we propose to develop realistic AR image synthesis and visualization methods. Thirdly, we suggest an investigation into efficient 3D interaction techniques with and for AR phones. Finally, we suggest the creation of a distributed infrastructure based on Web 2.0 technology for scalable content creation and deployment of geo-referenced AR applications on phones. |
2008 | 2015 |
|
CityFit: High-Quality Urban Reconstructions by Fitting Shape Grammars to Images and derived Textured Point Clouds
(details) |
The generation of realistic 3D models of whole cities has become a vibrant and highly competitive market through the recent activities of, most notably, Goggle Earth and Microsoft Virtual Earth. While the first generation of these systems only delivered high-quality zoomable images of the ground, the current trend is heavily geared towards 3D – that is, users can access three-dimensional height- fields of the terrain, and even 3D models of individual buildings. Simple building models, basically extruded polygons with different types of roofs, can be generated today from aerial images completely automatically. This is a solved problem. Far from solved, however, is the problem of generating automatically detailed buildings with façades. Input data for this problem are registered range maps obtained by stereo matching and sequences of highly overlapping thus redundant images (taken from a car driving in the road) where each pixel has not only a color but also a depth, a z-value. Although range maps can be directly rendered in principle, the data size is huge and, more importantly, the pixels have no semantics: A priori there is no difference between a pixel on the floor, on the wall, or on a door. But these shape semantics are required by all downstream applications using the city model. Shape grammars, on the other hand, have recently become (again) a popular method in research for representing 3D buildings. Their great advantage is that they allow to parameterize buildings, which can be used for populating virtual cities with believable architectural buildings, e.g., for 3D games. The goal of the CITYFIT project is, given highly redundant input imagery and range maps from an arbitrary building in Graz, to synthesize a shape grammar that, when evaluated, creates a clean, CAD- quality reconstruction of that building that fits the original data very closely and makes the semantics of all major architectural features explicit. These shape semantics can even be transferred back to inform the original data, so each of these “semantically enriched” data points can tell whether it belongs to ground, wall, or door. |
2008 | 2010 |
|
vdQA: Video Quality Analysis
(details) |
Automatic and efficient quality analysis of audiovisual content has become a crucial step before storing the material for later use. While most approaches in this area are only dealing with low level signal analysis, the goal of this project is to go far beyond state-of-the-art procedures. On the basis of novel as well as proven computer vision methods, we will attempt to incorporate high level knowledge in the analysis step, thus achieving significant better and faster results than current methods, comparable in their reliability with a human operator. In particular the vdQA project will carry out research in the following areas: • Improvement of optical flow field methodologies to deal with multi-frame information • Application of novel segmentation methods in order to enable semantic quality analysis. • Knowledge assisted artefact assessment and classification. • Novel methods for fast and robust detection of difficult impairments like unsteadiness, flicker, freeze frames, test patterns and lost frames. • Research into methodologies that are particularly well suited for implementations taking advantage of GPU hardware. The grand challenge in the end is the combination of robustness, speed and integration of human knowledge. The research and industrial partners have dedicated roles in the work programme to achieve those goals. The industrial partners have excellent knowledge of the market and will provide user requirements as well as extensive test material. The academic partners will do research in their respective fields, namely development of basic algorithms for optical flow, tracking, segmentation, classification and usage of GPUs as well as algorithms for content based quality analysis and semantic technologies to represent knowledge. Towards the project end the industrial partners will evaluate and test the developments together with pilot end users. Project Partners: Joanneum Research, Institute of Information Systems & Information Management |
2008 | 2010 |
|
CranUS - Cranial Ultrasound Simulation
(details) |
The use of augmented reality in medicine is an important field, especially in teaching and training of sensitive tasks. To support teaching and training of neonatal cranial sonography, an augmented reality simulator was developed. Physical models of a newborn and an ultrasound probe were tracked and their movements displayed in their virtual representation. The head of the newborn model was augmented with a 3D volume, reconstructed from ultrasound images of a real patient. Reconstructing a 3D volume from irregular source data takes a special focus on positioning the images and the subsequent interpolation. Moving the physical model towards each other, the according slices are generated in realtime. |
2007 | 2008 |
|
Doctoral Program for the Confluence of Graphics and Vision
(details) |
Computer vision and computer graphics constitute two closely related areas of research: Though both fields rely on the same physical and mathematical principles and on a common set of representations, they mainly differ in how these representations are built. Traditionally these two fields have been treated as separate academic discipline. Exploiting the commonalities between vision and graphics turns out to be a scientifically profitable endeavour. There are many examples of fruitfull combination of graphics and vision, but there is no systematic education of students (especially in Austria). Therefore, the goal of this doctoral program Confluence of Vision and Graphics is to educate highly talented PhD students in this interdisciplinary field and to teach them a common view of this challenging topic from the start. All proposed topics require a significant amount of vision and graphics. The students will be co-supervised jointly by one professor with vision and one professor with graphics expertise. The proposed educational program will ensure that the students will be trained to become future leading scientists, which will face the challenges of research excellence in the interdisciplinary area of graphics and vision, academic leadership, and social competence as a member of a particular research group as well as being a part of the global research network. |
2007 | 2019 |
|
VIPEM - Visual Analytics for Personalized Medicine
(details) |
VIPEM ist ein System zur hypothesengesteuerten Analyse multidimensionaler Datenräume im Gebiet der personalisierten Medizin. Ein multidimensionaler Datenraum, bestehend aus molekularen und klinischen Daten, wird unter gleichzeitiger Anwendung algorithmischer Verfahren und direkter Benutzerinteraktion gefiltert und hierarchisch strukturiert. Ein zentrales Forschungsproblem der personalisierten Medizin ist die Frage, wie die Verknüpfungen zwischen genetischen Variationen und Krankheiten, bzw. dem Ansprechen auf bestimmte Medikamente, gefunden werden können. Dazu gilt es, z.B. Gendaten mit klinischen Daten zu verknüpfen und in Folge spezifische Patientengruppen zu identifizieren. Die großen Datenmengen der molekularen Analyseverfahren (genetische Polymorphismen, Genexpressionsdaten, Proteomics) können nur mehr mit Methoden der Bioinformatik und Statistik bewältigt werden. Aber auch Standardmethoden der Statistik und der Bioinformatik versagen, wenn die Daten sehr inhomogen strukturiert sind dies ist bei den klinischen Daten der Fall und wenn Strukturen in den Daten durch Rauschen bzw. dominante Muster verdeckt werden. VIPEM soll mit Hilfe von Visualierungsmethoden die Struktur in den Datenräumen sichtbar machen und eine interaktive Navigation und Strukturierung sowohl der molekularen, als auch der klinischen Daten erlauben. VIPEM baut auf Grundlagenergebnissen in den Bereichen Informations-Visualisierung und multimodale Benutzerschittstellen auf. Durch eine enge Verknüpfung mehrerer gleichzeitig wirksamer Eingabekanäle und die sofortige Sichtbarkeit der Analyseschritte in der Visualisierung steht dem Experten ein Werkzeug zu interaktiven Erkundung von komplexen Datenräumen zur Verfügung. Als Eingabeparameter für Analysealgorithmen nutzt VIPEM hierbei die menschliche Fähigkeit, komplexe Muster und Zusammenhänge visuell bereits in Ansätzen zu erfassen, und erlaubt dadurch das Freilegen sonst verdeckter Strukturen. VIPEM fokussiert auf die hohe Nachfrage nach visualisierter Analytik im Bereich der Bioinformatik. Der innovative Zugang von VIPEM versteht sich als einmaliges Verkaufsargument, zumal sich mit VIPEM ein viel versprechendes Produkt abzeichnet, welches sicher innerhalb der nächsten zwei bis drei Jahre seinen Stellenwert als verwertbares Produkt am Markt behaupten könnte. Diese Forschungsarbeit wird als Teil des Projekts Caleydo durchgeführt. |
2007 | 2009 |
|
POMAR 3D - Position and Orientation Measurement in 3D for Augmented Reality
(details) |
Positionierungs- und Orientierungsmodul für einen Mobilen Augmented Reality- Client zur 3D-Echtzeitvisualisierung unterirdischer Ver- und Entsorgungsinfrastruktur |
2007 | 2008 |
|
Genoptikum - Interactive Biomedical Information Visualization
(details) |
Genoptikum is an interactive data exploration system for the visualization of and navigation in molecular and clinical data in the field of personalized medicine. Genoptikum addresses the essential but to date unsolved problem of how to identify connections between genetic variants and their corresponding diseases or the response to certain drugs and treatments, respectively. It is, therefore, necessary to connect gene data and clinical data in order to categorise specific subgroups of patients with certain disease features. The huge amount of data provided by molecular analytical methods (genetic polymorphisms, gene expression data, proteomics) can only be analysed by applying statistical methods and bioinformatics. However, even standard methods of statistics and bioinformatics fail when the data are inhomogeneous as is the case with clinical data and when data structures are obscured by noise and dominant patterns. Genoptikum should make the structure of the data spaces visible by using innovative methods of visualisation based on multiple high resolution displays in combination with data projection technologies. Genoptikum is bases on fundamental results in the fields of visualisation of information and multimodal user interfaces which enable an interactive navigation and structuring of both molecular and clinical data. Through a close link between several input channels, which are simultaneously active, and by immediate visualisation of the steps of the analysis, the expert is provides with a tool for the interactive exploration of complex data spaces. As input parameter for analysis algorithms Genoptikum makes use of the human visual capacity to grasp complex patterns to reveal hidden structures and correlations in large data spaces. This research is part of the project Caleydo. |
2007 | 2009 |
|
ICAO Face Normalization and Analysis
(details) |
The goal of this project is the research and development of state of the art computer vision and object recognition algorithms to analyze face portrait images according to the ICAO (International Civil Aviation Organization) standards and specifications. Therefore a close cooperation with Siemens IT Solutions and Services Biometric Center in Graz exists, where the Biometry group is developing a software solution for this purpose. Current passports issued in the European Union contain biometric data like e.g. digital photographs and fingerprints in order to uniquely identify its owner. To be able to read passports all over the world, the ICAO has specified a number of guidelines and requirements for the structure of these biometric features. In case of face portrait images, examples for these requirements are neutral appearance, eyes opened, mouth closed, frontal pose, straight-looking eyes, properly-sitting eye-glasses, or uncovered faces. Since these analysis steps have to be performed in an automatic fashion, each of these requirements imposes certain computer vision research challenges which are tackled in this research project. Examples for the topics involved in these analysis steps are model-based segmentation using active shape and active appearance models, fast and robust AdaBoost based machine learning algorithms for face and face component detection, or classification of facial expressions using multi-classifier fusion approaches. |
2007 | 2009 |
|
Deskotheque - Collaborative Interaction in Multi Display Environments
(details) |
Office space usually consists of private single-user workstations. Team work takes place on separate locations, usually supported by analogue media like printed paper. Digital data exchanges is accomplished through designated channels like e-mail or instant messengers. Deskotheque is an ongoing project aiming to extend personal workspaces to enhance team work. It represents a flexible, interactive environment for team work, conference and meeting rooms. Unused surfaces in the room, such as empty wall space and table surfaces, can be turned into interactive, digital displays to be used for multi-user co-located teamwork. |
2007 | 2011 |
|
EVis: Autonomous Traffic Monitoring by Embedded Vision
(details) |
The world will witness a tremendous increase in the number of vehicles in the near future. Future traffic monitoring systems will therefore play an important role to improve the throughput and safety of roads. Current monitoring systems capture (usually vision-based) traffic data from a large sensory network; however, they require continuous human supervision which is extremely expensive. In the proposed EVis research project we investigate the scientific and technological foundations for future autonomous traffic monitoring systems. Autonomy is achieved by a novel combination of three approaches: First, vision-based detection and classification methods are augmented by self-learning and scene adaptation mechanisms which will significantly reduce the effort of manual configuration. Second, visual data is fused with data from other sensors such as radar, infrared or inductive loop sensors. Sensor fusion helps to improve the robustness and confidence, to extend the spatial and temporal coverage as well as to reduce the ambiguity and uncertainty of the processed sensor data. Finally, the developed vision and fusion methods are implemented on a distributed embedded platform which makes them wider applicable and supports real-time operation. Our autonomous traffic monitoring system will be evaluated using real world traffic data. The evaluation will be conducted in three different case studies: offline testing using recorded data, online testing on a traffic test site, and on a test installation on a public road. |
2007 | 2010 |
|
APAFA: Automated Photogrammetric Aerial Feature Analysis
(details) |
The systematic creation of models of the real world to support the locational awareness on the Internet can be achieved if previously required massive manual labor gets replaced by automated procedures. A particular challenge exists in the automation of the extraction of the 4 classical map features buildings, circulation spaces (e.g. road networks), vegetation and water bodies, as well as their interaction. Decennia of research have been unable to automate the extraction of these features from classical aerial photography towards an economically viable result. However, we believe that we can succeed in the proposed project to develop automated procedures to create feature data for three reasons. First is the recent advent of digital aerial sensors producing highly redundant digital large format aerial photography. Redundancy will be obtained by using high forward and side overlaps, say at 80% and 60%, so that every point in the terrain is imaged at least 10 times, and any algorithm can rely on multiple analysis results that then can either reinforce or cancel one another. Second, the geometric redundancy gets augmented by a radiometric redundancy using 4 spectral bands, adding an infrared band to the classical red, green and blue color channels. Third, we will combine the classical "object reconstruction" approach available from stereo procedures, by new recognition methods. While classically a "car" on a street may have been seen via a "point cloud" and would have to get recognized simply by a representation of local height anomaly on an otherwise flat reference surface, recognition includes the use of stored images of cars in a data base to actually recognize a car as a human would do when inspecting an aerial image. The project is split up into five work packages which will focus on how reconstruction and recognition techniques can help each other and how additional information either from a previous mission or GIS can be integrated in the 3D modeling framework. One work package will address the assessment of the obtained quality, another will address project management and dissemination activities. Within the project we will develop an extensive library of combined recognition/reconstruction methods, and apply them to a range of test data sets. Test data will vary in geometric resolution (pixel size), overlaps, and types of terrain scenarios. |
2007 | 2010 |
|
VM-GPU: Variational Methods on the GPU for Industrial Problems
(details) |
The project VM-GPU fits exactly to the FIT-IT Visual Computing call. It is a combination of computer vision and graphics methods to offer solutions to a problem of great relevance for industry. In particular,
The goal of VM-GPU is to make variational methods available for industrial problems by using modern graphics hardware. If successful this project will have a large impact on the machine vision industry, it will allow for the first time to use variational methods in an industrial setting, in addition having graphics cards available as computing platforms will offer completely new ways of addressing industrial vision problems (e.g., it is very easy to scale up by just using a second graphics card). |
2007 | 2009 |
|
AUTOVISTA: Advanced Unsupervised Monitoring and Visualization of Complex Scenarios
(details) |
The trend in video surveillance is an ever increasing number of (digital) cameras for surveying complex scenarios (e.g. crowds). Currently available video surveillance systems cannot cope with this increased complexity, the detection rates are too low and the systems are not reliable enough. This hinders the broad use of automatic surveillance systems. AUTOVISTA proposes to use modern visual computing technologies to advance the state-of-the-art of video surveillance considerably. In order to cope with the increasing number of cameras, AUTOVISTA will (1) use novel on-line learning techniques to increase the detection rate and decrease the false alarm rate, while the camera adapts in an unsupervised manner to the surveyed scene. Besides an increased performance, this has the additional advantage that the installation and maintenance effort will be substantially decreased; (2) exploit novel visualization and interaction techniques to support the human operator. Furthermore two complementary visualization modes are proposed, blending smoothly between these allows the operator to maintain coherence. These techniques will enable a single operator to cope simultaneously with a large amount of cameras. AUTOVISTA will tackle the problem of increased people densities and highly cluttered scenes in a novel manner. Instead of relying on single person detection and tracking (which is not feasible for high people density scenarios), methods will be investigated to handle the crowd as a whole. AUTOVISTA will derive spatio-temporal crowd statistics, describe normal crowd behavior and use this for unusual event detection. |
2007 | 2009 |
|
3D Reconstruction of Electrical Impulse Discharges
(details) |
Electrical impulse discharges in nature are visible as lightning. Their impact point can be electro magnetically located up to a precision of several hundred meters. In some restricted areas such as industrial plants, airports etc. it desirable to know the the impact region and path of the lightning up to a precision of a few meters. If visibility is not too restricted by weather conditions, a multi-camera setup would be a viable option to locate path and impact area of the discharge. In this project impulse discharges of a few meters are synthetically generated under laboratory conditions and reconstructed using a multi camera setup. |
2007 | 2008 |
|
Presenccia - Presence: Research Encompassing Sensory Enhancement, Neurosciense, Cerebral-Computer Interfaces and Applications
(details) |
This Integrated Project will undertake a research programme that has as its major goal the understanding and exploitation of brain mechanisms for the enhancement of presence and interaction in mixed and virtual reality. The project is highly interdisciplinary, combining neuroscience, computer science, psychiatry, psychology, psychophysics, mechanical engineering, philosophy and drama. By presence we mean the propensity of humans to respond to fake stimuli as if they were real, something observed daily in every virtual reality laboratory. Understanding the neural basis of this ‘presence response’ its enhancement and its application is the fundamental object of study within the IP from many different points of view, and including visual, haptic and auditory modalities. The most interesting, challenging and useful mixed environments are social. The types of interaction we plan in mixed reality are those supporting interactions between real people and other remote real people, real people and virtual people, and between virtual people and virtual people. The aim is to make people’s responses real even if the perception of the reality is based on virtual stimuli. The project will carry out fundamental research adopting a neuroscience methodology and theoretical standpoint combined with research in the delivery of presence through multisensory modelling and rendering, and wide area tracking and display systems. A substantial part of the project is concerned with interaction through brain-computer interfaces. The whole will be brought together through a number of applications, in particular a persistent virtual community that represents the project itself, methods for projecting sensations of ownership to virtual representations of self, and the exploitation of neurofeedback for the enhancement of creativity through mixed reality environments. |
2006 | 2009 |
|
IPCity - Interaction and Presence in Urban Environments
(details) |
The research aim of the IPCity project is to investigate analytical and technological approaches to presence in real life settings. Analytically, this includes extending the approaches to presence accounting for the participative and social constitution of presence, the multiplicity and distribution of events in time and space. Technologically, this translates into developing portable environments for on-site configuration, mobile and light-weight mixed reality interfaces with the ambition to weave them into "the fabric of everyday life". Methodologically, this calls for moving "out of the lab" with field trials in real settings, applying a triangulation of disciplines and methods for evaluation. These range from interpretative-ethnographic to quasi-experimental approaches and include cognitive science, social-psychological, and cultural-anthropological disciplines. The vision of the IPCity project is to provide citizens, visitors, as well as professionals involved in city development or the organization of events with a set of technologies that enable them to collaboratively envision, debate emerging developments, experience past and future views or happenings of their local urban environment, discovering new aspects of their city. This includes:
The showcases include urban renewal projects, large scale events, and explorative edutainment and story telling applications. |
2006 | 2010 |
|
Vidente - An Augmented Reality System for Visualisation of Subsurface Utility Networks
(details) |
The objective of VIDENTE is the creation of a prototype for a mobile Augmented Reality (AR) solution for utility companies. This system will provide real time visualisation of up-to-date data of subsurface utility networks for maintenance purposes. AR enables will enable outdoor users on location to see information that they would otherwise have to retrieve from hardcopy or non-registered on-screen visualisations. The prototype mobile system will visualise 3D and semantic information derived from data stored in an underlying Geographic Information System (GIS). Partner Company: Grintec |
2006 | 2008 |
|
A Low-Cost System for Automatic People Tracking in a Labyrinth
(details) |
After medical treatment of visually handicapped people it is desirable to evaluate the benefit of the treatment for the patient. Especially the capability of the patient to orient himself in a three-dimensional environment, to navigate and recognize obstacles is of interest. For a clinical evaluation under controlled circumstances a labyrinth has been built through which the patient ha to navigate. Obstacles may be randomly placed in the labyrinth. A multi-camera system keeps track of the patients movements and extracts parameters such as position, speed, head rotation etc. |
2006 | 2007 |
|
Visualization of biomechanical properties of articular cartilage in the knee and ankle joint by means of multi-parametric MR imaging
(details) |
MRI is widely accepted as a non-invasive technique for visualizing the morphology of healthy and damaged of degenerate articular cartilage. In order to visualize early pathological changes of in vivo cartilage, use of parameter selective MR imaging is combined with relaxation time and diffusion constant mapping which makes it possible for the first time to perform a biochemical evaluation of cartilage in vivo. As well as assessing the biochemical properties, it is important to assess the biomechanical properties of cartilage, to make a full functional assessment of articular cartilage, this is particularly important in cartilage implants. The aim of this project is to further develop and validate individual MR parameters for evaluating biomechanical properties of cartilage, in particular cartilage stiffness. In order to fore fill the aims of this project we propose a 3 phase approach with in vitro, in situ and in vivo studies. MRI techniques including T1 and T2 relaxation time mapping, diffusion measurements, and sodium MRI will evaluate cartilage under controlled mechanical loading. The parameters measured in normal and degenerative cartilage using MRI will be correlated to the results of biochemical, histological and biomechanical tests. In vivo MRI studies of the biochemical and biomechanical properties of articular cartilage and cartilage implants require the application of controlled reproducible loads throughout the range of movement; therefore, as part of the project we will develop an MRI compatible kinematic device. For the planned MR visualization of the biomechanical properties of cartilage, optimal 3D segmentation and 3D reconstruction techniques of the cartilage layers must be developed. Image analysis will allow dynamic visualization of joint motion as well as determination of quantitative parameters including thickness, volume, surface area and joint contact area under physiological loading. This 3D visualization approach ensures that the evaluation of biochemical and biomechanical properties of articular cartilage can be performed under realistic mechanical loading of the joint. So far, such information has only been available through arthroscopic surgery. Thus, along with the basic science research on the biomechanics of articular cartilage, this non-invasive MR method also offers improved diagnosis, follow-up and rehabilitation of patients with cartilage disorders or implants. |
2006 | 2008 |
|
Robust Fingerprint Recognition and Classification
(details) |
The main aim of this project is to apply global models to fingerprint images for robust extraction of local features. These so-called minutiae features are used within classic pattern recognition algorithms for fingerprint matching (recognition, authentication). The direction field of a fingerprint is a crucial parameter for extracting minutiae features. Nevertheless many fingerprint images are of such poor quality, that the direction of the field can not be extracted for certain regions in the image. On the other hand it has been shown that if one can properly "guess" the direction, it is possible to apply enhancement algorithms which adaptively improve the clarity of ridges and furrows of such regions. In order to do this "guess work" computationally, a model for the directional field of a fingerprint must be applied during the extraction process. In another concept the extracted parameters of the directional field model can be employed for fingerprint classification. |
2006 | 2008 |
|
Computer Supported Automated Sports Game Analysis - Pilot Project Beachvolleyball
(details) |
The main aim of the project is to create a novel method for automated game analysis, which offers new insights in the structure of sport games. The first type of sport investigated will be beach volleyball which offers a not too complex structure(4 players, 1 ball and a rather small field) and is similar to several other types of sport.
All this functionality need to be offered in a software tool that can be used by sports scientists.
|
2006 | 2008 |
|
GenView - Visualization of Genetic Data
(details) |
This project is concerned with the visualization of Micro Array data using multiple displays and visual data mining techniques. It was the first research activity that led to the project Caleydo. |
2005 | 2006 |
|
CashFlow - A visualization framework for 3D Flow
(details) |
CashFlow is an open source flow visualization system extending Open Inventor and Coin3D toolkit by System In Motion with new features for visualization and rendering of arbitrary grids and vector data sets. |
2005 | 2006 |
|
Mammography
(details) |
Breast cancer is the most common cancer among women. In the European Community breast cancer is diagnosed in 157,000 cases and kills 70 000 annually. Early detection is essential to increase the surveillance rate. Therefore a lot of work has been done in recent years to enhance the prediction quality of the three most promising image modalities for breast cancer detection: X-ray imaging is used for screening of woman older than 50 years. Younger woman with dense breasts can be examined with sonography. Functional magnet resonance imaging gives insight in the interaction of different tissue types with the contrast agent. Radiologists face the difficulty of combining the information obtained by non invasive methods to reduce the need for biopsy and surgery. In this project methods to fuse X-ray and fMR images will be researched. Partner: Image Diagnost |
2005 | 2007 |
|
Mobile Augmented Reality Quest
(details) |
Using a PDA based digital tour guide, visitors to Technisches Museum Wien and Landesmuseum Kaernten, Klagenfurt, are engaged in an entertaining and educational quest that uses Augmented Reality presentation as a medium. |
2005 | 2007 |
|
ARIS*ER - Augmented Reality in Surgery
(details) |
ARIS*ER is a Marie Curie Research Training Network addressing a cross-disciplinary area, critically in need of developing researchers with the understanding of the cross-cutting issues involved in developing tools for Minimally Invasive Therapy. The multidisciplinary consortium, aims at providing training for young researchers through a structured training and knowledge transfer program – that will provide Europe with human resources with knowledge that will lead to better healthcare to citizens in Europe. The particular focus of the research training and joint project will be software that is user-friendly, fast and reliable for the practitioners of minimal invasive therapy (MIT). MIT is a compound of minimally invasive surgery, image guided surgery and interventional radiology. MIT makes use of numerous sources of information including multi-modal images and patient information systems. Intelligent processing of this information comprises a vast pool of knowledge that can aid the operator in his decisions. The ultimate goal of the joint project, which will give the recruited researcher training-by-doing, is to create an Augmented Reality system for interactive image guided therapy providing the clinical user with a new generation of decision support tools. This system will integrate intra-operative and pre-operative image-information and enable the user to see beyond the organ surface to inner structures and pathology. An intuitive human computer interface consisting of 3D display systems, haptics and robotics will hide the underpinning complexity of the decision support tools. Demonstrators will be made aiming at providing a seamless workflow for the clinical user conducting image-guided therapy. |
2005 | 2009 |
|
Urban Crossroads Traffic Surveillance
(details) |
Automatic detection and tracking of vehicles entering a crossroads during red-light phase. Traffic-light regulated crossroads in urban regions are monitored. Vehicles entering the crossroads while the traffic light is red are detected and a video sequence is stored for further prosecution of the driver. |
2005 | 2006 |
|
Measuring Material Displacement on Steel Specimens During Strain Tests
(details) |
Deformation analysis of material surfaces is a crucial part of material testing and quality control. In this project a stereoscopic surface measurement system has been developed which allows to measure surface deformation during stress/strain tests over a wide range of fields of view. Stereo image pairs are acquired at specified time instances which makes it possible to synchronize the acquisition with the amount of stress applied. |
2005 | 2006 |
|
Vision Based Kinematic Calibration and Error Compensation of Articulated Robot Arms
(details) |
Development of a vision system for accurate calibration of the kinematic chain of an articulated robot arm. The absolute positioning error of articulated robot arms is typically by an order of ten higher than their repeatability error. Inaccurate blueprint kinematic models typically account for 90% of this discrepancy. In this work a calibration procedure is developed which calibrates the kinematic model of a robot arm using fixed stereo rig and a calibration target mounted on the robot hand. In a single calibration framework the following parameters are automatically determined:
The procedure is fully automatic and does not require expensive, precalibrated equipment. |
2005 | 2006 |
|
Vision Based Detection of Wrong-Way Drivers
(details) |
Vehicles driving the wrong way down a motorway represent a serious source of danger. In the year 2005 in Austria, 521 vehicles were counted driving in the wrong direction on motorways. 8 people died due to wrong-way driver accidents. Immediate detection of a vehicle driving in the wrong direction could help preventing serious accidents by warning the oncoming vehicles (via traffic telematic systems or radio announcements) and by alarming the police. To guarantee immediate detection, one would have to observe every access ramp and every place where a car could turn. Thus a low-cost monitoring system capable of detecting wrong-way drivers and forwarding an alarm to a central traffic control station is needed. Image based detection suffers from one main problem: In case a large vehicle (truck, bus etc.) throws a shadow to the opposite lane of a motorway, that shadow moves against the direction of traffic and may cause a false detection. With a monocular camera system (one single camera), this problem can only be eliminated by large computational costs. |
2005 | 2007 |
|
TRICam - Robust Object Detection and Tracking on an Embedded System
(details) |
Development of Computer Vision Algorithms for Robust Object Detection and Tracking on an Embedded DSP Platform. FREQUENTIS was working on an embedded platform called "TRICam". The platform is capable of acting as a high quality Motion-JPEG or MPEG4 Encoder or Decoder; furthermore, it can perform a set of selected image processing algorithms for object detection and tracking in real-time. The focus was mainly put on traffic monitoring, vehicle detection and tracking. The goal was to also deal with various algorithms for video surveillance in public places and public transport. |
2005 | 2007 |
|
VITUS2 - Video based Image analysis for Tunnel Safety
(details) |
The main aim of VITUS project is to build and implement a prototype for an automatic video image analysis system in order to increase safety in tunnel roads. A feasibility study about video image analysis in tunnels was carried out, and the implementation of the prototype and evaluation of the system is work on going. Experiments on real sequences using innovative image processing algorithms display promising results. |
2005 | 2006 |
|
MISTRAL - Measurable intelligent and secure semantic extraction and retrieval of multimedia data
(details) |
Multimedia data has a rich and complex structure in terms of inter- and intra-document references and can be an extremely valuable source of information. However, this potential is severely limited until and unless effective methods for semantic extraction and semantic-based cross-media exploration and retrieval can be devised. Today’s leading-edge techniques in this area are working well for low-level feature extraction (e.g. colour histograms), are focussing on narrow aspects of isolated collections of multimedia data, and are dealing only with single media types. MISTRAL follows the following lines of radically new research: MISTRAL will extract a large variety of semantically relevant metadata from one media type and integrate it closely with semantic concepts derived from other media types. Eventually, the results from this cross-media semantic integration will also be fed back to the semantic extraction processes of the different media types so as to enhance the quality of the results of these processes. MISTRAL will focus on most innovative, semantic-based cross-media exploration and retrieval techniques employing concepts at different semantic levels. MISTRAL addresses the specifics of multimedia data in the global, networked context employing semantic web technologies. The MISTRAL results for semantic-based multimedia retrieval will contribute to a significant improvement of today’s human-computer interaction in multimedia retrieval and exploration applications. New types of functionalities include but are not limited to:
|
2005 | 2007 |
|
Indoor Modelling and Tracking
(details) |
Precise indoor localisation is an important issue for Augmented Reality. The two major ingredients necessary to experiment with "location aware" Augmented Reality applications are
The localized device is often held by an individual, allowing applications to interact with the physical environment. Various Augmented Reality setups have been build during this project. Additionally work is shown to answer the question how to accurately and rapidliy survey an indoor environment. Next to manual strategies there automated aproaches of surveying are shown. Furthermore hybrid tracking approaches are demonstrated where heterogeneous tracking sensors are attached to handheld devices (UMPCs, micro PCs). In this context the sensor data is fused, which leads to a seamless tracking experience. |
2004 | 2008 |
|
Nonlinear Registration for Intra-Modality CT Applications
(details) |
IntroductionResearch interests in single-modality nonlinear registration include four different kinds of subproblems. Deformable registration of two or more CT lung data sets at different states in the breathing cycle going from Functional Residual Capacity (FRC, expiration) to Total Lung Capacity (TLC, inspiration) for modelling breathing motion and deriving lung ventilation. Deformable registration of a contrast-enhanced and a native CT lung data set for deriving lung perfusion. Deformable registration of contrast-enhanced and native CT liver data sets at one or several phases in the contrast-uptake cycle for liver perfusion. And finally, highly accurate partially rigid bone registration for head and neck CT-Angiography applications to extract bone structures from CTA images.
Deformable Lung RegistrationThe input for this task consists of native CT thorax scans at two or more different breathing states between Total Lung Capacity (TLC, inspiration) and Functional Residual Capacity (FRC, expiration). Deformable registration of distinct breathing states is a prerequisite for deriving ventilation information by simple subtraction of expiration from inspiration data or by fusion with special functional scans and it leads to models of breathing motion in the lung. For this purpose we have available high resolution sheep lung data at up to five distinct static breathing states and human lung data at inspiration/expiration. Another application of deformable lung registration is the fusion of native and contrast-enhanced CT lung data to show perfusion information again either by subtraction or by fusion with a special scan. A notion of vessel consistency should be included in the deformable registration, since it is important that the same amount of vessels is regarded before and after registration. Deformable Liver RegistrationSimilar to the lung registration, liver registration for perfusion measurements is a topic of interest. Contrast-enhancing techniques are used to get up to 8 liver images at different phases of the contrast uptake cycle. Each of these images has to be registered to a native scan to correct motions due to breathing. Afterwards subtraction techniques are used to derive the amount of perfusion in the liver. The setup of the registration algorithm is very similar to the lung registration problem. Partially Rigid Bone RegistrationThe intended application of rigid bone registration is a very accurate registration of bones from native and contrast-enhanced CT images of the head and the neck. In contrast-enhanced images vessels and bones have very similar intensities, such that simple segmentation algorithms like thresholding do not work which are frequently used for CTA image studies. The intended strategy for the removal of bone structures is to take a simple (threshold-based) bone segmentation taken from the native image and register it to the contrast-enhanced image. Registration is necessarysince small patient movements may occur (especially in the neck and shoulder area) between the acquisition of both kinds of images. Registration has to be very accurate in this area, since there are vessel structures that lie close to or inside the bone structures as well. It can be assumed that the bones themselves are rigid but the relative position of bones to each other may change. Pairs of bones should be registered rigidly but the relative bone movements are taken into account leading to a partially rigid registration scheme. |
2004 | 2006 |
|
Robotics and Computer Vision Laboratory
(details) |
The Robot Vision Laboratory was established in 2004 to provide a common platform for experiments, demos and prototyping in the field of computer vision. Among other things, the lab inventory contains a 6DOF articulated robot arm, a PeopleBot mobile robot platform, several imaging and illumination devices. |
2004 | 2010 |
|
Dust Filter Surface Analysis
(details) |
When a dust laden gas is sucked through a filter, the dust remains on its surface and forms a compact dust layer called filter cake. Periodically the filter cake is at least partially removed by inverse high pressure air pulses to allow continuous operation of the filter. Knowledge of the distribution of the filter cake on the filter surface at different stages of operation is decisive for filter operation. It is shown that the principle of calibrated shape from stereo with pattern projection for generating texture on the surface gives a robust 3D reconstruction of the filter surface when accessing through a glass window. Rigid registration of surface patches using landmark points, combined with an Iterative Closest Point Algorithm as a refinement procedure gives a continuous 3D model of the entire visible filter surface. Cake thickness is calculated by taking the height difference of two surface models, acquired before and after dust deposition. The challenging problem is to account for the non-rigid deformation of the filter cloth. A global deformation model is estimated using Thin Plate Spline Interpolation based on landmark points on the filter surface. |
2004 | 2005 |
|
Active Appearance Models in Quantitative Musculo Skeletal Radiology
(details) |
Rheumatoid Arthritis (RA) is an incurable disease leading to severe disabling mutilations of synovial joints. RA affects predominantly the peripheral joints of the appendicular skeleton. RA is with 17% one of the leading causes of disability among persons aged 15 years or older. The prevalence is 1-2%. A recent study estimated the total cost to the North American economy caused by arthritis and its related effects to be 64 billion. The accurate quantification of the progression of the disease is a decisive factor during its treatment. Until now mainly manual quantification procedures are utilized. They are time consuming and lack reproducibility as well as accuracy. Among others these restrictions have severe adverse effects to clinical trials and to continuous therapy of patients. We propose a computer based method that performs the quantification by means of automated image analysis and pattern recognition. The goal is to fully automatically identify the bones of the hand/wrist and extract exact quantitative information about the extent of the erosions caused by rheumatoid arthritis based on a radiograph. The following lines will be investigated during the project:
|
2004 | 2007 |
|
Image Description Toolbox (IDTB)
(details) |
(Matlab) - collection of some most promising methods and algorithms related to appearance based object recognition. |
2004 | 2006 |
|
MUSCLE Network of Excellence
(details) |
MUSCLE aims at creating and supporting a pan-European Network of Excellence to foster close collaboration between research groups in multimedia datamining on the one hand and machine learning on the other in order to make breakthrough progress towards the following objectives:
|
2004 | 2008 |
|
3D Paper Structure Analysis
(details) |
Most technological paper qualities are directly or indirectly influenced by the three-dimensional paper structure which means the spatial arrangement of fibres, fillers, pores and if necessary of the coating. In the area of the structure analysis, two main groups can be distinguished: destructive and non-destructive approaches. The paper structure analysis methods have to fulfil the following basic requirements. The quality of the 3D data must be high enough to make an exact delimitation between the relevant paper contents possible. Furthermore the spatial resolution must at least lie in the area of a micron. In order to be able to make statistically reasonable statements for the complete paper sample, sizes of the range of at least one square centimetre must be analyzed. To conclude, the time expenditure to the digitalization of the paper sample should also lie within a practicable area of hours. Primary objective of the project is the detailed analysis of the fibre network structure of a paper sample by means of image analysis methods. Still this application is an extremely complex problem and in this resolution (microns) and sample size (square centimetre) unsolved. Therefore the development of a completely new concept is necessary. This concept should finally make a general assignment of the three-dimensional paper structure to pre-defined paper-ingredients classes possible. As a final result the segmentation of single fibres and their three-dimensional tracking is realizable. |
2003 | 2008 |
|
TheoMedia - Theology in Media Society
(details) |
In this project textual, formal and structural interactions between fundamental theology and media society will be researched. The most important facts which influence the media society of the western world are internet, multi media lifestyle and religious symbolism. This project which is carried out together with the Institute for Fundamental Theology, Graz University and Joanneum Research Graz. The Institute for Computer Graphic and Vision mainly deals with the semi automatically retrieving of religious symbols. Religious symbols were in former times mainly used only in liturgical events. Today they are media effective prepared and presented. Examples are the presentation in TV of the war "good against bad" after 11th September or the Star Wars Trilogy. In the project a semi automatic digital film footage structuring and analysis according to religious symbols should be implemented. Joanneum Research will provide its Content Analysis Module, the search infrastructure and already existing annotation tools. |
2003 | 2006 |
|
Computer Vision Methods for the Automatic Analysis of Fibrous Structures in Biological Soft
(details) |
Soft tissue like tendons, arteries, veins or skins are important biological materials. A greater understanding of the foundations and interactions of structure and function of soft tissue, and, in particular, the associated mechanobiology is of great interest in the field. A thorough understanding of the complex interrelations between mechanical factors and the associated biological responses may help to improve diagnostics which allow disease and injury to be treated earlier. The research proposed here will develop a fully automatic system for analyzing macroscopic structures obtained from histological images of arteries by means of modern computer vision techniques. Besides being interesting from the mechanobiological point of view the structural analysis of images of collagen fibers poses also several challenging questions from a computer vision point of view. In particular, due to the wide variety of different appearances of collagen fibers in images this task is non trivial. The main task of this research is the development of novel segmentation techniques for robustly segmenting individual fibril bundles and estimating their parameters, like location and shape, fibril density, mean fibril orientation, wriggling of fibrils etc. This will be achieved by developing novel perceptual grouping methods operating on the extracted orientation data of fibrils. Another major challenge of this research is to extend the structural analysis from 2D to 3D. |
2003 | 2005 |
|
Plug and Detect (PnD)
(details) |
Plug and Detect (PnD) is a technology to automatically calibrate and register a network of surveillance cameras in an on-line manner on a common ground plane. It could become a key technology in visual surveillance systems, because it tries to ...
|
2003 | 2007 |
|
Screening and Selection Systems for Directed Evolution of Enzymes
(details) |
Current paper and corresponding presentation The concept of "directed evolution" is dependent on the availability of systems that allow the identification of interesting enzyme variants within a large, artificially generated diversity. Thus, one prerequisite is the availability of systems that allow the detection of specific enzyme features such as activity, selectivity, stability etc. at smallest culture scales and at high-throughput conditions. Therefore, focus is put on the development of methods witch are rapid, sensitive and cost-effective and preferably work at the microwell-plate, single colony or even single cell level. Surrogate substrate analogues which would allow easy detection of reaction products by e.g. fluorescence techniques are not feasible due to the "First Law of Directed Evolution" which says "you get what you screen for". The "real" substrates should be converted or at least derivatives that are very close to these substrates have to be used in such systems. Therefore, general analytical methods which allow following the enzyme-catalyzed reaction of any desired substrate are developed. Another important prerequisite is the availability of methods that allow a high degree of automation. Such methods include high throughput detection systems based on image analysis and software for accurately recognizing hits. In addition, statistical methods and systems for data management are developed to properly set up and evaluate the results of screening programs and to handle large data volumes. |
2003 | 2006 |
|
FSP/JRP Cognitive Vision
(details) |
We envision a scenario in which every person will interact in a natural way with artificial devices as an aid in daily life situations such as orientation, search and information retrieval. We refer to this long-term vision as the Personal Assistance (PA) scenario, where a combination of mobile devices and distributed ambient spaces unobtrusively support users by being aware of the present situation and by responding to user requests. Subprojects at ICG: |
2003 | 2009 |
|
CONEX
(details) |
Robust and Adaptive Approaches to Scene and Object Recognition: The goal of this joint project is to investigate new robust and adaptive approaches in the area of object and scene recognition. Object and scene recognition is a necessary requirement for developing truly cognitive systems as well as for the development of advanced and novel multimodal interfaces leading to ambient intelligence. Having a robust object and scene recognition system the following applications will greatly benefit: novel user interfaces which understand human activities, intelligent surveillance, indexing multi-media databases and content analysis of images, autonomous mobile systems and robotics, industrial inspection and robotics, etc. The goal is to develop computer vision based systems that can recognize objects, and in the context of environment perform localization and navigation. The major challenge is to develop systems and methods that can work under realistic unconstrained conditions (i.e., outside the lab). The three partners proposing this project (Center for Machine Perception, Czech Technical University Prague, CMP, Computer Vision Lab, Faculty of Computer and Information Science, University of Ljubljana, CVL, and Institute for Computer Graphics and Vision, Graz University of Technology ICG) have considerable expertise in this area and developed complementary methods and techniques. The goal of the project is to join the efforts and combine the expertise. In particular, we do the following activities:
|
2003 | 2005 |
|
Handheld Augmented Reality on Mobile Phones
(details) |
Mobile systems are one of the latest hot topics in augmented reality research. There has been a lot of success in creating mobile setups using mobile personal computer hardware such as notebooks. The advantage of those approaches is that mostly the same hard- and software can be used as for traditional setups. The disadvantage is that those setups are not mobile in a sense that a user would like to use them. In the Handheld AR project we focus on realising augmented reality on off-the-shelf mobile devices. |
2002 | 2008 |
|
Entrance Surveillance
(details) |
The aim of this project is the development of an imaging system for an industrial partner. The developed system should survey entrances using video images and register the people passing the entrance. The system has to operate under outdoor conditions (sun light, fog, etc.). |
2002 | 2002 |
|
Autonomous Agents in Augmented Reality Environments
(details) |
Body and facial gestures as well as speech are familiar and widely accepted means of human communication. Animated characters, often with autonomous and affective behavior, have proved to be very useful in man-machine communication since they are able to exploit and deliver information through multimodal channels and thus engage the user in a natural conversation. Autonomous agents have been actively researched in recent years as an interface to computerized systems bridging the communication gap between man and computer, and the real and virtual world. Augmented Reality (AR) applications share the same goal through enhancement of the real environment with useful virtual information, where virtual objects appear to coexist with the real world. |
2002 | 2006 |
|
START Augmented Reality for Pervasive Computing
(details) |
The ultimate limit of today‘s user interfaces lies in the used two-dimensional abstractions that are only effective for their original domain of document-centric work. In contrast, in our everyday world we are not limited to our desktop surface. Information in the real world is perceived and processed in three dimensions, continuously and in real time. A human-computer interface that can capture these properties will be able to render a new level of services to the user, enabling the use of computers for new application domains and for new user populations. This „anywhere“ and „anytime“ requirement for pervasive computing cannot be fulfilled with miniature versions of desktop environments. A new style of user interface, a paradigm shift is needed. Therefore, the core of the proposed research is the following thesis: Augmented reality (AR), i. e., enhancing a user‘s perception of the real world with computer generated graphics and annotations, can make working with computers in 3D as productive as the desktop metaphor in 2D. This thesis is motivated by the fact that AR allows to integrate the whole world into the interface – the world essentially becomes the interface. Therefore users are able to leave their physical desktops and computer desktops to interact with their environment and with other users. The AR platform Studierstube lead by the proposer is world-wide unique in its combination of augmented reality, 3D display and groupware elements. Studierstube, the study room where Faust was searching for enlightenment, describes the philosphy of using the place as a mediator to information and insight. We are currently developing a wearable augmented reality system, which allows the user interface to be in any place, with and for anybody. Within the proposed augmented reality/pervasive computing infrastructure, an environment can be turned into a virtual „ether“ encompassing users that are enabled to interact with the computer through realworld objects. The proposed project work will expand the augmented reality platform Studierstube into a pervasive computing environment built on a variety emerging technologies. A number of promising application areas is selected, for which applications will be developed that try out the new style of interfacing with the computer in practice. Influence |
2002 | 2009 |
|
Ubiquitous Tracking
(details) |
Mixed Reality (MR) provides a natural interface to the calm pervasive technology anticipated in large-scale Ubiquitous Computing environments. However, the range of classic AR applications has been limited by the scope, range and cost of sensors used for tracking. Hybrid tracking approaches can go some way to extending this range. An approach, called Ubiquitous Tracking or Ubitrack, is proposed in which data from widespread and diverse heterogeneous tracking sensors is automatically and dynamically fused, and then transparently provided to applications. Follow this link for a fuller description. Developed as part of the PRESENCCIA Integrated Project. |
2002 | 2006 |
|
Uncalibrated Euclidean Scene Reconstruction in Scanning Electron Microscopy Using the Trifocal Tensor
(details) |
The scanning electron microscope (SEM) is an important tool to examine very small structures. Its large magnification combined with good contrast and large depth of view make it possible to view and characterize microscopic structures in the sub-micron scale. In the recent years, the problem of dense surface reconstruction from multiple SEM images was a research topic on this institute. Reconstruction approaches like shape from stereo and shape from photometric stereo have been evaluated. This work presents a framework for automatic scene reconstruction from three images acquired by a scanning electron microscope. The basic assumption is that the specimen is tilted eucentrically in front of the camera, camera geometry is assumed to be unknown but constant over all views. It is shown that methods for estimating the trifocal tensor as well as modern auto-calibration approaches can be adapted to the imaging conditions in the SEM, and Euclidean scene structure can be retrieved from three uncalibrated views. The performance of the proposed framework is evaluated on synthetic data as well as real images. It is shown that Euclidean scene structure can be retrieved robustly under varying image noise and inaccurate initialization. |
2002 | 2003 |
|
Large Format Digital Camera
(details) |
Vexcel Imaging Austria has invented a novel concept for large format digital cameras and has submitted the innovation for patent protection. An instantaneous exposure with for example 10K x 10K color pixels can be obtained with the new technology at data rates of 2 images per second (producing a data rate in excess of 2 gigabits per second). A number of scientifically interesting topics exists, for example in cases where such a camera is put into a moving platform such as a satellite, an airplane or a car. Vexcel Imaging is sponsoring a research project at the Institute to study some of these questions, addressing high precision image matching and mosaicking, the creation of color images from high-resolution black&white images in combination with lower resolution rgb-images, removal of motion blur, strategies for on-line data compression to reduce data rates, imaging strategies, the study of sample applications for such on-line large image data sets etc. Partner: |
2001 | 2003 |
|
Virtual Liver Surgery Planning
(details) |
Resection is the treatment of choice for patients suffering from liver tumors. Knowledge about involved liver segments, tumor size and topographic relationship of the tumor to vessels is needed for the decision if sufficient liver function capacity is guaranteed after a resection and for detailed planning of a possible resection. The main information sources are cross sectional imaging modalities like CT which deliver 2D images. The radiologist has thus to put all the information of the cross sectional images together in order to provide the surgeons with the needed information about the 3D topology. This process is difficult, tedious and time consuming. Combining the methods of medical computer vision and graphics a liver surgery planning system can be developed that enables a better overview and thus helps unfolding the full potential of surgical methods. Available approaches show that a number of improvements, specially on the fields of automation of segmentation and user friendly visualization are necessary to attain clinical applicability and gain the full acceptance by radiologists and surgeons. The goal of this research project is to develop an experimental environment for the staging of liver operations. Special efforts will be put on two issues. First, a fully automated segmentation of the liver, its vessels and tumors will be studied. Second, the environment for the interactive, cooperative visualization of the medical sensor data, the extracted anatomical structures, and for the use of tools to assess the best surgical approach will be developed and assessed. After development, the approaches for segmentation/partitioning, visualization and interactive resection will undergo a careful validation procedure. |
2001 | 2003 |
|
Visual Pose Determination by a Robot in six degrees of freedom
(details) |
We propose to develop a robotic system for the exact pose estimation of rigid objects in six degrees of freedom (DoF). Our approach is based on the following scenario: Given be a set of unordered objects in arbitrary position which come from a small variety (<10) of different types as may happen on a conveyor belt during an assembly process. We assume the objects to be mostly un-occluded and their CAD-model given. Our goal is to pick with a robot a specific object in such a way that it has a defined position. The robot is equipped with multiple black and white cameras (e.g., 2-3). In order to achieve maximum generality we do not intend to use range information nor calculate a 3D-representation from stereo images. This task requires a solution for the following sub-problems: Recognition and initial pose estimation, which will be approached using robust appearance-based recognition methods; Pose refinement will be handled by model-based methods; Movement planning to enhance the accuracy of the pose estimation. Partnerlist: |
2001 | 2002 |
|
Sorting and packaging of apples by means of digital image processing
(details) |
In the process of packaging high quality apples the proper placement of the fruit in the tray is critical (i.e. the most appealing side should face towards the customer, all stalks should face into the same direction). Currently this sorting is done manually, at an additional cost of about 36 Euro (500 Schillings) per ton of apples. The automation of this process should result in a significant cost reduction. The first step of the packaging process rotates the apple so that its symmetry axis is vertical with either stalk or bloom facing upwards. This is done mechanically. The next step determines the position of the most appealing side of the apple (i.e. the horizontal angle) and decides whether stalk or bloom is facing upwards (the vertical angle). This is done by digital image processing. The apple is rotated by 360 degrees. A digital camera captures a sequence of images at different orientations and the image processing software measures fruit quality and vertical orientation in real-time. In the last step, a robot brings the apple to its ideal position and puts it into the tray. Partnerlist: Schuster Messtechnik |
2001 | 2002 |
|
Mobile Robotics
(details) |
Research topics on mobile robotics: -Visual localisation and map building -Usage of local visual landmarks -Localisation using omnidirectional camera |
2001 | 2005 |
|
Route Tracer
(details) |
During an exhibition time of 5 months the whole internet as seen from one internet node should be visualized. The visitors can interact with the scene in the following ways: free 3D navigation, guided navigation (following the node links) and virtually send email (the user will fly behind an email package following a route from sender node to the recipient node). Since the number of expected nodes exceeds the limit of rendering performance multiresolution analysis must be employed. A spatially immersive display is used to let the visitors experience the wideness of the internet. |
2000 | 2000 |
|
EXPO 2000
(details) |
During the EXPO 2000 international exhibition in Hannover environmental topics related to the city of Graz are demonstrated. The visitors can use one of three gaming sites to explore the Graz efforts for saving energy and preventing the pollution of the environment.
|
2000 | 2000 |
|
3-Dimensional Measurement and Virtual Reconstruction of Ancient Lost Worlds of Europe (MURALE)
(details) |
Tools will be developed for the reconstruction of a currently excavated archaeological site for different periods throughout the time of its occupation. This includes the development of 3D acquisition systems that can measure a range of objects of different dimensions to produce accurate and realistic looking 3D models and of tools for virtual reconstruction, documentation and classification of objects ranging from pottery sherds to entire urban building complexes. The project will provide archaeologists with the tools to support the analysis and restoration of their finds. The virtual artefacts and buildings together with the related information will be stored in a multimedia database with appropriate searching and querying functionality. 3D visualisation technologies will be developed for the interactive presentation of archaeological results in museums and via the internet, to the broad public as well as to experts. To provide Europe with an archaeological lead. Partner: Departement Elektrotechniek ESAT-PSI, Katholieke Universiteit Leuven |
2000 | 2003 |
|
Virtual Liver Surgery Planning
(details) |
The liver surgery planning system (LSPS) developed at our institute is addressing obstacles associated with surgical planning of planning liver tumor resections based on CT image data. The main goal is to support radiologist's during data preparation and to give surgeons precise information for making optimal decisions. This was achieved by combining the fields of medical image analysis and computer graphics, which allows for innovative problem solutions, especially when user interaction with complex 3D objects is needed. |
2000 | 2003 |
|
Radargrammetry
(details) |
Building reconstruction, IFSAR, man-made objects, multiple view fusion","SAR imagery has begun to get consideration as a source for three-dimensional building models. High resolutions at pixel sizes of 30cm to 10cm and single pass interferometry support the geometric reconstruction of various small man-made objects. Our work aims at the detection and reconstruction of the structure of building ensembles from high resolution inerferometric and slant range SAR data. To fully automate the building reconstruction procedure we use a phenomenological approach, using layover and shadow boundaries together with edge information and intelligent combinations of measurements from all IFSAR data sources available. |
1999 | 2002 |
|
CARO - CEO (Center for Earth Observation) Austrian Relay Office
(details) |
This project is to exploit impulses originating from the European Comission's project "Center for Earth Observation" (CEO) for applications in Austria. One component of activities requiredto achieve that is -Installation of a "CEO& Middleware Node"& (MWND) at Austrian Research Centers Seibersdorf ARC. A MWND is a mirror-site, fully synchronised with the CEO's operational system "Information about Earth Observation" (INFEO). Thus, users in Austria (and in neighbouring countries) are offered the full amount of functions and data of INFEO under optimal network performance. According to the project name "CARO - CEO& Austrian Relay Office", a number of additional activities is planned, mainly to be carried out at the project leading Institute for Computer Graphics and Vision ICG at TU& Graz: -Infromation and education of users on the possibility employing remote sensing data and networking technology in their work environment. -Developing data catalouges and software-tools by exploiting latest developments of software-technology (WWW/HTML, Java, etc.). -Populating the CEO's system INFEO with information related to Austria (or institutes, projects, datasets etc.). -Participating in international consortia to generate additional services and databanks relevant for CEO (e.g. literature inventories). -Consulting research institutes, companies and governmental bodies on planning multinational cooperative projects, also including organisations in the neighbouring EU application states. Partnerlist: Austrian Research Centers Seibersdorf ARC |
1999 | 2002 |
|
Application of Image Processing in Materials Science
(details) |
The purpose of the project is to develop a powerful tool for the investigation of the deformation and fracture behaviour of materials. It is an interdisciplinary project combining the fields of materials science and computer science. The aim of the project related to computer science, more precisely to computer vision is: To develop a system for the automatic reconstruction of surfaces from scanning electron stereophotograms, integrating various reconstruction techniques (shape from stereo, photometric stereo etc.). The system shall be especially appropriate for analyzing fracture surfaces and for recording the local deformation fields during in-situ loading experiments in the scanning electron microscope.
|
1998 | 2000 |
|
Golden Eye: Real Time Black Jack Analysis
(details) |
1998 | 2000 | |
|
Improvement of acquisition methods for geotechnical data and on site application of numerical simulation methods.
(details) |
Um die Durchführung von numerischen Berechnungen basierend auf objektivierten, vollständigen und problemorientierten Daten zu ermöglichen, ist die Entwicklung von verbesserten Dokumentationsmethoden erforderlich. Diese sollen durch dreidimensionale Modellierung und Darstellung des Gebirges erreicht werden. Mittels stereographischer Aufnahmemethoden sowie dem Einsatz von entsprechenden Datenbanksystemen sollen die für die Erstellung von numerischen Modellen zur Beschreibung der angetroffenen geologisch-geotechnischen Verhältnisse erforderlichen Daten erfaßt, verwaltet und für weitere Bearbeitungen zur Verfügung gestellt werden. Partnerlist: Institut für Felsmechanik und Tunnelbau (220) |
1997 | 1998 |
|
Numerical Simulation in Tunnelling
(details) |
The Austrian Science Fund established in 1997 the FSP "Numerical Simulation in Tunnelling". The FSP consists of 7 Projects and involves 8 Austrian university institutes at Graz, Innsbruck and Vienna. Univ.-Prof. Dipl.-Ing. Dr. techn. Gernot Beer (Head of the Institute for Structural Analysis at TU Graz ) has taken over the coordination of the FSP. |
1997 | 1998 |
|
Remote Sensing Information System DIVA
(details) |
The Digital Interactive Venus Atlas (DIVA) is a remote sensing information system for a catalog of SAR radar image data from the Magellan mission to the planet Venus (1990--1994). The system consists of a Java based user interface accessible via the WWW, an image pyramid covering almost the entire Venusian surface, a remote interface to a data archive and a metadata catalog to retrieve quicklooks and information about available data sets.
|
1997 | 1998 |
|
Visualization Methods for Volumetric Medical Datasets
(details) |
Modern radiological measurement techniques like CT, MR, US or PET offers the possibility to obtain 3D-information of the interiour of the human body without invasive operation. The drawback of these measurement techniques is the difficulty for the user to visualize and navigate through the gathered volumetric datasets. We work on three main aspects to get an comprehensive and handy toolkit for physicians to make their work as easy as possible. These aspects are an efficient tool for selecting and handle 3D datasets (like DICOM image series). Next is an automatic navigation routine to let someone fly through organs. And the third is a comparison of different volume visualization techniques on their achieved quality and performance. Partnerlist: Department for Radiology |
1997 | 2000 |
|
Studierstube - Augmented Reality Framework
(details) |
In our everyday world we are not limited to our computer desktop. Information in the real world is perceived and processed in three dimensions, continuously and in real time. A human-computer interface that can capture these properties will be able to deliver new computer applications and services anywhere and anytime. Augmented reality (AR), which enhances a user's perception of the real world with computer generated information, can turn the everyday world into a user interface for ubiquitous computing applications. STUDIERSTUBE is a leading framework for the development of mobile, collaborative and ubiquitous AR applications. |
1997 | 2006 |
|
Interactive operation planning at the face skull model
(details) |
For the planning of an operation of heavy deformations in the mouth-, jawbone- and face area the use of a model of the patient skull as accurate as possible is almost indispensable. The surgeon can refine the diagnosis with this model and plan an operation within the surgical therapy based on the model (model operation) for detecting possible problems.
|
1997 | 1998 |
|
Integrated image analysis system for the early recognition of malignant melanoma
(details) |
The project aims at the creation of a computer image analysis workstation for clinical application. The workstation will receive Epiluminescence Microscopy digital images of pigmented skin lesions and information pertaining to a patient. This input will be converted into a proposed diagnosis of a lesion as malignant, benign or dysplastic, with an associated error probability. |
1997 | 1998 |
|
GIPLib - General Image Processing Library
(details) |
Giplib is an image processing library in C++. It has been developed at our institute and has been used for several projects as well as for education. Giplib is under continuous development. While the functionality is wide-spreaded its strength is in stereo reconstruction. |
1997 | 2006 |
|
Integrating Shape from Stereo and Physics Based Vision for Uncalibrated Surface Reconstruction
(details) |
Uncalibrated stereo, or more precisely the recovery of the epipolar geometry and the projective structure of a real world object purely from the knowledge of homologue points, is an active field of research. The application is mainly found in robot vision as one has to deal with a permanently changing image acquiring situation not allowing an absolute calibration. The aim of the project is to apply this technique for stationary cameras for which an absolute calibration is hard to perform or even impossible (for example a scanning electron microscope). The core of the system is the integration of physics based vision and shape from stereo in the uncalibrated case. For this purpose robust shape from stereo techniques are developed. The physics based vision algorithms are especially adapted to the situation in the scanning electron microscope. The project is strongly related to the project "Application of Image Processing in Materials Science" |
1996 | 1998 |
|
Optical-Radar sensor Fusion for Environmental Applications - ORFEAS
(details) |
Exploiting data from cameras or scanners on satellites already is a well-established methodology. Compared, because of the much higher complexity of the imaging process, methods for processing data from imaging Radar sensors (SAR) are less far developed.
|
1996 | 1997 |
|
Multimedia-Methods for Application of Multisensor Remote Sensing Data for Environmental Tasks
(details) |
Interactive multimedia, based on computer networks, may be of great benefit for many disciplines. That is in specific true for remote sensing (earth observation), where daily a large amount of image-data is collected over the entire globe. By multimedia, the most recent satellite data and associated meta-information is available to every potential user.
|
1996 | 1997 |
|
Science on Stage
(details) |
This article describes a project, which was carried out in the course of a scientific television show called "Science on Stage". Certain parts of the show featured inline-skaters, whose movements were tracked with techniques based on optical beacon tracking and visualised on a big video projection screen. By tracking one marker per skater movements were tracked in 3 degrees of freedom. These data values were converted into so-called "speed"- and "height" estimates which in turn were used to control a predefined "virtual-city" fly-through. In this way it was possible for the audience to recognise a direct relationship between the movements of the inline-skaters and the projected fly-through seen on the screen. |
1996 | 1996 |
|
Vision based robot navigation research network
(details) |
The goal of the VIRGO network is to coordinate European research and postgraduate training activities that address the development of intelligent robotic systems able to navigate in unknown and possibly dynamic environments. This goal will be achieved through a framework which enhances RTD activities in European laboratories, already established in the aforementioned scientific area. Specifically alternative environment representations based on visual information will be studied and methods to process these representations and use them to control the motion of a robot will be developed. Environment learning issues will also be considered, aiming at autonomous acquisition of the discriminating environment features. The task of ICG in the network is to coordinate research in Fusion. Partnerlist: List of partners maintained by project coordinator |
1996 | 1999 |
|
Morphos - Development of a measuring system for verifying existing models from combustion engine industry
(details) |
An apparatus for generating high resolution digital images using a translationally moved CCD-line-camera was realized. With it, objects which have been given a special texture can be imaged comfortably. By tilting the camera and the different perspectives that result from this, it is possible to perform a 3D reconstruction of surface points. The measured object can be turned automatically in the scanning space, which makes it possible to get views from all sides. This requires an exact registration of the individual views in relation to each other. A surface description which clearly reproduces the object is derived from the 3D point cloud that results from the recorded views just mentioned. The accuracy reached is 0.1 mm for the RMS-error in space and 0.1( for the determination of the object rotation. The scanning volume is 500(150(150 mm3. This results in a relative accuracy of 1:5000. Clearly, the requirement of 1:1000 has been reached. The complete reconstruction of a complex object such as e.g. a water cooling jacket with a length of 500 mm takes about one week, where one day of user interaction is required for making the recordings and about one hour is needed for the preparation of the reconstructed point data for the surface reconstruction. What remained unsolved in this project is the problem of generating a fully closed surface for very complex objects without a significant amount of user interaction. In this area there is room for further development.
|
1996 | 1997 |
|
Flexible Assembling Unit for Industrial Applications
(details) |
The series production of parts with a small to medium number of units requires a fast and inexpensive adaption to new parts. A pure hardware solution, as it is common for example in the motor industry, is contrary to the above requirement. The goal of this project is to develop a vision driven, flexible and automatic assembling unit. In order to solve this demanding task, the separation into three, from the viewpoint of vision almost independent steps is proposed. The first step is to isolate one part from the pile. This task is generally referred as binpicking. Where the task of step one was to detect one plane among many parts of one class the task now is to determine the exact position of one known part (step 2). And the third step is the surveillance of the assembling itself. |
1996 | 1996 |
|
RADIANCE Site at ICG
(details) |
The aim was to visualize an office room (situated at the ICG) as realistic as possible.
|
1995 | 1998 |
|
Parallel and Distributed Processing Strategies for Large Image Data Sets
(details) |
Large image data sets are typical for remote sensing. A unique such dataset was generated by the radar sensor of NASA's Magellan probe covering planet Venus. Processing this data set is challenging because of its size, algorithmic complexity and a world-wide demand for accessing the data. We design and prototype a system where users submit tasks within a GUI based entirely on common Internet services. Then, the broker queries the Distributed Method Base and schedules the jobs across available back-end computers minimizing a cost function. The back-end may be a conglomerate of computers of any size, accessed through the queuing system DQS and connected to the broker via CORBA. Our design is fairly general and may be applied to any other tasks of managing large data sets. Partnerlist: |
1995 | 1997 |
|
Measurements on Surface Properties for Realistic Image Synthesis
(details) |
For many years now computer scientists try to make objects look more realistic on computer displays. Realism here means to generate images on a computer which are indistinguishable from real photos. This realism is possible through the use of local and global illumination methods. The basis for a physically correct calculation of the illumination depends on the accuracy of the scene description. This means both the geometry as well as the description of the surfaces and light sources. Aim of this project is to find an accurate and efficient way to measure and define material or surface properies for generating realistic images. Applications are in the fields of architecture, lighting design and optimization, visual shape and fine arts. |
1995 | 1997 |
|
Radar Interferometry
(details) |
Radar interferometry is capable of yielding extremely high accuracies in observing horizontal and vertical motion even from space platforms, since it is capable of measurements in the sub-wavelength domain (at the cm to mm-level). This information is collected over the entire image, so that also the actual spatial distribution of height differences or horizontal motion is being observed. A radar system uses bursts of microwave energy to scan the terrain on the ground and produces a backscatter image of the reflected microwave energy. In so called "repeat-pass" radar interferometry, two takes obtained from orbits close together are required. The received signals of a certain location in both images have different phases. This is used to compute the direction towards the object point (or its velocity in between). Most crucial step of the required processing chain is the phase unwrapping which is to solve the ambiguity of the differential phase numbers. This exploitation is based on certain a priori assumptions and stochastic processing. |
1994 | 1998 |
|
Information Fusion in Image Understanding 2
(details) |
This project has been the main support of research into theory and foundations of image understanding at ICG since 1994. The main goal of is research in theory and methodology of selection and combination of visual information in image understanding. Information fusion has become a crucial task in sensor fusion, as well as in all areas of computer vision, where active systems are employed and continuously producing ambiguous, imprecise, incomplete, sometimes even contradictory visual information. Within this research project, mathematical frameworks of probability theory, Dempster Shafer theory of evidence, and Fuzzy set theory have been investigated for their applicability in different cases of fusion. Current research in this project aims at generic active object recognition. |
1994 | 1998 |
|
Automated Melanoma Recognition
(details) |
During the last fifty years the occurrence of melanoma has increased dramatically. Whereas in the thirties of this century one out of 100 000 people living in the United States or Europe suffered from melanoma, this number has risen to 15 out of 100 000 nowadays and is still increasing. Because of the curability of melanoma by surgical excision in the early stages of tumor development, early tumor recognition is of utmost importance. During the last few years a significant improvement in early tumor recognition has been achieved by using the so-called epiluminescence microscopy (ELM). This technique uses oil immersion to render the epidermis translucent, thus giving insight into sub-surface structures of the skin which are not visible otherwise. This project deals with the computerized analysis of digitized ELM-images of pigmented skin lesions, the final goal being the automatic classification of the lesions as benign or malignant (i.e. melanoma). The analysis is carried out in several steps: The first step is the segmentation process, in which the skin lesion is extracted from the background in the ELM-image. In the next step, a set of parameters which contain information about the malignity of the skin lesion are determined. The choice of the appropriate parameters is based on information found in medical literature, as well as on personal discussions with experienced dermatologists. Finally, a classification based on the previously extracted parameters is carried out. The evaluated results show that the initial efforts already provide the correct result in 86% of all cases examined thus far. |
1994 | 1997 |
|
Indoor Room Imaging
(details) |
The conversion of objects into digital images is the topic of "Electronic Imaging". To get a digital image one needs an "electronic eye". Today solid-state electronic cameras are one of the physical realizations of an "electronic eye". Obvisously a solid-state camera which is connected to a computer must be digital, whereby the sensor itself is analog, but the camera is digital. It is producing digital numbers in the form of pixels. Design considerations for a solid-state sensing system require a knowledge of the component parts of the system and of their interaction to affect the entire system's performance. System component selection involves careful consideration of the specific application requirements and desired results. The process for selecting system components and evaluating their operation is not always intuitively understood. So it will be necessary to find rules for the selection process as well as a procedure for predicting and evaluating system needs. A solid-state image sensing system is the product of many interacting factors. The major ones to be considered are the illumination source, the optical characteristics of the image, the optics, the solid-state image sensor and the video processing circuitry. Each of these factors can be broken down into component parts. Exact considerations for the development of these component parts is part of the ongoing work at the Institute. |
1994 | 1996 |
|
Tools for Exploitation of Radar Image Geometry
(details) |
|||
|
Cybercity project
(details) |
|||
|
Stereoscopic 3-D Surface Reconstruction Using Scanning Electron Microscope Images
(details) |
In many applications of the scanning electron microscope a user needs to know the three dimensional shape of the examined object. The conventional way to obtain such a digital elevation model (DEM) is the manual evaluation of stereoscopic images by photogrammetric methods. To get around such a time consuming procedure a prototype system for automated digital surface reconstruction and analysis was developed in cooperation with the Erich-Schmid-Institut (Leoben). |
||
|
European Magellan Data Node
(details) |
|||
|
National Library
(details) |
As part of a coordinated CD-I and computer-based public representation of the Austrian National Library Vienna, we created a 3-dimensional phototextured model of the library's Great Hall. |
||
|
Design of a Hypermedia Data and a Method Data Base
(details) |
|||
|
Automated Geocoding of Satellite Radar Images
(details) |
|||
|
Automated 3D Reconstruction of Buildings from Aerial Images Using GIS Data
(details) |
In a CyberCity buildings are the most important part. In this case, they are fully phototextured and include parts such as roofs with chimneys and skylights, facades with balconies and bay- windows, etc. In the past roofs were modelled interactively on photogrammetric workstations or special programs were used with intervention of the human operator. The current research is oriented towards the automation of the data collection process so that large numbers of buildings of a city can efficiently be collected and regularly updated. |
||
|
CCD camera system for facade recording
(details) |
Imagine in a vehicle based scanner system producing image stripes of facades under motion distortion caused by the roughness of the street. |
||
|
3D Object Scanner
(details) |
The creation of three-dimensional digital models of complex real-world objects is the task of this project. These models represent documents which have to be readable by a CAD-system for industrial use. |
||
|
Modelling of a Room
(details) |
The aim was to visualize an office room (situated at the ICG) as realistic as possible. |
||
|
Comparing Computer Generated Images
(details) |
Algorithms and software from computer graphics for the photo-realistic rendering of complex scenes have reached a remarkable level of maturity over the last years. Little work has been done on evaluating the quality of such rendering algorithms on an objective quantitative or on a subjective qualitative basis. Such tests, however, would be very useful in order to rank rendering algorithms depending on the application domain and subsequently to automatically choose the algorithms most appropriate for the problem at hand. |
||
|
Concept for Realistic Image Sythesis
(details) |
In this project we describe the problems which arise in realistic renderings of existing objects on computers. The complete process from the generation of the scene to computer generated images will be explained. This involves the extraction of geometry, surface properties and illumination as well as the rendering using global illumination models and the tone reproduction in the displaying step on output devices. |
||
|
Analysis of Combustion Processes in Engines
(details) |
Using an endoscope for the optical access to the combustion chamber of an engine makes it possible to study injection and combustion phenomena without compromising engine operation conditions. |
||
|
Micro Image Understanding Environment MyUE
(details) |
Current approaches of image understanding and object recognition are generally limited to very restricted types of scenes, objects and scenarios. Thus, the resulting implemented techniques are limited and provide poor performances when they are tested outside their original context. The ambitious goal of the Micro Image Understanding Environment (µUE) project is to demonstrate that it is feasible to specify and create a small and useful image understanding/object recognition system. In order to avoid the common mistake of constantly reimplementing the same code, our system will try to optimize the research efforts productivity by massively relying on market products and image understanding research standards. That's why we think that the software development efforts will be limited. In this way our system can be considered as small. |
||
|
Mapping the Human Retina
(details) |
The purpose of this project is to successfully demonstrate the application of our general concept of information fusion in image understanding. The medical motivation lies in the extremely difficult diagnostic assessment of age-related macular degeneration (AMD). |
||
|
Information Fusion in Image Understanding
(details) |
The main goal of this project is to develop a new system for information fusion in image understanding which can deal with selection and combination of uncertain visual information. The system is active in the sense that at each step of processing it can select and request the most promising next source of information. Thus, it will be a powerful new mechanism for the efficient control of an image understanding system. |
||
|
Active Fuzzy Fusion
(details) |
Information fusion deals with the integration of information from several different sources. The necessity to fuse fuzzy information arises quite naturally in problems of image understanding because imprecision, uncertainty and ambiguity can be found at all levels, from the image itself to the results of high-level processing. In addition individual visual cues are often unreliable, even misleading. Thus integration of vision modules is necessary to obtain a reliable interpretation of complex images. |
||
|
Active Interpretation of Occlusion
(details) |
Occlusion plays an important role in recovering relevant structure from a dynamic scene. In the region near the edges it causes a change of visual information that is implicitly modeled by projective relations between two viewpoints in a stereo camera setting. In the case of uncalibrated images, a new contribution is to apply the framework of plane homographies in projective space under epipolar constraints to define a rule base for edge classification. |
||
|
Active Fusion Using Bayesian Networks
(details) |
|||
|
Scanner Evaluation System
(details) |
The goal of the project is to create a portable system for scanner evaluation based on test targets, algorithms and procedures which give us adequate tools for testing the performance of various types of scanners, but in particular high performance film scanners (e.g. photogrammetric scanners, graphic art scanners). This evaluation has to be done in an objective manner which is independent of the operator of the device. Different measurement and analysis methods by which specific elements of a scanners performance can be quantified have been found and implemented in our scanner evaluation system. The system gives us the possibility to perform automated but also interactive scanner evaluation. Issues of concern are geometric resolution, geometric accuracy, radiometric resolution and range, colour reproduction, scanning speed and sustained throughput. |
||
|
Active Inspection System for Steel Quality Control
(details) |
In the steel-producing industry it is very important to control the quality of steels. At the moment the estimation is mainly done by human inspection of specially prepared steel surfaces with the help of light-microscopes. Each particular steel specimen is classified by assigning it to a reference chart, corresponding to the micro-structure of interest. For example, in the case of carbide distribution the reference chart consists of 28 classes arranged in 4 rows of 7 columns according to the 4 types and 7 degrees of the carbide distributions. |
||
|
Active Fusion with Evidence Theory
(details) |
