Joost Geurts

 Research engineer
INRIA
  • CHORUS Chorus is a European research project about search engine technology in its broadest context. It aims at creating the conditions of mutual information and cross fertilisation between research effort.
  • VITALAS VITALAS is a european R&D project about novel search technologies which address specific demands of large scale professional audio-visual archives, such as broadcasters and press agencies.
    VITALAS is a use-case driven project that aims at providing a pre-industrial prototype system dedicated to intelligent access services to multimedia professional archives that would provide the consumer with new technological functionalities. The project seeks to make novel contributions in cross-media (audio/speech, video, image, text) indexing and content enrichment, and uses different interactive retrieval methods (query refinement with log files, RFB, context adaptation).
CWI (2002 - 2006)
  • Cuypers A document engineering framework
    Cuypers is a research prototype system, developed to experiment with the automatic generation of Web-based presentations as an interface to semi-structured multimedia databases. We developed three scenarios: ScalAR automatically generates multimedia documents that are adapted for a specific delivery context.The documents are generated based on a fixed set of relationships that are represented in an RDF graph that is stored in a Sesame repository. The relations are converted from a relational database provided by the Rijksmuseum, which the museum uses to populate a part of its website.
    The SEMINF demonstrator automatically infers semantic relationships between the query results based on the Dublin Core metadata that is associated with the media items in the archive. This metadata is made available through the Open Archives Initiative (OAI), which facilitates interoperability between digital archives. The inferred relationships are then used to automatically generate a multimedia document that conveys these relations to the user. The DISC use case generates complex multimedia biographies exploiting a large repository of sementic web data.
    Related research project : a Document Engineering Model and Processing Framework for Multimedia Documents
    Electronic documents are different from their traditional counterparts in the sense that they do not have an inherent physical representation. Document engineering uses this notion to automatically adapt the presentation of a document to the context in which it is presented. The document engineering paradigm is particularly well suited for textual documents. Nevertheless, the advantages of document engineering are also desirable for documents which are not based on text-flow, such as time based multimedia and other types of spatio-temporal constrained documents. Existing document engineering technology, however, implicitly assumes that documents are based on text-flow. Some of these assumptions conflict with spatio-temporal constrained documents, which explains why current document engineering tools do not work as well for such documents. In our research we make the underlying assumptions of text-flow based document engineering explicit and study the way these assumptions conflict with spatio-temporal constrained documents. We use this to define requirements for a document engineering model that is independent of implicit text-flow assumptions. The resulting model defines a source document as an explicit representation of the message intended by the author. The transformation rules exploit knowledge about domain, design and discourse in order to convey the intended message effectively and ensure that the result meets the requirements imposed by the presentation context. We have implemented this model by developing an architecture that integrates elements from web, document processing and knowledge intensive architectures.
  • Semantic Web version of the Media Streams cinematography ontology
    This work was done in cooperation with Prof. Marc Davis at the Garage Cinema Research group at the School of Information Management and Systems (SIMS) University of California, Berkeley USA.

    Media Streams is an intelligent annotation tool for digital video material developed by Marc Davis in the early nineties. Part of Media Streams is a proprietary cinematographic ontology embedded in the Media Streams application. Web technology was used to transform the existing video ontology into a commonly accessible format. Recombination of existing video material was then used as an example application, in which the video metadata enables the retrieval of video footage based on both content descriptions and cinematographic concepts, such as establishing and reaction shots.
  • Automatic inference of semantic relationships between Dublin Core annotated media item
    This work was done in cooperation with Prof. Jane Hunter and Suzanne Little at Distributed Systems Technology Centre (DSTC) Brisbane, Australia.

    We developed a search, retrieval and presentation system, which used Dublin Core metadata, describing mixed-media resources, to infer semantic relationships across multiple large archives. These semantic relationships were mapped to spatial temporal relations and conveyed using a multimedia presentation. Our underlying hypothesis was that by using automated computer processing of metadata to organize and combine semantically-related objects within multimedia presentations, the system may be able to generate new knowledge by exposing previously unrecognized connections. In addition, the use of multi-layered information-rich multimedia to present the results enables faster and easier information browsing, analysis, interpretation and deduction by the end-user.