04 Nov 2024
14:00  - 16:00

Europainstitut (Riehenstrasse 154), Pavillon 2, Seminarraum 00.002

Organizer:
RISE

Events, Workshop

RISE Crash Course: "Information Extraction from Images with AI"

In this two-hour course, you will learn how to use multimodal large language models (such as ChatGPT-4o, Gemini 1.5, or Claude Sonnet 3.5) to extract structured information from images.

[Translate to English:] Eine Karteikarte mit Text und der Text in strukturierter Form

In this two-hour, hybrid course, you will learn how multimodal large language models, such as ChatGPT-4o, Gemini 1.5, or Claude Sonnet 3.5, can be used to extract structured information directly from images. This approach eliminates the often necessary intermediate step of text recognition and transcription that is common in traditional methods (such as Transkribus).

Using concrete examples from ongoing research projects, the course will demonstrate the practical possibilities and limitations of this technology. It will also address the technical and methodological prerequisites required for successful implementation. Additionally, aspects of data quality, the FAIRness (Findability, Accessibility, Interoperability, Reusability) of the extracted data, as well as the associated costs, will be considered and reflected upon.

The course is aimed at researchers and students in the social sciences and humanities. The number of participants is limited to 15 people (on-site) per session. If needed, the course can be offered in English. Questions and contributions in English are, of course, always welcome regardless of the course language. 

Places will be allocated on a first-come, first-served basis. Register now.


Export event as iCal