{"id":53934,"date":"2026-06-03T10:36:23","date_gmt":"2026-06-03T08:36:23","guid":{"rendered":"https:\/\/www.schirn.de\/?page_id=53934"},"modified":"2026-06-08T14:33:13","modified_gmt":"2026-06-08T12:33:13","slug":"ai-glossar","status":"publish","type":"page","link":"https:\/\/www.schirn.de\/en\/ai-glossar\/","title":{"rendered":"AI Glossar"},"content":{"rendered":"<section class='wp-block-ho-pattern header-text toggle-optional-button' data-pattern='ho\/header-text-option-button'>\n\n<div  class='margin-bottom-7'>\n\n<div  class='container-fluid cf-gutter-5 cf-side-5'>\n\n<div  class='row'>\n\n<div  class='col col-xs-12 col-md-10'>\n\n<div  class='margin-bottom-5'>\n<h1 class='ho-sd-h1' style=\"white-space: normal;\">GLOSSARY<\/h1>\n<\/div>\n<\/div>\n<\/div>\n\n\n\n\n<div  class='row'>\n\n<div  class='col col-xs-12 col-md-6'>\n\n<div class=\"wp-block-ho-div\">\t<p class='ho-sd-h3'>Key terms and background information on the exhibition THE WORLD THROUGH AI<\/p>\n<\/div>\n<\/div>\n<\/div>\n\n\n\n\n<div  class='row padding-top-4 ho-pattern-optional-button'>\n\n<div  class='col col-xs-12'>\n\n<div  class='wp-block-ho-buttons '>\n<div class=\"wp-block-ho-buttons-item\">\n\t<a href=\"\"  target=\"\" role=\"button\" aria-pressed=\"false\" rel=\"nofollow\" class=\"ho-button is-primary\">\n\t\t\t\t<span class=\"ho-button-label \">Lorem Ipsum<\/span>\n\t<\/a>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/section>\n\n<section class='wp-block-ho-pattern toggle-optional-headline toggle-optional-button' data-pattern='ho\/text-option-headline-button'>\n\n<div  class='margin-bottom-7'>\n\n<div  class='container-fluid cf-gutter-5 cf-side-5'>\n\n<div  class='row ho-pattern-optional-headline'>\n\n<div  class='col col-md-6 col-xs-12'>\n\n<div  class='padding-bottom-4'>\n<h2 class='ho-sd-h3' style=\"white-space: normal;\">Lorem ipsum<\/h2>\n<\/div>\n<\/div>\n<\/div>\n\n\n\n\n<div  class='row'>\n\n<div  class='col col-md-6 col-xs-12'>\n\n<div class=\"wp-block-ho-div\">\t<p class='ho-sd-body'><strong>AI INFRASTRUCTURES<\/strong><br>The vast material foundation that makes artificial intelligence possible: data centers humming with servers, undersea cables, electrical grids, cooling systems, and the mines supplying rare minerals for chips. Far from being \u201cvirtual\u201d or weightless, AI depends on enormous physical resources\u2014energy, water, land, and human labor. To speak of infrastructures is to remind ourselves that every generated image or text has a concrete, planetary cost.<br><br><strong>AI SLOP<\/strong><br>The flood of low-quality, mass-produced content generated by AI and scattered across the internet: stereotyped images, formulaic text, spammy videos. The result of the default parameters of the commercial AI models, \u201cslop\u201d is cheap to produce and overwhelming in volume. The term captures the growing anxiety that automated abundance is degrading our shared information environment, burying human expression beneath an undifferentiated sludge of synthetic filler.<br><br><strong>ANALYTIC (DISCRIMINATIVE) AI<\/strong><br>The branch of artificial intelligence that classifies, sorts, and recognizes rather than creates. It answers questions like \u201cIs this a face?\u201d or \u201cIs this email spam?\u201d\u2014drawing boundaries between categories within existing data. For decades this was AI\u2019s dominant mode, powering facial recognition, medical diagnostics, and surveillance systems. It is the analytic counterpart to generative AI: where one discriminates between things, the other produces new ones.<br><br><strong>ARTIFICIAL NEURAL NETWORK <\/strong><br>A computational model based on a simplified view of the organization of biological neurons in animal brains, which receive input data that is transformed and sent on to other neurons. Artificial neurons differ from biological neurons in many ways, particularly in how they are calibrated. An artificial neural network is said to be \u201cdeep\u201d when it contains more than three layers of neurons. For example,<em> convolutional neural networks*<\/em> may contain dozens or even hundreds of layers of neurons. <br><br><strong>BIOLOGICAL NEURAL NETWORK<\/strong><br>A group of neurons that exchange information through electrochemical signals in a network of connections, capable of adapting their relationships to perform specific functions such as perception, memory, or movement. <br><br><strong>CHATGPT<\/strong><br>The conversational AI system released by OpenAI in November 2022, widely seen as the spark of the generative AI wave we are still living through. Reaching a hundred million users within two months, it brought generative AI out of research labs and into everyday life almost overnight. More than a product, \u201cChatGPT\u201d became shorthand for a cultural and economic upheaval whose consequences are still unfolding.<br><br><strong>CLIP (CONTRASTIVE LANGUAGE-IMAGE PRETRAINING) <\/strong><br>Released in 2021 by OpenAI, CLIP jointly encodes natural language descriptions and images in a common <em>latent space*<\/em> to make it easier to search for, classify, and describe images. <br><br><strong>CONVOLUTIONAL NEURAL NETWORK<\/strong><br><em>Artificial neural network* <\/em>composed of feature detectors (called \u201cfilters\u201d or \u201ckernels\u201d) that scan an image to detect simple patterns (such as lines, textures, and color variations). This information is then combined by subsequent layers to infer the presence of increasingly elaborate features: corners, angles, textures, parts of objects, and, finally, the objects themselves. <br><br><strong>DALL-E<\/strong><br><em>Text-to-image generative AI model*<\/em> released by OpenAI in 2021. <br><br><strong>DATA CENTERS<\/strong><br>The physical buildings where the computation behind AI actually happens: vast warehouses packed with servers, storing data and running the calculations that train and operate AI models. They consume enormous amounts of electricity and water for cooling, and their numbers are growing rapidly. Often located far from public view, data centers are the hidden engine rooms of the digital world: the concrete reality beneath the \u201ccloud.\u201d<br><br><strong>DATA EMBEDDING <\/strong><br>Transformation of data (for example, image or words) into vectors so that this data can be processed by an algorithm. Embedding may take place within a <em>latent space.*<\/em> <br><br><strong>DEEP LEARNING <\/strong><br>A subset of <em>machine learning,*<\/em> deep learning refers to techniques using so-called <em>deep neural networks<\/em>,* meaning they are composed of multiple successive layers that process information. This enables them to identify increasingly complex features within data. The success of deep learning methods applied to image recognition is what triggered the AI boom from 2012 onward. <br><br><strong>EIGENFACE<\/strong><br>A face recognition technique based on the abstraction of distinguishing facial features and the creation of a <em>latent space*<\/em> where images of faces are positioned according to their similarity. When presented with a new face, this enables the identification of the images closest to it, and potential recognition of it.<br><br><strong>FACE AND EMOTION RECOGNITION<\/strong><br>Analytic AI systems that identify individuals from their facial features, or claim to read inner states\u2014happiness, anger, fear\u2014from expressions. Widely deployed in surveillance, policing, advertising, and border control, they raise sharp concerns about privacy and consent. Emotion recognition is especially contested: many scientists doubt that feelings can be reliably inferred from faces at all, making the technology as questionable as it is powerful.<br><br><strong>FOUNDATION MODEL<\/strong><br>A large <em>generative AI model,*<\/em> pretrained on a broad spectrum of mainly unlabeled data, using self-supervised techniques. It is used as a common base for a wide range of specialized tasks, each of which may require fine-tuning. <br><strong><br>GENERATIVE ADVERSARIAL NETWORK (GAN)<\/strong><br>A <em>generative AI model* <\/em>introduced in 2014 whose training is based on the interaction between a \u201cgenerator\u201d neural network, which generates images made of random combinations of pixels, and a \u201cdiscriminator\u201d that evaluates whether these images resemble those in the training data, for example, a cat. Through the discriminator\u2019s feedback, the generator learns by trial and error to output similar new images. GANs were widely used by artists during the second half of the 2010s.<br><br><strong>GENERATIVE AI MODEL <\/strong><br>A model is called \u201cgenerative\u201d when it is able to <em>produce<\/em> new data (images, text, etc.) after training on vast amounts of other data. <br><br><strong>GENERATIVE PRETRAINED TRANSFORMER (GPT) <\/strong><br>Family of language models developed by OpenAI from 2018 onward, which has popularized <em>transformers* <\/em>Pretrained on large datasets of unlabeled text, they can then be fine-learning-tuned to perform specialized tasks, such as translation or code generation. They make up the base of ChatGPT, released in November 2022. <br><br><strong>HALLUCINATION<\/strong><br>The term for when a generative AI system produces confident output that departs from reality. It invents facts, citations, or events in text, or, in images, conjures impossible details: six-fingered hands, garbled writing, objects that melt into one another. Because these systems generate what is statistically plausible rather than what is true or real, such errors are not glitches but structural features. The word itself is debated: critics argue it wrongly implies that the machine \u201cperceives\u201d or malfunctions, when it is only ever predicting likely patterns. <br><br><strong>IMAGE GENERATOR <\/strong><br>See <em>image-to-image generative AI model* <\/em>and <em>text-to-image generative AI model.*<\/em><br><strong><br>IMAGE-TO-IMAGE GENERATIVE AI MODEL<\/strong><br><em>Generative AI model*<\/em> that uses an input image or images to produce new images. <br><br><strong>IMAGE-TO-TEXT GENERATIVE AI MODEL<\/strong><br>See <em>multimodal model.*<\/em><br><br><strong>IMAGENET <\/strong><br>Database created in 2009 at the instigation of researcher Fei-Fei Li, containing fourteen million images manually annotated by crowdworkers (through the crowdsourcing website Amazon Mechanical Turk), used to organize an annual competition (ImageNet Large Scale Visual Recognition Challenge, or ILSVRC) to compare the performances of different computer vision algorithms. In 2012, <em>convolutional neural networks*<\/em> took part in the competition for the first time, demonstrating their technical superiority and triggering a revolution in the field.<br><br><strong>INTERPOLATION<\/strong><br>Mathematical operation consisting of estimating missing points in the data as inferred by nearby data. <br><br><strong>LAION-5B<\/strong><br>Dataset created by the German nonprofit LAION, consisting of five billion images paired with their description: their \u201ccaptions\u201d come partially from \u201calt\u201d tags provided by whoever put them online, in other words, micro-captions which accompany every image published on a website coded in HTML. (Tue tags are displayed when, for whatever reason, the image doesn\u2019t appear.) True data for LAION-SB comes from Common Crawl, a web archive updated regularly since 2008. <br><br><strong>LANGUAGE MODEL <\/strong><br>A <em>text-to-text generative AI model*<\/em> that, during its training (which involves calculating the probability of a sequence of words appearing in a given context), has encoded enough data about linguistic structure and relationships (syntax, relationships between words, sentences, concepts) to be able to generate text (predict the next words), translate from one language to another, analyze opinions, transcribe text from audio data, summarize documents, and so on.<br><br><strong>LARGE LANGUAGE MODEL (LLM)<\/strong><br>A <em>language model*<\/em> whose number of parameters and the size of its <em>training dataset*<\/em> are vast enough for it to acquire the ability to provide coherent answers to complex or specialized questions, without having been specifically trained to do so. <br><br><strong>LATENT DIFFUSION MODEL<\/strong> <br>Learning technique used by most<em> text-to-image generative AI models*<\/em> (<em>Dall-E*<\/em>, <em>Stable Diffusion,*<\/em> and <em>Midjourney*<\/em>) that involves training an algorithm by having it add noise (randomly selected pixels) to the training dataset (usually images), then predicting how much noise must be removed to recover the original data. This process allows the algorithm to integrate enough information about the images to be able to generate new ones. <br><br><strong>LATENT SPACE<\/strong><br>A multidimensional vector space in which digital objects (words, images, sounds) are positioned according to their similarities and differences. It is made up of combinations of distinguishing features retained by the algorithm when it learns to recognize objects within the <em>training dataset.*<\/em><br><br><strong>LORA (LOW-RANK ADAPTATION)<\/strong><br><em>Low-rank adaptation<\/em>, or LoRA, is a technique used for adjusting a \u201cgeneric\u201d <em>generative AI model*<\/em> (for example, <em>Stable Diffusion*)<\/em> to process new tasks or fields by modifying its parameters without having to retrain the entire model. <br><br><strong>MACHINE LEARNING <\/strong><br>The technique of configuring an algorithm on a set of data\u2014called <em>training datasets*<\/em>\u2014so that it \u201clearns\u201d their underlying relationships. The adjustment process (also called \u201clearning,\u201d \u201cconfiguration,\u201d or \u201ccalibration\u201d) enables the algorithm to classify or predict new data. <br><br><strong>MACHINE VISION \/ COMPUTER VISION<\/strong><br>Field of computer science concerned with teaching computers to detect and interpret elements (objects, faces, gestures, situations) in images. <br><br><strong>MATRICES<\/strong><br>Charts of numbers arranged in rows and columns that can represent input data (for example, the intensity of each pixel in an image) or else the parameters of a <em>neural network.* <\/em>Processing data using a layer of neurons involves multiplying the matrix of input by the matrix of parameters (before applying a bias vector and an activation function).<br><br><strong>METADATA<\/strong><br>Information <em>about<\/em> data. For example, the <em>data<\/em> of a digital image are its pixels, and the <em>metadata<\/em> are the resolution, dimensions, file format, creation date, etc., of the image.<br><br><strong>MIDJOURNEY<\/strong><br>AI research lab known for its eponymous generative AI program that outputs images from natural language descriptions (or<em> prompts*<\/em>). Released in 2022, Midjourney quickly set a standard in the field, yet it has also sparked controversy (as other text-to-image models like <em>Dall-E*<\/em> or <em>Stable Diffusion*<\/em> have done): for using artists\u2019 work without their consent, for making it easy to plagiarize, and even for its role in winning digital art and photography prizes.<br><br><strong>MULTIMODAL MODEL<\/strong><br><em>Generative AI models*<\/em> that are able to pass from one modality (such as text, image, video, or audio) to another. For example, a multimodal model may produce a natural language description of an image (image-to-text) or else output an image based on a natural language description (text-to-image). AI has also revolutionized the field of moving images with <em>text-to-video generative AI models*<\/em> such as Runway Gen-3 or Sora, which can generate all sorts of videos based on <em>prompts.*<\/em><br><br><strong>NATURAL LANGUAGE PROCESSING (NLP)<\/strong><br>Computer science field concerned with language, its aim being to create programs capable of carrying out textual or oral tasks: translation, question-answering, summarizing, text generation, etc.<br><br><strong>NSFW <\/strong><br>Abbreviation for \u201cnot safe for work,\u201d referring to violent or pornographic content (particularly images).<br><br><strong>PROMPT<\/strong><br>A prompt is the natural language input which, when given to a <em>generative AI model,* <\/em>triggers and guides the content produced: a question, a paragraph to change, the description of an image, and so on. How precise the prompt is (examples, specifications, etc.) can cause considerable variations in both the type and quantity of output. <br><br><strong>SCRAPING <\/strong><br>The process of automatically extracting data from the internet.<br><br><strong>SEQUENTIAL DATA<\/strong><br>Data arranged in sequences where order matters. For example, the words in a sentence or the notes of a melody. Starting in the late 2010s, <em>transformers*<\/em> made it possible to process sequential data much more efficiently, leading to the development of large <em>language models*<\/em> such as GPT-2, GPT-3, and ChatGPT. In turn, this enabled the processing of <em>prompts*<\/em> in <em>text-to-image generative AI models.*<\/em><br><br><strong>SLOPAGANDA<\/strong><br>A blend of <em>\u201cAI slop*\u201d<\/em> and \u201cpropaganda,\u201d coined in 2025 for AI-generated content designed to manipulate beliefs and emotions for political ends. What sets it apart from older propaganda is scale, speed, and personalization: it can be produced cheaply in vast quantities and micro-targeted to individuals. Often it aims not at factual deception but at building emotional associations that stick through sheer repetition.<br><strong><br>STABLE DIFFUSION<\/strong><br><em>Generative AI model* <\/em>released in 2022 that can generate artificial images using text prompts, one version of which is called \u201cStable Diffusion XL\u201d or \u201cSDXL.\u201d The model was trained on the LAION-5B database and the LAION-Aesthetics V2 subset, whose code can be viewed and downloaded. Stable Diffusion (as well as <em>Dall-E*<\/em>) has sparked the same controversies as <em>Midjourney.*<\/em> <br><br><strong>TEXT-TO-IMAGE GENERATIVE AI MODEL <\/strong><br><em>See multimodal model.*<\/em><br><br><strong>TEXT-TO-TEXT GENERATIVE AI MODEL <\/strong><br>See <em>language model*<\/em> and <em>large language model.*<\/em><br><br><strong>TEXT-TO-VIDEO GENERATIVE AI MODEL <\/strong><br>See <em>multimodal model.*<\/em><br><br><strong>TRAINING SET<\/strong><br>A collection of data, sometimes manually labeled, that an algorithm learns to recognize or complete by trial and error. With each error, the algorithm adjusts its parameters so as to gradually encode enough information about the data (their features, similarities, and differences) to be able to classify or produce new ones.<br><br><strong>TRANSFORMER<\/strong><br>A <em>deep learning*<\/em> <em>NLP*<\/em> model proposed in 2017 by a team of Google engineers, whose main innovation is to give a central role to the \u201cattention\u201d mechanism, which consists of weighting the relative importance of each word in relationship to others. This parallel (rather than sequential) treatment requires less computing and has made it possible to multiply the size of <em>training sets*<\/em> and the number of parameters, contributing to the emergence of <em>large language models.*<\/em><br><br><strong>UPSCALING<\/strong><br>Refers to the process of using AI to increase the resolution of an image or video, usually by adding pixels. <br><br><strong>VARIATIONAL AUTOENCODER<\/strong><br>An <em>artificial neural network*<\/em> invented in 2013 by Diederik P. Kingma and Max Welling. It is designed to learn a probabilistic distribution of <em>training sets*<\/em> in a <em>structured latent space*<\/em> in order to generate similar data, or even new data, located between the categories of existing data. <br><br><strong>VECTORS<\/strong><br>The basic way AI systems represent the world: as long lists of numbers. A word, an image, a sound\u2014each is translated into a string of coordinates that fixes its position in a vast mathematical space. Things with similar meanings end up close together. This translation of meaning into numbers is what lets machines \u201ccompute\u201d language and images, but it also reduces every nuance to measurable distance.<br><br><strong>WEB CRAWLING<\/strong><br>An automated web browsing process used to collect information such as website content and links. <br><br><strong>WORLD MODEL<\/strong><br>AI systems that don\u2019t just generate a static image or text, but simulate an environment and predict what happens next as a user acts within it. Trained on vast amounts of video, they can conjure explorable, physically plausible worlds from a single prompt. Promoted as tools for training robots and self-driving cars, they extend generative AI from making pictures toward modeling reality itself.<br><br><br><em>Alban Leveau-Vallier and Antonio Somaini<br>Partially translated from the French by Michelle Noteboom<br>Translated into German by Stefan Barmann<\/em><br><br><\/p>\n<\/div>\n<\/div>\n\n\n\n\n<div  class='col col-xs-12 ho-pattern-optional-button padding-top-4'>\n\n<div  class='wp-block-ho-buttons '>\n<div class=\"wp-block-ho-buttons-item\">\n\t<a href=\"https:\/\/www.google.de\/\"  target=\"\" role=\"button\" aria-pressed=\"false\" rel=\"nofollow\" class=\"ho-button is-primary mobile-full-width\">\n\t\t\t\t<span class=\"ho-button-label \">Lorem Ipsum<\/span>\n\t<\/a>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/section>","protected":false},"excerpt":{"rendered":"","protected":false},"author":22,"featured_media":50389,"parent":0,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"_acf_changed":false,"ho_image_focal_point":"","footnotes":""},"class_list":["post-53934","page","type-page","status-publish","has-post-thumbnail","hentry"],"acf":{"seo_titel":"Glossary for the Exhibition THE WORLD THROUGH AI","seo_description":"Understanding Artificial Intelligence: Find clear explanations of key AI concepts in the glossary for the exhibition \u2018The World Through AI\u2019.","keywords":"The World Through AI, SCHIRN Frankfurt, AI Exhibition","noindex":true,"redirect":"","relevancy":"0","post_type_label":false},"_links":{"self":[{"href":"https:\/\/www.schirn.de\/en\/wp-json\/wp\/v2\/pages\/53934","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.schirn.de\/en\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/www.schirn.de\/en\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/www.schirn.de\/en\/wp-json\/wp\/v2\/users\/22"}],"replies":[{"embeddable":true,"href":"https:\/\/www.schirn.de\/en\/wp-json\/wp\/v2\/comments?post=53934"}],"version-history":[{"count":5,"href":"https:\/\/www.schirn.de\/en\/wp-json\/wp\/v2\/pages\/53934\/revisions"}],"predecessor-version":[{"id":54694,"href":"https:\/\/www.schirn.de\/en\/wp-json\/wp\/v2\/pages\/53934\/revisions\/54694"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.schirn.de\/en\/wp-json\/wp\/v2\/media\/50389"}],"wp:attachment":[{"href":"https:\/\/www.schirn.de\/en\/wp-json\/wp\/v2\/media?parent=53934"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}