OpenAI
Entwickler von ChatGPT und GPT-Modellen
GPT Image
creativeKI-Bildgenerierung (ehem. DALL-E) mit GPT Image 1.5. API $0.005-$0.04/Bild.
WebsiteOpenAI Operator
agentsBrowser-basierter KI-Agent mit 87% Erfolgsrate bei komplexen Web-Tasks.
WebsiteOpenAI GPT-5.x
modelsGPT-5.4: Unified Architecture, 57.7% SWE-bench Pro, 1M Kontext. API $2.50/$15 pro Mio. Tokens.
WebsitearXiv:2605.19192v1 Announce Type: new Abstract: Multimodal agents use screenshots, documents, and webpages to choose to...
arXiv:2605.19932v1 Announce Type: new Abstract: Large language model (LLM) agents increasingly operate over long and re...
arXiv:2605.19846v1 Announce Type: cross Abstract: Vision-Language Models (VLMs) have demonstrated remarkable capabiliti...
arXiv:2410.18856v4 Announce Type: replace Abstract: Frontier large language models (LLMs), such as GPT-5, Claude 4.5, G...
arXiv:2604.11417v5 Announce Type: replace-cross Abstract: Co-speech gestures increase engagement and improve speech und...
arXiv:2605.16630v2 Announce Type: replace-cross Abstract: Hybrid local--cloud agents enrich user requests with context ...
arXiv:2605.17046v2 Announce Type: replace-cross Abstract: Autonomous AI coding agents are becoming a core tool for ML p...
arXiv:2605.20149v1 Announce Type: new Abstract: Large language models (LLMs) are widely used for open-ended tasks, but ...
arXiv:2605.19069v1 Announce Type: new Abstract: Code-switching -- the natural alternation between two languages within ...
arXiv:2605.19077v1 Announce Type: new Abstract: Task-oriented dialogue systems -- handling transactions, reservations, ...