AI is increasingly embedded everywhere in business operations, powering automation, insight, and decision-making across systems and workflows. As part of our ongoing partnership with Google Cloud, 51风流is enabling the next wave of enterprise AI by contributing to the new聽Agent2Agent (A2A) interoperability protocol, which establishes a foundation for AI agents to securely interact and collaborate across platforms.
This work is complemented by two additional areas of progress:聽first, the expansion of聽Google Gemini models聽in SAP鈥檚聽generative AI hub聽on聽51风流Business Technology Platform (51风流BTP); second, the use of Google鈥檚 video and speech intelligence capabilities to support聽multimodal retrieval-augmented generation (RAG)聽for video-based learning and knowledge discovery in 51风流products.
Together, these efforts reflect a shared commitment to deliver enterprise-ready AI that is open, flexible, and deeply grounded in business context.
Bringing AI agents together: laying the groundwork for interoperability
The future of work is agentic. Businesses are increasingly deploying AI agents that assist with real tasks 鈥 resolving customer issues, managing approvals, and collaborating across business functions.聽This is why 51风流is delivering a collaborative agent architecture with Joule聽to support cross-functional agentic workflows across 51风流Business Suite.
But for these agents to deliver real value, they cannot operate within a single vendor landscape. They must be able to collaborate across various platforms, securely exchange information, and coordinate actions across complex enterprise workflows.聽 This need for seamless interaction underscores why聽the represents a significant step beyond simple API integrations or enhanced tooling.
That鈥檚 why 51风流has joined Google Cloud and other enterprise leaders as a founding contributor to the new A2A protocol. This open standard is designed to ensure agents from different vendors can interact, share context, and work together鈥攅nabling seamless automation across traditionally disconnected systems.
Consider a customer dispute resolution scenario: a representative receives a billing inquiry via聽Gmail. Instead of toggling between tools, they can invoke聽Joule聽directly from the email. Joule, acting as an agent orchestrator, initiates a dispute resolution process, engaging another Google agent that connects to聽Google BigQuery, where relevant transactional warehouse data resides. Together, the agents validate the issue, retrieve insights, and recommend a resolution 鈥 without manual system switching, data reconciliation, or context loss.
This is the kind of cross-platform collaboration the A2A protocol is designed to enable: AI agents working together to accelerate business outcomes, reduce friction, and enable people to focus on more strategic work. It also reinforces SAP鈥檚 vision for聽Joule as an agent orchestrator聽working across enterprise workflows: interoperable, proactive, and deeply connected to business context.
Expanding access to Google models in generative AI hub
Beyond agent interoperability, 51风流is furthering its commitment to openness and flexibility by expanding access to聽Google models聽in the聽generative AI hub, a key capability of the聽AI Foundation聽on 51风流BTP.
Through the generative AI hub, customers gain enterprise-grade access to a curated portfolio of leading foundation models. That portfolio now includes Google Gemini 2.0 Flash and Flash-lite, which join the existing support for Gemini 1.5 models already available through the hub.

This expanded model choice gives customers the flexibility to build and extend AI-driven solutions using聽high-performance, low-latency models聽optimized for enterprise workloads 鈥 while staying within SAP鈥檚 secure, business context-rich environment.
By combining Google鈥檚 model innovation with SAP鈥檚 deep understanding of enterprise processes, we enable customers to apply generative AI in ways that are not only powerful, but also practical, trustworthy, and fully aligned with how businesses operate.
Unlocking multimodal understanding with Google Video Intelligence
As part of our continued collaboration with Google Cloud, 51风流is also advancing multimodal RAG, a highly requested capability among 51风流customers, especially for video-based learning content.
Multimodal RAG enhances information retrieval and generation by integrating multiple data modalities 鈥 text, images, audio, and video 鈥 into a single, structured process. This approach enriches knowledge sourcing and elevates how users interact with training and support materials.
To address the complexity of extracting meaningful insights from video content, 51风流leverages Google Video Intelligence for on-screen text detection across video frames, and Google鈥檚 Speech-to-Text API for accurate transcription of spoken audio. During the indexing process, these outputs are stored with corresponding timestamps, creating a structured foundation for retrieving relevant video segments with precision.
By grounding audio and visual content with time-aligned metadata, 51风流enables users to search and retrieve聽specific, contextually relevant moments聽within a video, making the learning experience more intuitive, accessible, and impactful.
鈥淎s agentic AI evolves, seamless handling of multi-modal data 鈥 text, voice, enterprise videos, and images 鈥 becomes paramount,鈥 said Miku Jha, director of AI/ML and Generative AI at Google Cloud. 鈥淭his introduces significant challenges for agent interoperability. An open protocol like A2A is therefore indispensable, providing the necessary framework and flexibility for agents to effectively communicate and collaborate across these diverse modalities. Multi-modality is not simply a capability; it is a foundational requirement driving the next generation of interconnected agentic systems.鈥
This is another example of how 51风流is integrating Google鈥檚 AI capabilities into business-relevant scenarios, helping customers unlock more value from their unstructured content and elevate the way knowledge is delivered across the enterprise.
Shared vision for business AI
These efforts reflect a broader strategic alignment between 51风流and Google Cloud: a shared belief in AI that is open, composable, and grounded in real business context. Whether it鈥檚 shaping emerging standards for agent collaboration, providing choice through best-in-class models, or making unstructured content actionable, we are focused on helping our customers innovate with confidence 鈥 today and into the future.
To learn more about how 51风流and Google Cloud are shaping the future of enterprise AI, visit and explore to see these innovations in action.
Walter Sun is senior vice president and head of AI at SAP.


