Skip to content

SPEAKER 2024

Koushik Ghosh

Cloud Engineer, Google Cloud

About Talk

Unlocking the power of Multimodal Foundation Models

Explore how the Gemini model integrates various types of data such as text, images, and audio to drive innovation across diverse applications. We’ll delve into key multimodal features and showcase how Gemini, an open-source platform, can be utilized to develop cutting-edge multimodal applications. The session will also focus on the concept of “grounding” — connecting models to real-world context — and will highlight how Gemini integrates with custom solutions to enhance this grounding capability. The emphasis on open-source technologies will demonstrate how the community can contribute to, extend, and implement these models for a wide range of use cases, ensuring transparency and fostering collaboration. Let me know if anything else is pending from my side. Requesting you to send the final session time for keynote, tech talk and workshops for Google Cloud speakers.

TRACK: AI and ML

24th Oct 2024 | Time: 03:30 - 04:00

About Speaker

Koushik is a technology lead with over 14 years of experience in building and scaling both enterprise and consumer products, particularly on Google AI platforms. With deep expertise in Data Science and Product Development, he is passionate about leveraging AI responsibly to empower businesses and improve lives. Currently serving as a Cloud AI Engineer at Google Cloud, Koushik provides strategic advice to Fortune 500 companies and startups, helping them drive digital transformation through cloud technology and AI/ML-driven solutions. His work focuses on guiding businesses in adopting AI/ML to optimize and scale their applications for future success. Prior to Google, Koushik played a pivotal role in building multiple products from the ground up across the Airlines, Ads, and Banking domains.