LLMs and Multimodal AI with Stefania Druga of Google
In this episode, Dr. Stefania Druga (she/her), Research Scientist on the Bard (now Gemini) team at Google, shares insights into her work on developing multimodal AI applications. She explains how Bard, an LLM, is trained on a massive amount of internet data, enabling it to understand and generate text, summarize content, and interact with images. They cover the advancements in AI and the importance of multimodal capabilities that extend beyond text to include images and other forms of data, pushing the boundaries of AI's application in daily life.
Stefania Druga Quotes
📊 Data Set Sizes: "You can think of what does it mean to scrape the entire internet and that's pretty much it...all the data that was ever created digitally and has the right permissions."
⚙️ “These AI assistants are becoming now embedded in not only standalone applications like Bard and ChatGPT, but also in a variety of products...they can become embedded in your calendar to help with smart planning”
🚀 “I realized in 2016 that AI, voice assistants, machine learning are going to be huge. So I started thinking of how do we teach the next generation? That's how I started working on Cognimates & launched a platform.”
Resources
- Fast.ai: Making neural nets uncool again
- LangChain: Build context-aware, reasoning applications with LangChain’s flexible abstractions & AI-first toolkit
- Stefania’s publications: stefania11.github.io
- Cognimates: Stefania’s coding education project started @ MIT Media Lab
Stefania Druga is a Research Scientist at Google Gemini AI. She was a principal researcher at the Center of Applied AI Research at the University of Chicago. She earned a PhD in Creative AI Literacies from the University of Washington and a Master of Science from MIT. During her PhD, she did several research internships at Google X, Microsoft Research, and Fixie.ai focusing on LLM applications for developer tools, programming languages, and data science applications. She loves trail running & drawing with robots. Connect with Stefania:
Let's Connect
YouTube @YourAIRoadmap
LinkedIn Let Joan know you listen!
Pre-order Joan's Book! ✨📘🔜 Your AI Roadmap: Actions to Expand Your Career, Money, and Joy Jan 9, 2025, Wiley
Who is Joan? Ranked the #4 Voice AI Influencer, Dr. Joan Palmiter Bajorek is the CEO of Clarity AI, Founder of Women in Voice, & Host of Your AI Roadmap. With a decade in software & AI, she has worked at Nuance, VERSA Agency, & OneReach.ai in data & analysis, product, & digital transformation. She's an investor & technical advisor to startups & enterprises. A CES & VentureBeat speaker & Harvard Business Review published author, she has a PhD & is based in Seattle.
Clarity AI builds AI that makes businesses run better. Our mission is to help SMB + enterprise leverage the power of AI. Whether your budget is 5-8 figures, we can build effective AI solutions. Book a 15min
♥️ Love it? Rate, Review, Subscribe. Send to a friend 😊
Chapters
1. Introduction and Overview (00:00:00)
2. What is Bard? (Google Gemini AI) (00:01:09)
3. Data Set Sizes for Large Language Models (00:02:44)
4. Applications of Bard (now Google Gemini) (00:03:30)
5. Multimodal AI (00:05:55)
6. Stefania's Work on Bard (00:07:00)
7. Transfer Learning and Empowering Users (00:09:29)
8. The Importance of AI Literacy (00:10:34)
9. Ethical Considerations and Trust in AI (00:12:35)
10. Prompt Engineering (00:18:58)
11. What is NLP? "Natural Language" and Acronyms in AI (00:22:43)
12. Forecasting Multimodal AI Applications of the Future (00:27:45)
13. Stefania's Background and Career Path (00:33:50)
14. Pivoting from Academia to Industry (00:39:24)
15. Resources and Advice for Getting Started in AI (00:41:53)