Google Gemini Expands to Audio, Adds New AI Features
Google has rolled out a major update to its Gemini-powered tools, introducing long-requested features and expanding AI support across apps. From audio file compatibility in the Gemini app to multilingual search support and smarter report generation in NotebookLM, these updates strengthen Google’s position in the competitive AI landscape.
Gemini App Now Supports Audio Files
For the first time, the Gemini app can accept and process audio files. This feature was the most requested update, according to Josh Woodward, Vice President of Google Labs and Gemini, who shared the news on X.
Free users: Can upload up to 10 minutes of audio and use five prompts per day.
AI Pro and AI Ultra users: Can upload audio up to three hours in length.
The Gemini app supports multiple file formats, and users can add up to 10 files per prompt—including compressed files within ZIP folders.
Google Search Adds Five New Languages
Google Search’s AI Mode now supports questions and exploration in five additional languages:
Hindi
Indonesian
Japanese
Korean
Brazilian Portuguese
This enhancement comes with the integration of Gemini 2.5 into Google Search, enabling users around the world to ask complex queries in their preferred language while browsing the web more deeply.
NotebookLM Receives Powerful Report Styles
The Gemini-powered research tool, NotebookLM, is also getting smarter with new customizable report formats. Based on uploaded documents, files, and media, NotebookLM can now generate:
Study guides
Briefing documents
Blog posts
Flashcards
Quizzes
Users have the flexibility to choose the report format and adjust tone, style, and structure. The expanded feature is being rolled out in more than 80 languages and should be fully available by the end of this week.
Recent Developments in Gemini AI
Google has been consistently improving its AI ecosystem over the past few months:
August 2025: Gemini began automatically recalling user details and preferences from previous conversations. Free users also gained access to Google Workspace’s AI-powered video tool, Vids.
September 2025: Google Photos integrated Veo 3, the company’s latest video generation software, allowing free users to create short animated clips (up to 4 seconds) from still photos.
Why This Update Matters
These updates position Gemini as a more versatile and user-friendly AI platform. By enabling audio uploads, introducing multilingual support in Google Search, and enhancing NotebookLM’s reporting capabilities, Google is ensuring that Gemini extends beyond text and images into a multi-modal AI assistant for research, content creation, and personal productivity.