Google Gemini Expands to Audio, Adds New AI Features

Google Gemini Gemini app audio AI Search languages
Nikita Shekhawat
Nikita Shekhawat
 
September 9, 2025 2 min read

Google has rolled out a major update to its Gemini-powered tools, introducing long-requested features and expanding AI support across apps. From audio file compatibility in the Gemini app to multilingual search support and smarter report generation in NotebookLM, these updates strengthen Google’s position in the competitive AI landscape.

Gemini App Now Supports Audio Files

For the first time, the Gemini app can accept and process audio files. This feature was the most requested update, according to Josh Woodward, Vice President of Google Labs and Gemini, who shared the news on X.

  • Free users: Can upload up to 10 minutes of audio and use five prompts per day.

  • AI Pro and AI Ultra users: Can upload audio up to three hours in length.

The Gemini app supports multiple file formats, and users can add up to 10 files per prompt—including compressed files within ZIP folders.

Google Search Adds Five New Languages

Google Search’s AI Mode now supports questions and exploration in five additional languages:

  • Hindi

  • Indonesian

  • Japanese

  • Korean

  • Brazilian Portuguese

This enhancement comes with the integration of Gemini 2.5 into Google Search, enabling users around the world to ask complex queries in their preferred language while browsing the web more deeply.

NotebookLM Receives Powerful Report Styles

The Gemini-powered research tool, NotebookLM, is also getting smarter with new customizable report formats. Based on uploaded documents, files, and media, NotebookLM can now generate:

  • Study guides

  • Briefing documents

  • Blog posts

  • Flashcards

  • Quizzes

Users have the flexibility to choose the report format and adjust tone, style, and structure. The expanded feature is being rolled out in more than 80 languages and should be fully available by the end of this week.

Recent Developments in Gemini AI

Google has been consistently improving its AI ecosystem over the past few months:

  • August 2025: Gemini began automatically recalling user details and preferences from previous conversations. Free users also gained access to Google Workspace’s AI-powered video tool, Vids.

  • September 2025: Google Photos integrated Veo 3, the company’s latest video generation software, allowing free users to create short animated clips (up to 4 seconds) from still photos.

Why This Update Matters

These updates position Gemini as a more versatile and user-friendly AI platform. By enabling audio uploads, introducing multilingual support in Google Search, and enhancing NotebookLM’s reporting capabilities, Google is ensuring that Gemini extends beyond text and images into a multi-modal AI assistant for research, content creation, and personal productivity.

Nikita Shekhawat
Nikita Shekhawat
 

Nikita is a digital marketing expert with a strong background in SEO, content strategy, and performance marketing. She specializes in driving brand growth through data-driven campaigns, social media optimization, and AI-powered marketing strategies. With a passion for innovation, Nikita helps businesses enhance their online presence, attract the right audience, and achieve measurable results.

Related Articles

social media bots

Sam Altman Says Bots Are Making Social Media Feel Fake

Sam Altman warns that bots and AI-driven posts are making social media feel fake, raising concerns about authenticity on Reddit and X.

By Ankit Agarwal September 9, 2025 3 min read
Read full article
Isotopes AI

Scale AI’s Former CTO Launches Isotopes AI to Transform Big Data Access with Intelligent Agent

Isotopes AI secures $20M to launch Aidnn, an AI agent solving enterprise big data access, enabling business leaders to query data in natural language.

By Hitesh Suthar September 8, 2025 3 min read
Read full article
ChatGPT personality

OpenAI Restructures Research Team Behind ChatGPT’s Personality

OpenAI merges its Model Behavior team into Post Training to refine ChatGPT’s personality, balance friendliness, and reduce AI sycophancy.

By Ankit Lohar September 8, 2025 3 min read
Read full article
AI Infrastructure & Cloud Computing

CoreWeave Acquires Agent-Training Startup OpenPipe to Boost AI Capabilities

CoreWeave acquires OpenPipe to enhance AI agent training with reinforcement learning, powering scalable custom AI solutions for enterprises and startups.

By Govind Kumar September 4, 2025 2 min read
Read full article