×
American AI chatbots, and Gemini especially, collect more user data than Chinese alternatives
Written by
Published on
Join our daily newsletter for breaking news, product launches and deals, research breakdowns, and other industry-leading AI coverage
Join Now

New research challenges the prevailing narrative about Chinese AI models posing the greatest privacy risks, revealing instead that popular American chatbots often collect more personal data. This counterintuitive finding comes amid heated international debate over AI ethics and regulation, as users worldwide increasingly rely on AI assistants while governments struggle to establish appropriate privacy safeguards across different jurisdictions.

The big picture: Despite widespread suspicion of Chinese-developed DeepSeek R1, research from VPN provider Surfshark finds it ranks only fifth for data collection among popular AI chatbots.

  • American-developed Google Gemini tops the list as the most data-intensive AI chatbot, collecting 22 out of 35 possible user data types.
  • The study analyzed privacy details for the ten most popular chatbots on Apple’s App Store, including ChatGPT, Gemini, Copilot, Perplexity, DeepSeek, Grok, Jasper, Poe, Claude, and Pi.

Key details: DeepSeek’s controversial Chinese origins have fueled privacy concerns, prompting bans by government and private organizations in the US and elsewhere.

  • DeepSeek’s flagship open-source AI model debuted in January, attracting approximately 12 million users within just two days of its launch.
  • The company’s privacy policy acknowledges storing personal information on secure servers located in China, which has intensified concerns among Western users.

By the numbers: Google Gemini collects the most extensive range of sensitive user information among popular AI chatbots.

  • Gemini harvests highly sensitive data including precise location, user content, contacts lists, and browsing history.
  • Only three chatbots—Gemini, Copilot, and Perplexity—collect precise location data from users.
  • Approximately 30% of chatbots share sensitive user data with third parties, and the same percentage actively track user data.

For comparison: DeepSeek collects an average of 11 unique data types, focusing primarily on contact information, user content, and diagnostics.

  • ChatGPT, developed by US-based OpenAI, collects 10 unique data types, including contact information, user content, identifiers, usage data, and diagnostics.
  • This places both popular AI assistants in the middle range for data collection practices among the chatbots analyzed.

Why it matters: The findings highlight how geopolitical tensions may distort public perception of privacy risks, with users potentially overlooking excessive data collection by domestic companies while focusing concerns on foreign alternatives.

  • The research emerges amid an escalating AI arms race between the US and China, which continues to raise profound privacy, security, and ethical questions on both sides.
Worried about DeepSeek? Turns out, Gemini is the biggest data offender

Recent News

AI’s impact on productivity: Strategies to avoid complacency

Maintaining active thinking habits while using AI tools can prevent cognitive complacency without sacrificing productivity gains.

OpenAI launches GPT-4 Turbo with enhanced capabilities

New GPT-4.1 model expands context window to one million tokens while reducing costs by 26 percent compared to its predecessor, addressing efficiency concerns from developers.

AI models struggle with basic physical tasks in manufacturing

Leading AI systems fail at basic manufacturing tasks that human machinists routinely complete, highlighting a potential future where knowledge work becomes automated while physical jobs remain protected from AI disruption.