×
American AI chatbots, and Gemini especially, collect more user data than Chinese alternatives
Written by
Published on
Join our daily newsletter for breaking news, product launches and deals, research breakdowns, and other industry-leading AI coverage
Join Now

New research challenges the prevailing narrative about Chinese AI models posing the greatest privacy risks, revealing instead that popular American chatbots often collect more personal data. This counterintuitive finding comes amid heated international debate over AI ethics and regulation, as users worldwide increasingly rely on AI assistants while governments struggle to establish appropriate privacy safeguards across different jurisdictions.

The big picture: Despite widespread suspicion of Chinese-developed DeepSeek R1, research from VPN provider Surfshark finds it ranks only fifth for data collection among popular AI chatbots.

  • American-developed Google Gemini tops the list as the most data-intensive AI chatbot, collecting 22 out of 35 possible user data types.
  • The study analyzed privacy details for the ten most popular chatbots on Apple’s App Store, including ChatGPT, Gemini, Copilot, Perplexity, DeepSeek, Grok, Jasper, Poe, Claude, and Pi.

Key details: DeepSeek’s controversial Chinese origins have fueled privacy concerns, prompting bans by government and private organizations in the US and elsewhere.

  • DeepSeek’s flagship open-source AI model debuted in January, attracting approximately 12 million users within just two days of its launch.
  • The company’s privacy policy acknowledges storing personal information on secure servers located in China, which has intensified concerns among Western users.

By the numbers: Google Gemini collects the most extensive range of sensitive user information among popular AI chatbots.

  • Gemini harvests highly sensitive data including precise location, user content, contacts lists, and browsing history.
  • Only three chatbots—Gemini, Copilot, and Perplexity—collect precise location data from users.
  • Approximately 30% of chatbots share sensitive user data with third parties, and the same percentage actively track user data.

For comparison: DeepSeek collects an average of 11 unique data types, focusing primarily on contact information, user content, and diagnostics.

  • ChatGPT, developed by US-based OpenAI, collects 10 unique data types, including contact information, user content, identifiers, usage data, and diagnostics.
  • This places both popular AI assistants in the middle range for data collection practices among the chatbots analyzed.

Why it matters: The findings highlight how geopolitical tensions may distort public perception of privacy risks, with users potentially overlooking excessive data collection by domestic companies while focusing concerns on foreign alternatives.

  • The research emerges amid an escalating AI arms race between the US and China, which continues to raise profound privacy, security, and ethical questions on both sides.
Worried about DeepSeek? Turns out, Gemini is the biggest data offender

Recent News

AI monopolies threaten free society, new research reveals

Leading tech firms could exploit their AI systems internally to gain unprecedented advantages, creating a massive power imbalance that evades public scrutiny and regulatory oversight.

AI coding tools fall short in mimicking programmers’ critical thinking

AI coding tools optimize for text generation while missing programming's essence: reasoning about complex systems and contexts that aren't visible in the code itself.

AI’s Mirror Trap risks stifling human imagination

As AI increasingly reflects existing ideas back to us, it risks creating intellectual echo chambers that may ultimately constrain original human thinking and creative development.