Microsoft denies claims of using Word and Excel data for AI training

TLDR

  • Microsoft denies scraping user data from Word and Excel for AI training purposes.
  • The Connected Experiences feature is enabled by default and reportedly collects data unless users manually disable it.
  • Microsoft says that user data remains private and is not used without consent.

Microsoft is facing allegations of “scraping” user-generated data from its word processor Word and spreadsheet tool Excel to train its AI systems, a claim the company denies.

Recent reports suggested that users of Word and Excel are required to opt out if they do not wish their data to be used for AI training. According to nixCraft, a contributor to Cyberciti.biz, Microsoft’s Connected Experiences feature is at the center of the controversy. The feature, enabled by default, reportedly collects data from user-created Word and Excel files to train AI models unless users manually disable it. Microsoft addressed these circulating claims, seeking to clarify the situation.

In a post by Microsoft 365 on X, the company stated: “In the M365 apps, we do not use customer data to train LLMs. This setting only enables features requiring internet access like co-authoring a document.”

Frank Shaw (@fxshaw.com), Microsoft’s head of communications, also refuted the allegations in a Bluesky post on November 26, 2024. He wrote: “As noted when this came up a few weeks back, this is not true and following the link for more information makes that clear.”

Microsoft and intellectual property amid rise of AI

In a blog post from August 2024, Microsoft confirmed that user data remains private and is not shared without consent. The company stated: “Generative AI models do not store training data or return it to provide a response, and instead are designed to generate new content.”

The post goes on to say: “If we plan for additional changes to how we use consumer data for training our generative AI models in Copilot, we will share that transparently and will ensure there remains an ability for consumers to stay in control and choose whether to allow that.”

However, Microsoft’s Services Agreement includes a clause that grants the company “a worldwide and royalty-free intellectual property license to use Your Content.”

The clause reads: “To the extent necessary to provide the Services to you and others, to protect you and the Services, and to improve Microsoft products and services, you grant to Microsoft a worldwide and royalty-free intellectual property license to use Your Content, for example, to make copies of, retain, transmit, reformat, display, and distribute via communication tools Your Content on the Services.”

ReadWrite reported a similar situation faced by Adobe earlier this year when its user terms were widely misunderstood to suggest the company was using user-generated content to train generative AI. In response, Adobe quickly revised the language in its terms of service to clarify that this was not the case.

Featured image: Canva

Suswati Basu
Tech journalist

