Luc Rocher

Research

Training language models to be warm can reduce accuracy and increase sycophancy

Published in Nature

Protecting health data at UK Biobank

Published in BMJ

Reliability of LLMs as medical assistants for the general public: a randomized preregistered study

Published in Nature Medicine

Attributing and situating knowledge cannot be left to language models

Published in Nature Machine Intelligence

Meaningful Data Access for Quantitative Algorithm Audits

Presented at ACM CHI

Supporting adolescents to challenge algorithmic profiling in online platforms

Preprint on arXiv

‘We can see a savage’: a case study of the colonial gaze in generative AI algorithms

Published in AI & Society

Gender trouble in language models: an empirical audit guided by gender performativity theory

Presented at ACM FAccT

Evaluating the use of a language model to crowdsource gun violence reports

Presented at ACM CSCW

Measuring what matters: Construct validity in large language model benchmarks

Presented at NeurIPS

A scaling law to model the effectiveness of identification techniques

Published in Nature Communications

Building infrastructure is key to unifying UK health data

Published in BMJ

Anonymization: The imperfect science of using data while preserving privacy

Published in Science Advances

Press

BBC News

The friendlier the AI chatbot the more inaccurate it is, study suggests

New study led by Lujain and Sofia finds that friendlier chatbots make more mistakes.

29 Apr 2026

Mashable

Study: Friendly AI chatbots may be less accurate

How does a friendlier chatbot respond to a falsehood about the moon landings?

29 Apr 2026

Nature

Friendlier LLMs tell users what they want to hear - even when it is wrong

A large language model that is trained to respond in a warm manner is more likely to give incorrect information and reinforce conspiracy beliefs.

29 Apr 2026

The Telegraph

Why you don’t want your AI chatbot to be nice to you

Systems trained to sound friendlier are up to 30 per cent less accurate, study finds

29 Apr 2026

The Guardian

Friendly AI chatbots more likely to support conspiracy theories, study finds

Chatbots programmed to respond warmly even cast doubts on Apollo moon landings and fate of Hitler, researchers say.

29 Apr 2026

The Verge

Friendly chatbots make more mistakes

The researchers found AI chatbots trained to be warmer were significantly more likely to make factual errors and agree with false beliefs than the originals.

29 Apr 2026

Science

UK Biobank faces questions about data security after latest breach

Experts say the lapse highlights that even new measures to control access did not safeguard deidentified patient information; Q&A with Luc.

24 Apr 2026

El País

Chinese website sells medical information of 500,000 volunteers from UK database

Luc comments on the UK Biobank data leak, with medical data put for sale on a Chinese website.

23 Apr 2026

The Guardian

What is the UK Biobank project and what are the privacy concerns around it?

Volunteers’ data has enabled medical breakthroughs, but there are questions over how that data is protected. Luc comments.

23 Apr 2026

The Times

What is UK Biobank? Medical database that’s a victim of own success

The resource at the centre of a data breach in China is a treasure trove for scientists and has been responsible for a series of medical breakthroughs. Luc comments on Biobank data leaks.

23 Apr 2026

The Guardian

Now you can break up with big tech at a bar: ‘cybersecurity disguised as a party’

Luc comments on why people are turning away from big tech.

16 Apr 2026

ARS Technica

Americans ask AI for health care. Hospitals think the answer is more chatbots.

Article cites our research which warns about the risks in AI chatbots giving medical advice.

14 Apr 2026

De Standaard

Dr ChatGPT doesn’t help you any better than Dr Google, and that’s not because of the AI models’ ‘knowledge.’

New study led by Andrew warns of the risks in AI chatbots giving medical advice.

09 Feb 2026

New York Times

Health advice from AI chatbots is frequently wrong, study shows

New study led by Andrew warns of the risks in AI chatbots giving medical advice.

09 Feb 2026

Reuters

AI no better than other methods for patients seeking medical advice, study shows

New study led by Andrew warns of the risks in AI chatbots giving medical advice.

09 Feb 2026

The Register

AI chatbots are no better at medical advice than a search engine

A new study led by OII researchers warns of the risks in AI chatbots giving medical advice.

09 Feb 2026

New York Times

Frustrated by the Medical System, Patients Turn to A.I

Chatbots are cheap, always available, superficially empathetic — and sometimes wrong. Some have concluded they’re a risk worth taking. Article references upcoming study led by Andrew.

16 Nov 2025

de Correspondent

The claims about increasingly smart AI models?

More vibe than science. Luc comments.

13 Nov 2025

NBC News

AI Revolution – NBC News discuss latest OII study exploring AI evaluation

The NBC Morning News programme discuss the findings from Andrew's latest study which finds weaknesses in how AI systems are evaluated.

09 Nov 2025

The Register

AI benchmarks are a bad joke - and LLM makers are the ones laughing

Covers our research finding that many AI benchmarks do not measure the right things.

07 Nov 2025

Gizmodo

AI Capabilities May Be Overhyped on Bogus Benchmarks, Study Finds

Covers our Measuring What Matters study on the construct validity of AI benchmarks.

06 Nov 2025

NBC News

AI’s capabilities may be exaggerated by flawed tests, according to new study

Researchers behind a new study say that the methods used to evaluate AI systems’ capabilities routinely oversell AI performance and lack scientific rigour.

06 Nov 2025

The Guardian

Experts find flaws in hundreds of tests that check AI safety and effectiveness

Scientists say almost all have weaknesses in at least one area that can ‘undermine validity of resulting claims’ with commentary and latest research findings from Andrew.

04 Nov 2025

Tech Policy Press

Why We Shouldn’t Trust Facial Recognition’s Glowing Test Scores

Failures in facial recognition technology are far from uncommon, and numerous examples continue to be reported in the press. Despite these repeated failures, the technology is rapidly being integrated into our daily lives, Juliette, Teo, and Luc write.

18 Aug 2025

The Daily Telegraph

ChatGPT is driving people mad

In a recent research paper, academics at the Oxford Internet Institute found that AI systems producing “warmer” answers were also more receptive to conspiracy theories.

17 Aug 2025

BMA The Doctor

Bot-ched advice – ‘disturbing’ results in AI study

Rebecca and Andrew comments on our study showing that LLM chatbots can perform worse when interacting with humans than when assessed using benchmarks.

10 Jul 2025

Oxford Internet Institute

Do language models have an issue with gender?

Feature piece by Sofia about our study on how to best evaluate if language models perpetuate gender stereotypes.

09 Jun 2025

TechCrunch

People struggle to get useful health advice from chatbots, study finds

Coverage of Andrew's study showing that people using AI chatbots for medical self-diagnosis did not make better decisions than people using traditional sources.

05 May 2025

Le Soir

Une équipe de l’UCLouvain découvre une faille dans le RGPD : « Rester anonyme sur internet est presqu’impossible »

Coverage of our research showing that data considered anonymous can still identify web users.

09 Jan 2025

University of Oxford

Pioneering new mathematical model could help protect privacy and ensure safer use of AI

University coverage of our scaling law for identification techniques and privacy risks.

09 Jan 2025

Luc Rocher (they/them)

Research

Press

The friendlier the AI chatbot the more inaccurate it is, study suggests

Study: Friendly AI chatbots may be less accurate

Friendlier LLMs tell users what they want to hear - even when it is wrong

Why you don’t want your AI chatbot to be nice to you

Friendly AI chatbots more likely to support conspiracy theories, study finds

Friendly chatbots make more mistakes

UK Biobank faces questions about data security after latest breach

Chinese website sells medical information of 500,000 volunteers from UK database

What is the UK Biobank project and what are the privacy concerns around it?

What is UK Biobank? Medical database that’s a victim of own success

Now you can break up with big tech at a bar: ‘cybersecurity disguised as a party’

Americans ask AI for health care. Hospitals think the answer is more chatbots.

Dr ChatGPT doesn’t help you any better than Dr Google, and that’s not because of the AI models’ ‘knowledge.’

Health advice from AI chatbots is frequently wrong, study shows

AI no better than other methods for patients seeking medical advice, study shows

AI chatbots are no better at medical advice than a search engine

Frustrated by the Medical System, Patients Turn to A.I

The claims about increasingly smart AI models?

AI Revolution – NBC News discuss latest OII study exploring AI evaluation

AI benchmarks are a bad joke - and LLM makers are the ones laughing

AI Capabilities May Be Overhyped on Bogus Benchmarks, Study Finds

AI’s capabilities may be exaggerated by flawed tests, according to new study

Experts find flaws in hundreds of tests that check AI safety and effectiveness

Why We Shouldn’t Trust Facial Recognition’s Glowing Test Scores

ChatGPT is driving people mad

Bot-ched advice – ‘disturbing’ results in AI study

Do language models have an issue with gender?

People struggle to get useful health advice from chatbots, study finds

Une équipe de l’UCLouvain découvre une faille dans le RGPD : « Rester anonyme sur internet est presqu’impossible »

Pioneering new mathematical model could help protect privacy and ensure safer use of AI