Generative AI is sowing the seeds of doubt in serious science

The writer is a science commentator

Large language models like ChatGPT are purveyors of plausibility. The chatbots, many based on so-called generative AI, are trained on vast amounts of text scraped from the internet and respond to user questions by assembling coherent answers, churning out convincing student essays, authoritative legal documents and believable news stories.

But, because publicly available data contains misinformation and disinformation, some machine-generated texts might not be accurate or true. That has triggered a scramble to develop tools to identify whether text has been drafted by human or machine. Science is also struggling to adjust to this new era, with live discussions over whether chatbots should be allowed to write scientific papers or even generate new hypotheses.

The importance of distinguishing artificial from human intelligence is growing by the day. This month, UBS analysts revealed ChatGPT was the fastest-growing web app in history, garnering 100mn monthly active users in January. Some sectors have decided there is no point bolting the stable door: on Monday, the International Baccalaureate said pupils would be allowed to use ChatGPT to write essays, provided they referenced it.

In fairness, the tech’s creator is upfront about its limitations. Sam Altman, OpenAI’s chief executive, warned in December that ChatGPT was “good enough at some things to create a misleading impression of greatness . . . we have lots of work to do on robustness and truthfulness.” The company is developing a cryptographic watermark for its output: a secret, machine-readable pattern of punctuation, spellings and word order. It is also honing a “classifier” to tell synthetic from human-generated text, trained on examples of both.

Eric Mitchell, a graduate student at Stanford University, figured a classifier would take a lot of training data. Along with colleagues, he came up with DetectGPT, a “zero-shot” approach to spotting the difference, meaning the method needs no training on labelled examples of human and machine text. Instead, it turns a chatbot on itself, to sniff out its own output.

It works like this: DetectGPT asks a chatbot how much it “likes” a sample text, with the “liking” a shorthand for how similar the sample is to its own creations. DetectGPT then goes one step further: it “perturbs” the text, slightly altering the wording, and asks again. The assumption is that rewording machine-generated text reliably lowers the chatbot’s “liking”, whereas rewording human-written text can push it up or down. In early tests, the researchers claim, the method correctly distinguished between human and machine authorship 95 per cent of the time.
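
For readers who want to see the shape of that comparison, here is a minimal sketch in Python. It is not the researchers' implementation. The scoring function `toy_log_prob`, the rewording function built by `make_toy_perturb`, and the word lists are all hypothetical stand-ins for a real language model, invented here purely for illustration. The only idea taken from the description above is the comparison itself: score the sample, score many slightly reworded copies, and look at the gap.

```python
import random
import statistics
from typing import Callable


def perturbation_discrepancy(
    text: str,
    log_prob: Callable[[str], float],
    perturb: Callable[[str], str],
    n_perturbations: int = 20,
) -> float:
    """Return log_prob(text) minus the mean log_prob of reworded copies.

    A clearly positive value means rewording tends to make the text less
    "liked" by the model, which this approach reads as a hint that the
    model may have written it.
    """
    original = log_prob(text)
    reworded = [log_prob(perturb(text)) for _ in range(n_perturbations)]
    return original - statistics.mean(reworded)


# --- Toy stand-ins below: assumptions for the demo, not a real model. ---

# Hypothetical per-word scores: the toy "model" likes these words.
TOY_WORD_LOGP = {"the": -1.0, "cat": -2.0, "sat": -2.0, "on": -1.5, "mat": -2.5}
UNKNOWN_LOGP = -6.0  # score for any word the toy model has not seen
SWAP_POOL = ["the", "cat", "sat", "on", "mat", "zebra", "quietly", "under"]


def toy_log_prob(text: str) -> float:
    """Sum of per-word scores; higher means the toy model "likes" it more."""
    return sum(TOY_WORD_LOGP.get(word, UNKNOWN_LOGP) for word in text.split())


def make_toy_perturb(seed: int) -> Callable[[str], str]:
    """Build a rewording function that swaps one random word for another."""
    rng = random.Random(seed)  # seeded so the demo is repeatable

    def perturb(text: str) -> str:
        words = text.split()
        words[rng.randrange(len(words))] = rng.choice(SWAP_POOL)
        return " ".join(words)

    return perturb


machine_like = "the cat sat on the mat"             # built from the model's favourite words
human_like = "a zebra dozed quietly under my porch"  # words the toy model has never seen

machine_score = perturbation_discrepancy(
    machine_like, toy_log_prob, make_toy_perturb(seed=0), n_perturbations=200
)
human_score = perturbation_discrepancy(
    human_like, toy_log_prob, make_toy_perturb(seed=1), n_perturbations=200
)
print("machine-like score:", machine_score)
print("human-like score:", human_score)
```

In this toy setup the machine-like sentence should score higher than the human-like one, because swapping a word usually makes it less to the toy model's taste. A real detector would replace both stand-ins with an actual language model and a more careful rewording step, and would still need a threshold for deciding what counts as "machine".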

There are caveats: the results are not yet peer-reviewed; the method, while better than random guessing, did not work equally reliably across all generative AI models. DetectGPT could also be fooled by human tweaks to synthetic text.

What does all this mean for science? Scientific publishing is the lifeblood of research, injecting ideas, hypotheses, arguments and evidence into the global scientific canon. Some have been quick to alight on ChatGPT as a research assistant, with a handful of papers controversially listing the AI as a co-author.

Meta even launched a science-specific text generator called Galactica. It was withdrawn three days later. Among the howlers it produced was a fictitious history of bears travelling in space.

Professor Michael Black of the Max Planck Institute for Intelligent Systems in Tübingen tweeted at the time that he was “troubled” by Galactica’s answers to multiple inquiries about his own research field, including attributing bogus papers to real researchers. “In all cases, [Galactica] was wrong or biased but sounded right and authoritative. I think it’s dangerous.”

The peril comes from plausible text slipping into real scientific submissions, peppering the literature with fake citations and forever distorting the canon. The journal Science now bans generated text outright; Nature permits its use if declared but forbids crediting it as co-author.

Then again, most people don’t consult high-end journals to guide their scientific thinking. Should the devious be so inclined, these chatbots can spew an on-demand stream of citation-heavy pseudoscience on why vaccination doesn’t work, or why global warming is a hoax. That misleading material, posted online, can then be swallowed by future generative AI to produce a new iteration of falsehoods that further pollutes public discourse.

The merchants of doubt must be rubbing their hands.
