Google pins its hopes on Gemini to leapfrog GPT-4 - FT中文网
登录×
电子邮件/用户名
密码
记住我
请输入邮箱和密码进行绑定操作:
请输入手机号码,通过短信验证(目前仅支持中国大陆地区的手机号):
请您阅读我们的用户注册协议隐私权保护政策,点击下方按钮即视为您接受。
观点 谷歌

Google pins its hopes on Gemini to leapfrog GPT-4

Top-of-the-range Ultra is the search group’s best weapon in the race to turn generative AI into a useful everyday tool

This week’s release of Gemini, a family of large language models, will give Google a stronger platform to fight back against OpenAI, the company behind ChatGPT, and Microsoft

It has taken a year, but Google has finally delivered a coherent response to the surprise challenge to its dominance in artificial intelligence that came with the launch of ChatGPT.

This week’s release of Gemini, a family of large language models, will give it a stronger platform to fight back against both OpenAI, the company behind ChatGPT, and Microsoft, which has used OpenAI’s models to supercharge all its software and cloud services this year.  

The question now is whether Gemini can make a meaningful difference to Google’s existing services — and, perhaps even more important, whether it can become a foundation for a new range of services that carry AI much deeper into everyday life.

With the three “flavours” of Gemini announced this week, Google is finally stamping its mark on a technology that its own researchers did much to pioneer, but which OpenAI’s ChatGPT carried into the mainstream. The Pro version, for instance, is positioned squarely against OpenAI’s GPT-3.5, the model behind the free version of ChatGPT and the workhorse for many of the first generative AI applications from other companies that have hit the market this year.

The smaller Gemini Nano is matched against systems such as the smallest version of LLaMa 2, Facebook’s open-source model, making it capable of being run on a mobile device. Apple, as always, is taking a considered approach before bringing generative AI to the iPhone, but the appearance of Gemini on Google’s latest Pixel handset is a sign that it can’t afford to wait too long.

It is the top-of-the-line Gemini Ultra, due out early next year, that carries Google’s main hopes of matching or leapfrogging OpenAI’s GPT-4 in the race to turn generative AI into a more useful everyday tool. The company fell behind this year, but has some clear advantages that could help bring Gemini to a big market in 2024.

One is distribution. Google said this week, for instance, that Gemini will be added to Chrome, which has more than 60 per cent of the browser market, giving billions of web users instant access to tools that are able to do things such as analyse the content of web pages.

As Google flexes its existing market power like this to boost its AI ambitions, competition regulators will be watching closely.

Another advantage for Google is the uncertainty around OpenAI. After the shock sacking and reinstatement of chief executive Sam Altman last month, the many businesses that have built their own generative AI plans on top of OpenAI’s models will be looking to hedge their bets.

The search company will also be hoping that its Bard chatbot will do a better job of rivalling ChatGPT now that it has a better language model behind it. But its best hope of regaining an edge may lie in being the first to come up with the next breakthrough services powered by generative AI. Some of the capabilities claimed for Gemini point to where Google thinks these might lie.

It has made much, for instance, of the fact that Gemini was designed from the outset to be “multimodal” — that is, able to understand not just text but also images, video and audio. According to Google, that makes it better suited than models such as GPT-4 to deal with the sort of everyday situations that rely on senses such as sight and hearing.

That may be a step towards AI systems that are better able to operate in the real world. But it is too soon to tell what applications this could make possible, or whether Google really has achieved the technical superiority it claims.

Another avenue for development lies in what Google claims are Gemini’s reasoning and planning capabilities. These are the kind of skills that could prepare the ground for personal assistants able to tackle complex problems and set a plan of action.

If such assistants are linked to other internet services, they could also become agents, taking action on their users’ behalf. Imagine a shopping agent, for instance, that not only hunts out the products you want but goes ahead and pays for them as well.

This is already shaping up to be one of the key AI battles of 2024 and beyond. OpenAI took a first step in this direction last month when it said its users would be able to build rudimentary agents on top of its models, then offer them for sale on an OpenAI app store. That could point to the next big AI breakthrough beyond ChatGPT — and this time, Google has no intention of being left behind.

richard.waters@ft.com

版权声明:本文版权归FT中文网所有,未经允许任何单位或个人不得转载,复制或以任何其他方式使用本文全部或部分,侵权必究。

造就埃隆•马斯克的神话

这位科技亿万富翁对唐纳德•特朗普的支持是其世界观的一部分,这种世界观来自硅谷最狂野的边界。

投资者警告称,强势美元将冲击新兴市场债券

新兴市场债务基金遭遇资金外流,因为发展中国家降息的希望破灭。

吉赛尔•佩利科,震惊法国的审判的核心人物

在法庭审理她如何被丈夫下药并被陌生人强奸时,她表现出了非凡的力量。

安东尼奥•科斯塔:“特朗普为什么要与欧洲打贸易战?”

欧洲理事会新任主席谈跨越政治分歧开展业务、面对腐败调查,以及为什么欧洲在危机中能发挥最大作用。

来自罗马的明信片:向好莱坞明星展示永恒之城秘密的“角斗士导游”

历史学家亚历山大•马里奥蒂是《角斗士II》的顾问,他兼职为汤姆•克鲁斯、比尔•盖茨和罗素•克劳做向导。

海上石油又回来了,但代价是什么?

在发生了历史上最严重的泄漏事故多年之后,公司为了寻找新的发现,正在钻探更深的海底钻井。
设置字号×
最小
较小
默认
较大
最大
分享×