The continuing gap between the capabilities of Gemini Pro 3.1 (a very good model) and the capabilities of the Gemini app/website is odd. The model can do what Claude/GPT can do, but there is a minimal harness for tools (file creation, research etc), no auditable thinking trace/actions, manual canvas, etc.
The reason this is odd is that Google is trusted by enterprises & has the compute to burn, so a good harness would solve so many of Gemini’s gaps and make it an easier sell to companies. Gemini can make Office documents, for example, but the harness doesn’t allow it to do so on the website or app. It could also decide when to use other Google tools (and Google has a lot of very good AI tools) and apply them, taking advantage of the ecosystem, but it doesn’t consistently.
I assume something will be coming out here eventually, but the gap with Claude and ChatGPT has only been growing.
The reason this is odd is that Google is trusted by enterprises & has the compute to burn, so a good harness would solve so many of Gemini’s gaps and make it an easier sell to companies. Gemini can make Office documents, for example, but the harness doesn’t allow it to do so on the website or app. It could also decide when to use other Google tools (and Google has a lot of very good AI tools) and apply them, taking advantage of the ecosystem, but it doesn’t consistently.
I assume something will be coming out here eventually, but the gap with Claude and ChatGPT has only been growing.