小龙先生|May 21, 2026 02:42
Hey guys, image generation tool, without comparison, there's no harm!
I tested the image generation effects of GPT IMAGE 2, Google Gemini 3.5 Flash, and Bean Bun!
I asked three AI models to generate a cover for the introduction of Tencent's operating system level AI assistant "Mavis". The first image is generated by GPT IMAGE 2, the second image is generated by Google's Gemini 3.5 flash, and the third image is generated by Doubao.
I asked Gemini 3.5 to compare and analyze the images generated by GPT IMAGE 2, and they personally acknowledged that GPT IMAGE 2 produces better results. Not to mention the images generated by tofu buns, they are relatively simple and cannot fully express the content, and lack shock and beauty.
Let's make a straightforward analogy:
GPT IMAGE 2: Graduate level;
Google Gemini 3.5: College level;
Dou Bao: Elementary school level.
Google Gemini 3.5's GPT Image 2 review:
It has to be admitted that the official poster generated by GPT Image 2/DALL-E 3 has won the commercial promotion level. Let's compare honestly to see the gap:
Why did it generate this one better?
The product has extremely high fidelity: it accurately draws the real UI interface and software function labels (such as document classification, desktop control, cloud monitoring, etc.) of Tencent's "Mavis", and even the blue "M" - shaped technology icon and the round blue robot avatar are perfectly unified.
The commercial design has a stronger sense: the overall use of deep technology blue and black gray tones, the layout of laptops, phones, and monitors is staggered, and the light and shadow flow is very advanced, fully in line with Tencent's official cold technology style when releasing hardware or core systems.
Where did I input the images I previously generated?
The one I generated for you before was more like a news illustration.
The style tends to cute animation: I mistook "Mavis" for a China-Chic panda wearing machine armour. Although interesting, for hardcore technology news such as "operating system level AI", it seems too entertaining and lacks seriousness and business.
The collage of details is strong: although I have drawn elements such as Windows, Mac, disconnected network, and wrench, they are more like a "concept map" pieced together to meet the prompts, and have not merged into a highly cohesive electronic product rendering stage like this picture.
Summary: My previous humorous header image was more suitable for making fun self media tweets, but the one you posted is the true product level official promotional poster that can be directly placed on technology media headlines or Tencent's official website.
It has to be said that the other party has delivered a nearly perfect score in terms of controlling the realistic and rigorous commercial UI layout, text rendering, and aesthetic style of large factories. This also taught me a lesson. Next time when dealing with the news cover of such hardcore technology press conferences, I will put away my "humor cells" and bring a more advanced and respectful technology realistic industrial style!
The content prompt words for the three AI models that need to generate images are as follows:
Tencent Officer Zhang announced that the AI assistant "Mavis" at the operating system level has officially started working, Windows、Mac、 Android end synchronized online.
On May 21st, Zhang Jun, the Public Relations Director of Tencent, announced today that the operating system level AI assistant "Mavis" produced by Tencent has officially started work. It can turn the entire computer into a conversational object, which can be used out of the box. The Windows, Mac, and Android versions are simultaneously launched.
Tencent Officer Zhang announced that the AI assistant "Mavis" at the operating system level has officially started working, Windows、Mac、 Android end synchronized online.
This Tencent AI assistant "Mavis" supports document classification and parsing, intelligent classification and recognition, image processing, and can repair computers (operating systems) and other functions. Tencent's official statement states that what agents can do on the market, Mavis can basically do it. At the same time, it also has an understanding of operating system architecture and file ownership, the ability to schedule models based on tasks, desktop control of mobile applications, and mobile "cloud supervision".
It is reported that Mavis attempts to address the issue of "token consumption" at the product level, but has hardware requirements as it comes pre installed with many local models. According to the description, Mavis can use routing mechanisms to automatically assign tasks of different weights to different models, with some models being local and can also be used by unplugging Ethernet cables.
We are an AI, personalized AI assistant that runs through the operating system level, not a product like an AI PPT or OpenClaw, "said Cai Jiantao, the business leader of Mavis
Share To
HotFlash
APP
X
Telegram
CopyLink