The previous post tested DeepSeek Janus Pro, which is the best image-to-text model I have tested so far. So how does the well-known Florence2 actually perform?


Test images (the same five as before):


ms__00065_.png, ms__00063_.png, ms__00049_.png, ms__00026_.png, ms__00029_.png

GPU: RTX 3060 12GB; RAM: 64GB DDR4
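The post does not say how the generation times below were measured. As a rough sketch of how such per-image timings could be collected, the snippet below wraps any captioning callable with a wall-clock timer. The `time_caption` helper and `stub_captioner` are hypothetical names; the stub only stands in for a real model call.

```python
import time
from typing import Callable, Tuple


def time_caption(captioner: Callable[[str], str], image_path: str) -> Tuple[str, float]:
    """Run a captioner on one image and return (caption, elapsed seconds)."""
    start = time.perf_counter()
    caption = captioner(image_path)
    elapsed = time.perf_counter() - start
    return caption, elapsed


def stub_captioner(image_path: str) -> str:
    # Placeholder standing in for a real Florence2 call (assumption, not the author's setup).
    return f"caption for {image_path}"


if __name__ == "__main__":
    test_images = [
        "ms__00065_.png", "ms__00063_.png", "ms__00049_.png",
        "ms__00026_.png", "ms__00029_.png",
    ]
    for name in test_images:
        caption, seconds = time_caption(stub_captioner, name)
        print(f"{name}: {seconds:.3f}s, {caption}")
```

Timing only the call itself, with the model already loaded, keeps model-loading cost out of the per-image number.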


Florence2 Evaluation


1. Florence2-base model

Model: Florence2-base-ft
Columns: image, output (English), output (Chinese translation via Google), notes
ms__00065_.png

A green frog is sitting in the water. The water is surrounded by brown bricks. There are green plants around the frog. 

一隻綠色的青蛙坐在水裡。水被棕色的磚塊包圍著。青蛙的周圍有綠色植物。

Generation time: 0.68s



ms__00063_.png

A cartoon character is holding a barbell. The barbell is silver and silver. The character is wearing a yellow shirt and gray shorts. The shoes are red and white.

一個卡通人物手裡拿著槓鈴。槓鈴是銀色的,銀色的。該角色穿著黃色襯衫和灰色短褲。鞋子是紅色和白色的。

Generation time: 0.798s



ms__00049_.png

This is an animated image of two girls. The girls are holding a heart. The heart is red. The girl on the left is wearing a backpack. The backpack is brown. The other girl has brown hair. The ears on the girls head are brown. There are many hearts in the background. The hearts are red. There is a man in a suit behind the girls.

這是兩個女孩的動畫形象。女孩們捧著一顆心。心是紅色的。左邊的女孩背著背包。背包是棕色的。另一個女孩有著棕色的頭髮。女孩頭上的耳朵是棕色的。背景中有很多顆心。心是紅色的。女孩的身後是一名穿著西裝的男人。

Generation time: 1.395s

Error: "two girls"


ms__00026_.png

A man is in a pot of water. The pot is large and black. There is fire behind the pot. The man is shirtless.

一名男子在一鍋水中。鍋子很大,呈黑色。鍋後面有火。男人光著膀子。

Generation time: 0.70s

 

ms__00029_.png

There is a brown bear standing on the ground. There are large gray rocks next to the bear. There is grass and small rocks on top of the grass.

有一隻棕熊站在地上。熊的旁邊有巨大的灰色岩石。草地上長滿了草,草上面還有小石頭。

Generation time: 0.73s


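Conclusion: Extremely fast, and the responses are very terse, but the descriptions do not appear to be wrong. (It seems fine for simple image generation.)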
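For readers who want to try a comparable run in plain Python, here is a minimal sketch using Hugging Face `transformers`. It is not the author's setup. The checkpoint name `microsoft/Florence-2-base-ft` and the `<MORE_DETAILED_CAPTION>` task prompt are assumptions, since the post does not state which checkpoint source or task was used. The generation settings are illustrative, and behavior may vary with the installed `transformers` version.

```python
TASK_PROMPT = "<MORE_DETAILED_CAPTION>"    # assumed task; the post does not say which one was used
MODEL_ID = "microsoft/Florence-2-base-ft"  # assumed Hugging Face checkpoint for "Florence2-base-ft"


def caption_image(image_path: str, model_id: str = MODEL_ID, task: str = TASK_PROMPT) -> str:
    """Caption one image with Florence-2 via Hugging Face transformers."""
    # Heavy imports stay local so this module can be imported without them installed.
    import torch
    from PIL import Image
    from transformers import AutoModelForCausalLM, AutoProcessor

    device = "cuda" if torch.cuda.is_available() else "cpu"
    model = AutoModelForCausalLM.from_pretrained(model_id, trust_remote_code=True).to(device).eval()
    processor = AutoProcessor.from_pretrained(model_id, trust_remote_code=True)

    image = Image.open(image_path).convert("RGB")
    inputs = processor(text=task, images=image, return_tensors="pt").to(device)
    with torch.no_grad():
        generated_ids = model.generate(
            input_ids=inputs["input_ids"],
            pixel_values=inputs["pixel_values"],
            max_new_tokens=1024,
            num_beams=3,
        )
    raw_text = processor.batch_decode(generated_ids, skip_special_tokens=False)[0]
    # post_process_generation returns a dict keyed by the task prompt.
    parsed = processor.post_process_generation(
        raw_text, task=task, image_size=(image.width, image.height)
    )
    return parsed[task]
```

This sketch reloads the model on every call for simplicity. For timing comparisons, load the model once and time only the `generate` step.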

2. Florence2-large


Model: Florence2-large
Columns: image, output (English), output (Chinese translation via Google), notes
ms__00065_.png

The image is a digital illustration of a green frog sitting in a small pond. The frog is sitting on its hind legs with its front paws resting on the edge of the pond. It has big, round eyes and a big smile on its face. The pond is surrounded by lush greenery, including trees, plants, and flowers. The sky is blue with white clouds and there are a few birds flying in the background. The overall mood of the image is cheerful and playful.

該圖像是一隻坐在小池塘裡的綠蛙的數位插圖。青蛙坐在後腿上,前爪擱在池塘邊。它有一雙又大又圓的眼睛,臉上掛著燦爛的笑容。池塘周圍綠樹成蔭,有樹木、植物和鮮花。天空蔚藍,飄著白雲,背景有幾隻鳥在飛翔。圖像的整體氛圍是歡快而有趣的。

Generation time: 2.30s



ms__00063_.png

The image is a digital illustration of a young boy lifting a barbell. He is wearing a yellow shirt, black shorts, and red sneakers. His hair is styled in a messy, spiky manner and he has a surprised expression on his face. The background is white and there are small pieces of debris scattered around the boy. The boy is holding the barbell with both hands and appears to be in the middle of a workout.

圖片是一個小男孩舉起槓鈴的數位插圖。他穿著黃色襯衫、黑色短褲和紅色運動鞋。他的頭髮凌亂而尖銳,臉上帶著驚訝的表情。背景是白色的,男孩周圍散落著小塊碎片。男孩雙手握住槓鈴,看起來正在鍛鍊。

Generation time: 2.21s



ms__00049_.png

The image shows a boy and a girl standing close together and holding hands. The boy is on the left side of the image, wearing a black suit and a backpack. He has short dark hair and is looking at the girl with a smile on his face. The girl on the right side is wearing a red coat and has a bunny ears headband on her head. She is holding a red heart-shaped object in her hands. In the background, there are other people and buildings, and the sky is filled with pink and orange hearts. The image appears to be a romantic scene.

照片中,一個男孩和一個女孩緊緊站在一起,手牽手。男孩位於圖片左側,穿著黑色西裝,背著背包。他有一頭黑色的短髮,臉上帶著微笑看著女孩。右側女孩穿著紅色外套,頭上戴著兔子耳朵頭帶。她手裡拿著一個紅色的心形物體。背景中還有其他人和建築物,天空中充滿了粉紅色和橙色的心形。這張圖看起來是一幅浪漫的場景。

Generation time: 2.68s

 

ms__00026_.png

The image shows a young man in a large black cauldron on a fire. The cauldron is filled with water and the man is splashing around with his arms and legs. The fire is burning brightly in the background, with orange and yellow flames reaching up to the sky. The man's face is contorted in a fierce expression, with his eyes wide open and his mouth slightly open, as if he is shouting or shouting. The overall mood of the image is intense and dramatic.

影像顯示,一名年輕男子站在火上的一個黑色大鍋中。大鍋裡盛滿了水,男子揮舞著手臂和腿在水中嬉戲。背景中火光熊熊,橙色和黃色的火焰直衝天空。男子臉部扭曲,表情凶狠,雙眼睜大,嘴巴微張,似在吶喊,似在呼喊。圖像的整體氛圍強烈而富有戲劇性。

Generation time: ---


ms__00029_.png

The image shows a 3D rendering of a brown bear walking on a rocky terrain. The bear is standing on its hind legs with its front paws in the air and its front legs stretched out in front of it. It appears to be balancing on a large rock, with its head turned to the side and its mouth slightly open. The background shows a mountain range and a cloudy sky. The ground is covered in grass and rocks, and the overall mood of the image is peaceful and serene.


圖中顯示了一隻棕熊在岩石地形上行走的 3D 渲染圖。這隻熊用後腿站立,前爪懸空,前腿在身體前方伸展。它似乎站在一塊大石頭上保持平衡,頭轉向一側,嘴巴微微張開。背景是一座山脈和一片多雲的天空。地面上覆蓋著青草和岩石,影像的整體氛圍是平靜而安詳的。

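Generation time: 2.21s

Error: "balancing on a large rock"

Conclusion: The answers are decent and the response is fast, but it failed the fifth image, the "final boss".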
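The error noted for the fifth image is a specific hallucination: the caption claims the bear is balancing on a rock. When screening many captions for this one failure, a crude keyword check can flag candidates for manual review. The sketch below is only illustrative. `mentions_balancing` is a hypothetical helper, and a substring match cannot judge whether a caption is actually correct.

```python
def mentions_balancing(caption: str) -> bool:
    """Return True if the caption contains a 'balance/balancing' claim (crude substring check)."""
    return "balanc" in caption.lower()


# Excerpts copied from the outputs quoted in this post.
large_bear = "It appears to be balancing on a large rock, with its head turned to the side and its mouth slightly open."
base_ft_bear = "There is a brown bear standing on the ground. There are large gray rocks next to the bear."

if __name__ == "__main__":
    print("Florence2-large flagged:", mentions_balancing(large_bear))
    print("Florence2-base-ft flagged:", mentions_balancing(base_ft_bear))
```

A flagged caption still needs a human look, since the word can also appear in a correct description.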




3. Florence2-SD3-Captioner model

Model: Florence2-SD3-Captioner
Columns: image, output (English), output (Chinese translation via Google), notes
ms__00065_.png

A 3D rendering of a green toad sitting in a pool of water. The frog's body is positioned in the center of the frame. Its eyes are wide open, and its mouth is slightly open. Its feet are positioned in front of the frog's mouth. Its legs are wrapped around each other, and it's arms are resting on the edge of the pool. The pool is a light blue color, and the water is reflecting the frog and the surrounding surrounding the frog. There is a pink flower in the bottom right corner of the photo. The sky is a bright blue with white clouds, and a sun is shining through the clouds.

一隻綠蟾蜍坐在水池中的 3D 渲染圖。青蛙的身體位於畫面的中央。它的眼睛睜得大大的,嘴巴微微張開。它的腳位於青蛙嘴的前面。它的雙腿互相纏繞,雙臂擱在泳池邊。水池是淡藍色的,水中倒映著青蛙和青蛙周圍的環境。照片右下角有一朵粉紅色的花。天空湛藍,白雲朵朵,一輪太陽透過雲層照射下來。

Generation time: 1.77s



ms__00063_.png

A cartoon image of a little boy in a yellow shirt and gray shorts is holding a barbell. The barbell is gray and silver. The boy has dark brown hair that is styled in a ponytail. He is wearing earrings on each ear. His eyes are open and he has a smile on his face. His mouth is slightly open and his tongue is sticking out. The background behind the boy is a light gray color.

一個身穿黃色襯衫和灰色短褲的小男孩的卡通形象正舉著槓鈴。槓鈴是灰色和銀色的。男孩有一頭深棕色的頭髮,綁成馬尾。他的每隻耳朵都戴著耳環。他睜著眼睛,臉上帶著微笑。他的嘴微微張開,舌頭伸出。男孩身後的背景是淺灰色。

Generation time: 1.38s



ms__00049_.png

Two cartoon characters are standing side by side. They are facing each other and are holding a heart in their hands. The girl on the left is wearing a short sleeve shirt, black pants, and a brown backpack. She has dark brown hair that is tied in a ponytail. Her eyes are blue and her mouth is slightly open. Her ears are pointed. Her hair is brown and short. She is holding the heart in her right hand. The heart is bright and red. There are hearts in the background that are white.

兩個卡通人物並肩站著。他們面對面,手裡捧著一顆心。左邊的女孩穿著短袖襯衫、黑色褲子,背著棕色的背包。她有一頭深棕色的頭髮,綁成馬尾。她的眼睛是藍色的,嘴巴微微張開。她的耳朵很尖。她的頭髮是棕色的,很短。她的右手握著心臟。心臟是明亮的紅色。背景中有白色的心形。

Generation time: 1.65s

Error: the person on the right is not described, and the gender of the person on the left is wrong.


ms__00026_.png

A man with short dark brown hair and dark eyes is sitting in a large pot of water. The pot is a dark brown color and has two handles on the sides of it. The man's arms are stretched out in front of him, and his mouth is open as if he is screaming. There is a fire burning behind the pot on the ground, and the fire is burning in the background.

一名有著深棕色短髮和黑眼睛的男子正坐在一鍋大水中。這個花盆是深棕色的,兩側有兩個把手。男人的雙臂在身體前方伸展,嘴巴張開,彷彿正在尖叫。地上的鍋子後面燃燒著火,背景中也燃燒著火。

Generation time: 1.48s

 

ms__00029_.png

A large brown bear is standing on top of a hill. There is a large gray boulder on the ground in front of the bear. There are small gray rocks under the bear's feet.

一隻大棕熊站在山頂上。熊面前的地面上有一塊巨大的灰色巨石。熊的腳下有一些灰色的小石頭。

Generation time: 0.9s


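Conclusion: Extremely fast, and the responses are acceptable. The amount of information varies from image to image: the third image is missing some details, but surprisingly the fifth image came out without errors. (Usable for image generation.)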


4. Florence2-Flux-Large model

image.png

image.png

Model: Florence2-Flux-Large
Columns: image, output (English), output (Chinese translation via Google), notes
ms__00029_.png

The image is a digital illustration of a brown bear standing on its hind legs on a rocky hillside. The bear is facing towards the right side of the image and its front paws are stretched out in front of it. It appears to be balancing on top of a large rock. The rock is gray and has a rough texture. The hillside is covered in grass and rocks, and there is a mountain in the background. The sky is cloudy and the overall mood of the illustration is peaceful and serene.

圖像是一張數位插圖,描繪的是一隻棕熊站在岩石山坡上的後腿上。熊面向影像的右側,前爪伸展在前面。它看起來正站在一塊大石頭上保持平衡。岩石呈灰色,質地粗糙。山坡上長滿了草和岩石,背景有一座山。天空多雲,插畫的整體氛圍是平靜而安詳的。

Generation time: 2.29s

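Error: "balancing on top of a large rock"

Conclusion: Responses are generally acceptable and so is the speed, but it failed the fifth "final boss" image with an obvious error.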


5. Florence2-PromptGen-v1.5-large model

Model: PromptGen-v1.5-large
Columns: image, output (English), output (Chinese translation via Google), notes
ms__00065_.png

a digital illustration in a whimsical, cartoonish style, it depicts a green frog sitting in a small, circular pond, the frog has large, expressive brown eyes and a friendly, slightly mischievous expression, its body is covered in small, dark spots, and its hands are clasped together in front of its chest, the pond is surrounded by lush, green foliage, with a few small rocks and a single pink flower at the bottom right corner, the background features a bright, sunny day with a clear blue sky and fluffy white clouds, through a large, arched window, the tree trunks and branches are visible, adding to the natural, serene atmosphere, the overall color palette is vibrant and cheerful, with soft, pastel hues that enhance the whimsical and dreamlike quality of the scene, the illustration is highly detailed, with smooth textures and a polished finish, typical of high-quality digital art

這是一幅異想天開的卡通風格的數位插圖,描繪了一隻坐在圓形小池塘里的綠色青蛙,這只青蛙有一雙大大的、富有表現力的棕色眼睛,表情友好而略帶調皮,它的身體上佈滿了小而黑的斑點,它的雙手在胸前合攏,池塘周圍是茂密的綠葉,右下角有幾塊小石頭和一朵粉色的花,背景是陽光明媚的日子,天空湛藍和蓬鬆的白雲,透過一扇大拱形窗戶,可以看到樹幹和樹枝,增添了自然、寧靜的氛圍,整體色調充滿活力和歡快,柔和的色調增強了異想天開和夢幻般的品質場景中,插圖非常細緻,紋理光滑,表面拋光,是高品質數位藝術的典型特徵

Generation time: 3.79s



ms__00063_.png

a highly detailed, digital illustration in a cartoon style, it depicts a young, chubby boy with large, expressive brown eyes and a wide, joyful smile, he has short, dark brown hair that is messy and tousled, with some strands framing his face, his cheeks are flushed, adding a sense of innocence and excitement to his expression, he is lifting a barbell with both hands, emphasizing his strength and determination, the boy is wearing a bright yellow hoodie with a small emblem on the left chest, black shorts, and red sneakers with white soles, the background is a plain, light beige color, ensuring that the focus remains on the boy, the texture of the illustration is smooth and polished, with a glossy finish that enhances the three-dimensional quality of the character, the overall mood is one of joy and enthusiasm, with the boy's joyful expression and the dumbbell in his hands symbolizing strength and resilience

這是一幅非常詳細的卡通風格數位插圖,描繪的是一個年輕、胖乎乎的男孩,有一雙大大的、富有表現力的棕色眼睛,臉上掛著燦爛、快樂的笑容,他有一頭凌亂的、深棕色的短髮,幾縷頭髮勾勒出他的臉,他的臉頰紅潤,給他的表情增添了一種天真和興奮的感覺,他雙手舉著槓鈴,強調著他的力量和決心,男孩穿著一件亮黃色的連帽衫,左胸上有一個小徽章,黑色短褲和白色鞋底的紅色運動鞋,背景是純色的淺米色,確保焦點仍然在男孩身上,插圖的紋理光滑而拋光,光澤的表面增強了三維品質角色的整體情緒是喜悅和熱情的,男孩快樂的表情和手中的啞鈴象徵著力量和韌性

Generation time: 4.15s



ms__00049_.png

a digital illustration in a vibrant, anime-inspired style, it depicts two young characters, a boy and a girl, standing close together and looking at each other lovingly, the boy, on the left, has short, dark brown hair styled in a messy, tousled manner and is dressed in a dark suit with a white shirt and a brown backpack, he has a slender physique and a small, delicate build, his eyes are large and expressive, with a soft blush on his cheeks, he is holding a glowing red heart-shaped object in his hands, which he is about to give to the girl, the girl on the right, also with short, brown hair and a red coat with a hood, she has a more voluptuous figure with larger breasts and a more curvaceous body, her hair is adorned with a hair accessory, and she has cat ears on her head, the background features a warm, golden hue, with floating heart shapes in various shades of orange and red, creating a romantic and whimsical atmosphere, the overall mood is one of love and affection, with the characters' expressions and the soft lighting enhancing the romantic ambiance

這是一幅充滿活力的動漫風格的數字插圖,描繪了兩個年輕的角色,一個男孩和一個女孩,他們緊緊地站在一起,深情地看著對方,左邊的男孩留著短短的深棕色頭髮,髮型是凌亂、蓬亂的舉止,穿著深色西裝,白色襯衫,背著棕色背包,他身材修長,身材嬌小,精緻,眼睛大而富有表現力,臉頰上泛著淡淡的紅暈,他手裡拿著他手中拿著一個閃閃發光的紅色心形物體,他正準備把它交給女孩,右邊的女孩,也是棕色短髮,穿著帶兜帽的紅色外套,身材更豐滿,胸部更大和更加曲線玲瓏的身材,頭髮上裝飾著髮飾,頭上戴著貓耳朵,背景是溫暖的金色色調,上面漂浮著各種橙色和紅色的心形,營造出一種浪漫而異想天開的氛圍氛圍,整體氛圍是愛與感情,角色的表情和柔和的燈光增強了浪漫的氛圍

Generation time: 4.81s

 


ms__00026_.png

a dramatic, hyper-realistic digital illustration depicting a young man emerging from a large, black, cast iron cauldron filled with molten lava, the man, who appears to be of east asian descent, has short, dark hair and a muscular physique, his skin is pale, and his face is contorted in a scream or intense expression, with his mouth open and eyes wide, water splashes around him, creating a dynamic and intense scene, the background is ablaze with intense orange and yellow flames, adding to the fiery atmosphere, the ground is rocky and scattered with small, dark rocks, enhancing the contrast with the vivid colors of the lava and the man's skin, the overall mood is one of intense heat and danger, with the man emerging as the focal point, the illustration is highly detailed, with a focus on realism and emotion, making it a captivating and immersive piece of art

這是一幅戲劇性的超現實主義數位插畫,描繪了一位年輕男子從裝滿熔岩的黑色鑄鐵大鍋中走出來,這名男子看起來是東亞後裔,留著黑色的短髮,體格健壯,他的皮膚蒼白,臉部扭曲,露出尖叫或緊張的表情,嘴巴張開,眼睛睜大,水在他周圍飛濺,營造出動態而緊張的場景,背景中燃燒著強烈的橙色和黃色火焰,增添了熾熱的氣氛,地面多岩石,散佈著小而暗的岩石,與熔岩和男人皮膚的鮮豔色彩形成了鮮明的對比,整體氛圍是極度炎熱和危險,而男人則是焦點,插畫細節豐富,注重真實感與情感,是一幅引人入勝、身臨其境的藝術作品

Generation time: 3.25s

 

ms__00029_.png

a highly detailed cgi rendering of a large, brown bear standing on a rocky path, the bear is rendered in a realistic, hyper-realistic style, with smooth textures and lifelike features, its fur is a rich, warm brown, and its eyes are a striking amber color, giving it a sense of intelligence and curiosity, its body is muscular and well-defined, with sharp, black claws visible on the front and back of its feet, in the background, a clear sky with a few wispy clouds is overcast, adding to the serene and natural setting, the ground is covered with a mix of brown and white rocks, and the grass is a vibrant green, contrasting with the rocky path and the bear's fur, the stone on the left is large and round, with a smooth texture, and it is balanced on its hind legs, the overall mood of one of strength and resilience, with the bear standing as a symbol of resilience and strength

高度細緻的 CGI 渲染,一隻大棕熊站在岩石小路上,這隻熊以逼真、超現實的風格呈現,具有光滑的紋理和栩栩如生的特徵,它的皮毛是豐富、溫暖的棕色,它的眼睛引人注目的琥珀色,給人一種聰明和好奇的感覺,它的身體肌肉發達,輪廓分明,腳的前部和後部有鋒利的黑色爪子,背景是晴朗的天空,上面飄著幾朵雲陰天,增添了寧靜自然的氛圍,地面上覆蓋著棕色和白色的岩石,草地是鮮綠色的,與岩石小徑和熊的皮毛形成鮮明對比,左邊的石頭很大,圓形,質地光滑,後腿部保持平衡,整體呈現力量和韌性的氛圍,站立的熊象徵著韌性和力量

Generation time: 4.32s


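Conclusion: Response speed and content are basically OK. The fifth image is not wrong, but it omits the key detail that the boulder is in front of the bear. It is usable, but occasionally leaks details like this. (When used for image-to-video, the missing information can cause the generated video to lack key details and fail.)

For example, if you want the bear to push the rock down, that cannot be achieved.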



6. Florence2-PromptGen-v2.0-large model

Model: PromptGen-v2.0-large
Columns: image, output (English), output (Chinese translation via Google), notes
ms__00065_.png

A whimsical digital illustration shoot from a frontal camera angle about a cute green frog sitting in a small pond surrounded by lush greenery. the image also shows a sunny day with a clear blue sky and fluffy white clouds. on the middle of the image, a large, expressive green frog with large, brown eyes and a sad expression is sitting in the shallow water of the pond, with its hands clasped together in front of its chest. the frog appears to be happy and content, with a calm and relaxed demeanor. it is facing the viewer, looking directly at them with a closed mouth and a slight frown on its face. its body is covered in small, dark spots, giving it a cartoonish appearance. the pond is surrounded by green plants and a small pink flower, adding a touch of nature to the scene. the overall atmosphere is peaceful and serene, with the frog taking center stage in the image.

這是一張異想天開的數位插圖,從正面攝影機角度拍攝,描繪的是一隻可愛的綠蛙坐在鬱鬱蔥蔥的綠色植物環繞的小池塘里。圖片上還顯示了一個晴朗的日子,天空湛藍,白雲朵朵。畫面中央,一隻表情豐富的大綠青蛙,有著大大的棕色眼睛,一臉悲傷,坐在池塘的淺水中,雙手合十在胸前。這隻青蛙看起來很快樂、很滿足,有著平靜而放鬆的舉止。它面向觀看者,閉著嘴,微微皺著眉頭,直視著他們。它的身體上佈滿了小而暗的斑點,使其具有卡通般的外觀。池塘周圍環繞著綠色植物和一朵粉紅色的小花,為場景增添了一絲自然氣息。整體氛圍是平和安詳的,青蛙佔據了圖像的中心。

Generation time: 4.96s



ms__00063_.png

cute, chibi-style digital illustration from a front camera angle, depicting a young boy lifting a barbell in a dynamic pose. the boy, who appears to be in his early twenties, has short, spiky black hair and large, expressive brown eyes. he is wearing a yellow button-up shirt, black shorts, and red sneakers. his facial expression is one of surprise, with his mouth open and his eyes wide, as if he is about to lift the barbell. the background is a simple gradient of light beige, with small, scattered pieces of debris around him, adding to the dynamic and energetic feel of the image.

這是一幅可愛的 Qhibi 風格數位插圖,從前置鏡頭角度拍攝,描繪了一個以動態姿勢舉起槓鈴的小男孩。這個男孩看起來二十出頭,有一頭短短的黑色尖發,還有一雙大大的、富有表現力的棕色眼睛。他穿著黃色羈扣襯衫、黑色短褲和紅色運動鞋。他的表情很驚訝,嘴巴張開,眼睛睜大,好像正要舉起槓鈴。背景是簡單漸層的淺米色,周圍散落著小塊碎片,增添了影像的動態和活力。

Generation time: 4.65s



ms__00049_.png

A digital illustration shoot from a front camera angle about a romantic scene between two young children, a boy and a girl, holding hands and giving a heart-shaped light to each other. the scene is set in an outdoor location with a warm, glowing background filled with heart shapes. the boy, on the left, is wearing a school uniform with a white shirt, black vest, and black pants, and has short, dark brown hair styled in a short, spiky manner. he is facing the viewer with a smile on his face, looking at the girl, who is also wearing a red coat with a hood and a backpack on her back. on the right, the girl is standing with her hands clasped together, holding a glowing heart in her hands. she has short brown hair, brown eyes, and a small, furry tail. both characters are facing each other and looking at each other, with the boy's expression conveying a sense of affection and warmth. the background is filled with glowing heart shapes, adding to the romantic atmosphere of the scene. the style is highly detailed and vibrant, with a focus on the characters' expressions and the glowing heart.

這是從正面攝影機角度拍攝的數位插圖,描繪了兩個小孩(一個男孩和一個女孩)之間浪漫的場景,他們手牽著手,互相送出心形的燈光。場景設置在室外,背景溫暖而明亮,充滿了心形圖案。左邊的男孩穿著校服,白襯衫、黑背心和黑褲子,留著深棕色的短髮,髮型又短又尖。他微笑地面對著觀眾,看著女孩,女孩也穿著一件有帽子的紅色外套,背著一個背包。右邊,女孩站立著,雙手合十,手裡拿著一顆發光的心。她有一頭棕色的短髮,棕色的眼睛,和一條毛茸茸的小尾巴。兩個角色面對面,互相看著對方,男孩的表情傳達出一種愛和溫暖的感覺。背景充滿了發光的心形,增添了場景的浪漫氣氛。風格極為細緻且充滿活力,聚焦於人物的表情和熾熱的心靈。

Generation time: 5.71s


ms__00026_.png

A photo-realistic shoot from a front camera angle about a man emerging from a large cauldron filled with water, surrounded by flames. the image also shows a dramatic scene. on the middle of the image, a muscular man with short, dark red hair appears to be in his mid-twenties, completely nude, with water splashing around him. he is facing the viewer and looking directly at them with an intense expression, his mouth open and teeth bared. his arms are outstretched, as if he is about to take a bath. the cauldron is large and round, with a handle on each side. the background is a fiery orange and yellow, with the flames casting a warm glow over the scene. the man's body is partially submerged in the water, with no clothing or accessories visible. the lighting is dramatic, casting shadows and highlights on his body.

這是從正面攝影機角度拍攝的照片般逼真的畫面,一名男子從裝滿水的大鍋中走出來,周圍環繞著火焰。圖中也展現了戲劇性的一幕。圖片中間的一名肌肉發達的男子,留著深紅色的短髮,看起來二十五六歲,全身赤裸,周圍濺起水花。他面向觀眾,表情專注地直視著他們,嘴巴張開,露出牙齒。他張開雙臂,好像正要去洗澡。鼎大而圓形,兩側各有一個把手。背景是熾熱的橙色和黃色,火焰為整個場景投射出溫暖的光芒。男子的身體部分浸沒在水中,沒有任何衣服或配件可見。燈光非常震撼,在他的身上投射出陰影和高光。

Generation time: 4.88s

 

ms__00029_.png

A 3d rendering shoot from a frontal camera angle about a large, muscular, brown bear standing on a rocky path with a large rock balancing on its hind legs. the bear, positioned in the middle of the image, appears to be a non-human male, with a thick, muscular build and a determined expression. it is facing the viewer, looking directly at them with its large, expressive eyes. its fur is detailed and realistic, giving it a lifelike appearance. the background is a cloudy sky with mountains in the distance, and the ground is covered in grass and rocks, giving the scene a natural and rugged feel. the overall atmosphere is one of determination and strength, as if the bear is ready to take on any challenge that comes its way.

從正面攝影機角度進行的 3D 渲染拍攝,拍攝對像是一隻體型巨大、肌肉發達的棕熊,它站在一條岩石小路上,後腿支撐著一塊大石頭,保持平衡。位於圖片中央的熊看起來不是人類,而是雄性,體格健壯,肌肉發達,表情堅定。它正面對著觀看者,用它那雙大而富有表現力的眼睛直視著他們。它的毛髮細緻逼真,看起來栩栩如生。背景是多雲的天空,遠處有山脈,地面上覆蓋著草地和岩石,給人一種自然而粗獷的感覺。整體氛圍是決心和力量,彷彿這隻熊已經準備好接受遇到的任何挑戰。

Generation time: 4.25s

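Error: "a large rock balancing on its hind legs"

Conclusion: Speed and response content are basically OK, and this is the model I used to use all the time, but it also failed the fifth "final boss" image. (It is basically fine for everyday use, but like v1.5, some images occasionally come out with errors.)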



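7. Additional note: To keep the post from getting too long, Florence2-large-ft is shown as a screenshot instead. It performs about the same as Florence2-base-ft: fast responses with very little content.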

image.png



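We have finally finished testing the Florence2 series. This model is fine for everyday use; both speed and quality are acceptable, and it was my first choice in the past. But janus-pro 1b, released today, looks too good to pass up, so I will probably switch to it as my first choice.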
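To put the speed impressions side by side, the short sketch below averages the generation times listed in the text for each model. The numbers are copied from the per-image results in this post. Florence2-large has one missing value ("---"), and Florence2-Flux-Large lists only the fifth image, so their averages cover fewer samples. Florence2-large-ft is omitted because its results appear only as a screenshot.

```python
from statistics import mean

# Generation times in seconds, copied from the per-image results in this post.
TIMES = {
    "Florence2-base-ft": [0.68, 0.798, 1.395, 0.70, 0.73],
    "Florence2-large": [2.30, 2.21, 2.68, 2.21],        # one value missing ("---")
    "Florence2-SD3-Captioner": [1.77, 1.38, 1.65, 1.48, 0.9],
    "Florence2-Flux-Large": [2.29],                      # only the fifth image is listed
    "PromptGen-v1.5-large": [3.79, 4.15, 4.81, 3.25, 4.32],
    "PromptGen-v2.0-large": [4.96, 4.65, 5.71, 4.88, 4.25],
}


def mean_times(times):
    """Return the mean generation time per model."""
    return {model: mean(values) for model, values in times.items()}


if __name__ == "__main__":
    averages = mean_times(TIMES)
    # Print models from fastest to slowest.
    for model, avg in sorted(averages.items(), key=lambda kv: kv[1]):
        print(f"{model}: {avg:.2f}s over {len(TIMES[model])} image(s)")
```

With five images or fewer per model, these averages are only a rough guide; they match the qualitative picture that the base-ft model is the fastest and the PromptGen models are the slowest.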




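Please notify me before reposting or quoting this article, and keep the original source and link! (For plain sharing or non-commercial use, keeping the original source is enough; no notice is needed.)

Original link: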
https://blog.aidec.tw/post/comfyui-image-description-florence2