国产开源图像模型HiDream-I1发布

HiDream-I1 由国内AI初创公司智象未来(HiDream.ai)研发,凭借 17亿参数 的规模与创新架构,成为当前开源图像生成领域的标杆级产品。其核心技术亮点包括:

  1. 混合架构融合:基于 扩散模型(DiT)混合专家系统(MoE) 的双重架构,在生成速度与质量间取得平衡。模型包含双流MMDiT block和单流DiT block,支持多模态输入。
  2. 多模态编码器:整合 OpenCLIP ViT-bigGOpenAI CLIP ViT-LT5-XXLLlama-3.1-8B-Instruct 四类编码器,实现文本描述的深度语义解析。
  3. 性能优化:通过 动态剪枝知识蒸馏 技术,提供完整版、开发版(28步推理)和极速版(16步推理)三种模型,满足不同硬件需求。在 HPS v2.1评分 中达到SOTA水平,生成质量超越多数开源模型。

DPG-Bench

ModelOverallGlobalEntityAttributeRelationOther
PixArt-alpha71.1174.9779.3278.6082.5776.96
SDXL74.6583.2782.4380.9186.7680.41
DALL-E 383.5090.9789.6188.3990.5889.83
Flux.1-dev83.7985.8086.7989.9890.0489.90
SD3-Medium84.0887.9091.0188.8380.7088.68
Janus-Pro-7B84.1986.9088.9089.4089.3289.48
CogView4-6B85.1383.8590.3591.1791.1487.29
HiDream-I185.8976.4490.2289.4893.7491.83

GenEval

ModelOverallSingle Obj.Two Obj.CountingColorsPositionColor attribution
SDXL0.550.980.740.390.850.150.23
PixArt-alpha0.480.980.500.440.800.080.07
Flux.1-dev0.660.980.790.730.770.220.45
DALL-E 30.670.960.870.470.830.430.45
CogView4-6B0.730.990.860.660.790.480.58
SD3-Medium0.740.990.940.720.890.330.60
Janus-Pro-7B0.800.990.890.590.900.790.66
HiDream-I10.831.000.980.790.910.600.72

HPSv2.1 benchmark

ModelAveragedAnimationConcept-artPaintingPhoto
Stable Diffusion v2.026.3827.0926.0225.6826.73
Midjourney V630.2932.0230.2929.7429.10
SDXL30.6432.8431.3630.8627.48
Dall-E331.4432.3931.0931.1831.09
SD331.5332.6031.8232.0629.62
Midjourney V532.3334.0532.4732.2430.56
CogView4-6B32.3133.2332.6032.8930.52
Flux.1-dev32.4733.8732.2732.6231.11
stable cascade32.9534.5833.1333.2930.78
HiDream-I133.8235.0533.7433.8832.61

已发布

分类

来自

标签:

评论

发表回复

您的邮箱地址不会被公开。 必填项已用 * 标注