Aidge Resource
Try for freeWorkplace
  • LATEST ADVANCEMENTS
    • Introducing Marco-MT: Bringing Translation to the Next Level with LLM
  • GETTING STARTED
    • Quick Start
    • Account and Authentication
    • Your First API Request
    • Test Your API Requests
    • Service Level Agreement
    • FAQ
  • API REFERENCE
    • E-commerce Information Translation
      • Marco Translator
        • Marco Translator API Reference
      • Image Translation
        • Image Translation Pro Version API Reference
        • Image Translation Pro Version Result API Call Description
        • Image Translation Standard Version API Reference
    • E-commerce Image Editing
      • Image Background Removal
        • Image Background Removal API Reference
      • Image Upscaling
        • Image Upscaling API Reference
      • Image Cropping
        • Image Cropping API Reference
      • Image Elements Removal
        • Image Elements Removal API Reference
      • Image Elements Detection
        • Image Elements Detection API Reference
    • E-commerce Virtual Model
      • Virtual Model Alternation
        • Virtual Model Alternation Submit API Reference
        • Virtual Model Alternation Result Query API Reference
      • Virtual TryOn
        • Virtual Try-on Submit API Reference
        • Virtual Try-On Query API Reference
        • General Model Library Reference
      • Hands&Feet Repair
        • Hands&Feet Repair Submit API Reference
        • Hands&Feet Repair Query API Reference
    • Editor Documentation
      • AI Model Editor
      • AI Image Editor
        • Image Workbench
        • Background Removal
        • Elements Removal
        • Image Translation
Powered by GitBook
On this page
  • Introduction
  • Key Features
  • Supported Language Pairs
  • Pricing
  • Quick Start
  • FAQ

Was this helpful?

  1. API REFERENCE
  2. E-commerce Information Translation

Image Translation

PreviousMarco Translator API ReferenceNextImage Translation Pro Version API Reference

Last updated 9 days ago

Was this helpful?

Introduction

The Image Translation is specially designed for e-commerce images, supporting dozens of language pairs. It accurately handles complex text layouts within images and ensures that the translated content matches the image content. This helps e-commerce platforms and developers easily achieve multilingual conversion and presentation of image content.

We offer two image translation API products: the Image Translation Standard Version and the Image Translation Pro Version, based on Multimodal Large Language Model(MLLM),which enhances translation accuracy and quality through advanced image understanding.

A comprehensive comparison between Aidge's image translation product and competitors revealed that Aidge offers 15% to 20% superior image processing, addressing challenges like color block masking and layout issues. In multilingual translation quality, the Image Translation Standard API outperforms competitors by 10% to 20%. The Pro Version, utilizing a multimodal large model, further improves translation quality by an additional 10% to 15%.

Original Image Input

Image Translation Standard Version Output

MLLM Image Translation Pro Version Output

Description

Text:160° Electric Recline for Deep Relaxation

Image Translation Standard Version Output:160 ° The electric back lies deep into it

MLLM Image Translation Pro Version Output:160° Electric Recline for Deep Relaxation

MLLM Image Translation Pro Version provides fluent translations consistent with standard English marketing and product description language.

Text:Brand under EuroComfort Group/69 Years of Experience

Image Translation Standard Version Output:Brand owned by german oshine group/69 precipitation

MLLM Image Translation Pro Version Output:Brand under EuroComfort Group/69 Years of Experience

MLLM Image Translation Pro Version translates names and abbreviations such as "EuroComfort Group" and "Y" on images with English information and brand names.

For more information about the MLLM Image Translation Pro Version, please refer to:

Key Features

The image translation is specifically designed for various e-commerce businesses, facilitating cross-platform e-commerce image creation. It quickly converts main product images and detailed descriptions into languages understood by consumers in different regions, optimizing product presentation and sales across multiple e-commerce platforms.

Key Advantages

Description

High-definition image processing

Capable of handling images up to 4000x4000 pixels while maintaining clarity and quality.

Multiple languages

Supports multiple languages, covering the languages of over a dozen countries and regions, with translation capabilities specifically optimized for e-commerce terminology.

Distinguishes between natural text and artificial text on products, preserving the natural text on the main body of the product.

Differentiates natural product text from artificial text, preserving natural text during translation with product body recognition, ideal for e-commerce images.

As shown in the example image, the text in the red box represents the text on the main subject, while the text in the blue box represents text not on the main subject.

Preserves brand name

When brand name protection is enabled, the system can identify artificially added brand names and prevent their translation.

In the example image, "懒角落" is the brand name. With protection enabled, it will remain untranslated.

Erases thoroughly, ensuring no residue or color blocks remain.

Retains image layout with adjusted text size and alignment.

The translated Arabic text is arranged from right to left, in accordance with the linguistic characteristics.

Supports the restoration of text within complex tables after translation.

Supported Language Pairs

The image translation product supports the language pairs listed in the table below for image translation. We will continue adding more language pairs.

If you have specific needs, please contact us via email, and we will get back to you as soon as possible. Our email address is: aidgesales@alibaba-inc.com.

The standard version of the image translation supports a wider range of languages, with 9 source languages including Chinese, English, Turkish... And can translate into over 20 target languages such as Chinese, English, Arabic, Spanish, and French. Currently, 198 language pairs are supported. Among these, language pairs like English to Arabic, and English to European languages are our strengths.

Note: Translation where the source and target languages are the same is not supported (e.g., if the source language is English and the target language is English, the final result will return the original input image).

The Image Translation Standard version supports source language to:

Language

Language Code

Chinese(Simplified)

zh

English

en

French

fr

Italian

it

Japanese

ja

Korean

ko

Portuguese

pt

Spanish

es

Turkish

tr

The Image Translation Standard version supports target language to:

Language

Language Code

Arabic

ar

Bengali

bn

Chinese(Simplified)

zh

Chinese (Traditional) only Chinese(Simplified) and English as source languages are supported

zh-tw

Czech only English and Turkish as source languages are supported

cs

Dutch

nl

English

en

French

fr

German

de

Greek only English and Turkish as source languages are supported

el

Hebrew

he

Hungarian only English and Turkish as source languages are supported

hu

Indonesian

id

Italian

it

Japanese

ja

Kazakh only Chinese(Simplified) as source language is supported

kk

Korean

ko

Malay

ms

Polish

pl

Portuguese

pt

Russian

ru

Spanish

es

Thai

th

Turkish

tr

Ukrainian

uk

Urdu

ur

Vietnamese

vi

The Image Translation Pro version(MLLM) currently supports over 600 language pairs. The translation quality has improved by 10% to 15% compared to competitors for Chinese to English, Japanese, Korean, and Southeast Asian languages, as well as English to Spanish, French, German, Portuguese, and other European languages.

The Image Translation Pro version supports source language to:

Language

Language Code

Chinese(Simplified)

zh

Chinese (Traditional)

zh-tw

English

en

French

fr

Italian

it

Japanese

ja

Korean

ko

Malay

ms

Portuguese

pt

Russian

ru

Spanish

es

Thai

th

Turkish

tr

Vietnamese

vi

The Image Translation Pro version supports target language to:

Language

Lamguage Code

Arabic

ar

Bengali

bn

Bulgarian

bg

Chinese(Simplified)

zh

Chinese (Traditional)

zh-tw

Croatian

hr

Czech

cs

Danish

da

Dutch

nl

English

en

Estonian

et

Finnish

fi

French

fr

German

de

Greek

el

Hebrew

he

Hungarian

hu

Icelandic

is

Indonesian

id

Italian

it

Japanese

ja

Kazakh

kk

Korean

ko

Latvian

lv

Lithuanian

lt

Macedonian

mk

Malay

ms

Polish

pl

Portuguese

pt

Romanian

ro

Russian

ru

Serbian

sr

Sinhala

si

Slovak

sk

Slovenian

sl

Spanish

es

Swedish

sv

Tagalog

tl

Thai

th

Turkish

tr

Ukrainian

uk

Urdu

ur

Uzbek

uz

Vietnamese

vi

Pricing

To use the API, you are required to choose and purchase an API resource pack from us on a subscription basis.

  • Each resource pack is valid for one calendar year upon successful purchase, and enables you to access the API up to the number of requests specified in the pack. No refunds can be provided.

  • Resource packs cannot be used across different products. For example, if you need to use both product text translation and image translation, you must purchase separate resource packs for each.

  • The Image Translation Standard Version and MLLM Image Translation Pro Version are priced the same.

The prices are as follows:

Capacity

Price (USD)

Unit Price(USD)

1,000 images

22

$0.022 /image

10,000 images

150

$0.015 /image

100,000 images

1000

$0.010 /image

Quick Start

FAQ

1. Does it support automatic recognition of images in different languages?

Currently, the image translation only translates the specified input language and does not support automatic recognition of multiple languages.

For example, if an image contains both Chinese and English, and the input source language is set to Chinese, only the Chinese text will be translated while the English text remains untouched. Conversely, if the input source language is set to English, only the English text will be translated while the Chinese text remains untouched.

2. The image translation result has imperfections, and how should I make adjustments?

If your image is not an e-commerce image or does not contain a product main body, or if you encounter missing translations, you can try setting the input parameter to translate the main body text of the product and try again.

Input:

Output:

Input:Output:

Input:Output:

Input:Output:

Input:Output:

Input:Output:

If you need to purchase more QPS due to business requirements, please contact us via or email us (aidgesales@alibaba-inc.com).

For instructions on calling the API Submit of the Image Translation Standard Version,please refer to . For the API Result Query, please also refer to to document. For the API Submit of the Image Translation Pro Version,please refer to . For the API Result Query, please also refer to the document.

https://github.com/AIDC-AI/Ovis
Discord
Image Translation Standard Version API Referance
Image Translation Pro Version Result API Call Description