Multilingual Support
Process documents and data in multiple languages with full Unicode support. The AI detects the language automatically, preserves special characters, and maps data to the correct fields.
Overview
Instafill.ai is an AI form filler used in 138 countries. The platform supports form filling in a growing set of fully tested languages, with additional languages in active beta, and reads source documents in 100+ languages.
Multilingual support covers two separate scenarios. The first is filling a form whose field labels are in a language other than English - a Spanish credentialing form, a German compliance document, a French legal filing. The AI understands what each field is asking in its own language and fills it correctly. The second is attaching a source document written in a different language from the form itself. For example, a client submits their intake information in Spanish and the target form is in English. The AI reads the Spanish source, extracts the data, and places it in the English form fields without you translating anything manually.
Special characters, accented letters, and diacritical marks are preserved throughout - in the filled form and in the downloaded PDF.
Supported Languages
Primary supported languages (fully tested)
These languages are fully supported for both form field labels and source document input:
English, Spanish (Latin America and Spain), French (France, Canada, and Switzerland), German (Germany, Austria, and Switzerland), Italian (Italy and Switzerland), Portuguese (Portugal and Brazil), Ukrainian, Dutch, Polish, Romanian, Hungarian, Turkish, Catalan, and Romansh.
Beta languages (field-tested, actively improving)
These languages are supported and in active use, but may require reviewing more fields after autofill:
Vietnamese, Tagalog, Arabic (left-to-right fields only - see note below), Chinese Simplified, Korean, and Japanese.
Note on Arabic: Arabic form fields are currently supported in left-to-right orientation only. Forms with Arabic text in right-to-left layout may require manual field review and correction after filling.
Note on non-Latin scripts: For Chinese, Korean, Japanese, Arabic, and similar scripts, accuracy depends on the quality and clarity of the source document and the PDF form layout. Plan to review the filled result more carefully on the first fill of any new form type in these languages.
Source documents in any language
When you attach a source file - a PDF, Word document, image, or spreadsheet - the AI can read content in 100+ languages and extract the relevant data. You do not need to specify the language or translate the document before attaching it.
How It Works
Source documents in a different language
Attach any foreign-language source document in a fill session the same way you would attach an English one - upload the file in the Files tab or paste text in the Input tab. The AI detects the language, extracts names, dates, addresses, and other data, then maps that data to the corresponding fields in the target form.
If a client submits data in Spanish and your form is in English, the AI handles the mapping. "Nombre" in a source document maps to "Name" in the target form. "Dirección" maps to "Address." You do not configure any of this - the AI resolves field equivalents across languages automatically.
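The idea of label equivalence can be pictured with a small sketch. This is an illustration only, not how Instafill.ai works internally: the platform resolves equivalents with AI rather than a lookup table, and the `LABEL_EQUIVALENTS` table and `map_source_to_form` function are hypothetical names invented for this example.

```python
import unicodedata

# Hypothetical lookup table standing in for the AI's label matching.
LABEL_EQUIVALENTS = {
    "nombre": "Name",
    "dirección": "Address",
    "fecha de nacimiento": "Date of birth",
}


def _key(label: str) -> str:
    """Normalize a label so accents and casing compare consistently."""
    return unicodedata.normalize("NFC", label).strip().casefold()


def map_source_to_form(source: dict[str, str]) -> dict[str, str]:
    """Move values from source labels to the equivalent form labels.

    Values are copied unchanged; only the label is matched across languages.
    Labels with no known equivalent are skipped.
    """
    mapped = {}
    for label, value in source.items():
        target = LABEL_EQUIVALENTS.get(_key(label))
        if target is not None:
            mapped[target] = value
    return mapped


spanish_intake = {"Nombre": "Ana Pérez", "Dirección": "Calle Mayor 5"}
english_fields = map_source_to_form(spanish_intake)
```

Note that only the labels are matched; the values ("Ana Pérez", "Calle Mayor 5") pass through untouched.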
Forms with non-English field labels
When the form itself is in a supported language, Instafill.ai reads and understands the field labels in that language during fine-tuning. The filling process is identical to an English form - provide your source data and the AI places values in the right fields.
Special characters and accented letters
Names like "María García," and entries with umlauts, cedillas, tildes, and other diacritical marks, are written into the PDF exactly as provided. The platform uses UTF-8 encoding throughout - database, API, and PDF generation - so characters are never corrupted or dropped. The downloaded PDF embeds the fonts needed to display special characters correctly on any device.
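If you prepare source data yourself before attaching it, a round-trip check like the one below can confirm that accented text survives UTF-8 encoding intact. This is a minimal sketch using only the Python standard library. The NFC normalization step is an assumption added for illustration (the same accented letter can be stored as one code point or as a base letter plus a combining mark), and the function names are hypothetical; none of this describes Instafill.ai's internals.

```python
import unicodedata


def to_stored_bytes(text: str) -> bytes:
    """Normalize to NFC (assumed here for consistency) and encode as UTF-8."""
    return unicodedata.normalize("NFC", text).encode("utf-8")


def from_stored_bytes(raw: bytes) -> str:
    """Decode UTF-8 bytes back into text."""
    return raw.decode("utf-8")


composed = "María García"                               # precomposed accented letters
decomposed = unicodedata.normalize("NFD", composed)     # base letter + combining accent

# Both spellings look identical on screen but differ as code point sequences.
round_tripped = from_stored_bytes(to_stored_bytes(decomposed))
```

After the round trip, `round_tripped` compares equal to `composed`, even though the input was the decomposed form.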
Date formats, name ordering, and address structure
The AI adapts to language-specific conventions automatically:
- Date formats: US forms expect MM/DD/YYYY; European forms typically use DD/MM/YYYY. The AI converts dates from the source document's format to what the form expects.
- Name ordering: Western names put the given name first, then the family name. Chinese, Japanese, and Korean names conventionally put the family name first. The AI applies the correct ordering based on form context.
- Address structure: US addresses follow Street, City, State, ZIP. Other countries use different structures. The AI maps address components to the correct fields based on the form's layout.
- Smart data formatting: Phone numbers, dates, currency values, and ID numbers are reformatted to match what each form expects - regardless of how they appear in the source document.
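The date and name conventions above can be sketched in a few lines. This is a simplified illustration under stated assumptions, not the platform's implementation: the source format is passed in explicitly here (the AI infers it from context), and `convert_date` and `order_name` are hypothetical helper names.

```python
from datetime import datetime

EUROPEAN = "%d/%m/%Y"  # DD/MM/YYYY
US = "%m/%d/%Y"        # MM/DD/YYYY


def convert_date(value: str, source_format: str, target_format: str) -> str:
    """Parse a date in the source format and re-emit it in the target format."""
    return datetime.strptime(value, source_format).strftime(target_format)


def order_name(given: str, family: str, family_first: bool) -> str:
    """Join name parts in the order the target form expects."""
    return f"{family} {given}" if family_first else f"{given} {family}"


us_date = convert_date("31/12/2024", EUROPEAN, US)
east_asian_order = order_name("Wei", "Zhang", family_first=True)
western_order = order_name("Wei", "Zhang", family_first=False)
```

A value such as "03/04/2024" is valid in both formats, which is why the source format has to be known or inferred before converting; the string alone cannot settle it.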
Use Cases
Immigration and visa workflows
Immigration attorneys frequently receive client documents in the applicant's native language - passports, civil records, employment letters - and need to fill US government forms in English. Attaching these foreign-language documents as sources means the AI extracts names, dates, and biographical data without manual translation or transcription.
International business operations
Organizations operating across multiple countries maintain form libraries in several languages. A compliance packet in Germany uses German field labels. A credentialing form in France uses French. A regulatory submission in Spain uses Spanish. The same workflow applies in all cases - upload the form, attach the source, review the filled result.
Cross-border client intake
Law firms, financial advisors, and healthcare providers serving non-English-speaking clients can receive intake information in the client's language and fill English forms from it. This removes a translation step from every new client intake.
Multilingual batch processing
When the same form needs to be filled for records from different countries - employees across different regions, applicants from different origin countries - batch filling handles each row using the same field mapping logic. A spreadsheet with multilingual data populates the correct fields in the correct format for each row.
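As a rough picture of row-by-row batch filling, the sketch below applies one field mapping to every row of a small multilingual spreadsheet. The column names, the `FIELD_MAP` table, and the `fill_batch` function are assumptions made for this example and are not part of Instafill.ai's API.

```python
import csv
import io

# Hypothetical mapping from spreadsheet columns to form field names.
FIELD_MAP = {"full_name": "Name", "city": "City"}


def fill_batch(csv_text: str) -> list[dict[str, str]]:
    """Apply the same column-to-field mapping to every row of the sheet."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return [
        {field: row[column] for column, field in FIELD_MAP.items()}
        for row in reader
    ]


sheet = (
    "full_name,city\n"
    "José Muñoz,Bogotá\n"
    "Zoë Müller,Zürich\n"
)
filled_forms = fill_batch(sheet)
```

Each row produces one set of filled fields, and the accented characters in every row carry through unchanged.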
Key Capabilities
| Capability | Detail |
|---|---|
| Primary languages | 14+ fully tested: English, Spanish, French, German, Italian, Portuguese, Ukrainian, Dutch, Polish, Romanian, Hungarian, Turkish, Catalan, Romansh |
| Beta languages | Vietnamese, Tagalog, Arabic (LTR fields only), Chinese Simplified, Korean, Japanese |
| Source document reading | 100+ languages - the AI detects the language and extracts data automatically |
| Cross-language field matching | English source data fills Spanish form fields, and vice versa, without manual mapping |
| Special character preservation | Accented letters, umlauts, tildes, cedillas, and diacritical marks are written and rendered correctly |
| Date format conversion | Source dates are converted to the format the form expects |
| Name and address conventions | East Asian name ordering and international address structures handled automatically |
| Smart data formatting | Phone numbers, SSNs, currency, and dates are reformatted to match the form's expected format |
| Font embedding in PDF output | Required fonts are embedded in the downloaded PDF so special characters display on any device |
Benefits
- Attach source documents in any language without translating them first - the AI reads and extracts data directly
- Fill forms in any of the fully tested languages using the same workflow as English
- Special characters in names and addresses are preserved exactly in the filled PDF
- Date formats, name ordering, and address structure are adapted to what each form expects
- Smart formatting cleans up raw data - phone numbers, dates, ID numbers - before placing them in fields
- One platform handles multilingual workflows the same way it handles English ones - no separate tools, no language-specific configuration
Security & Privacy
Source documents in all languages are handled with the same security controls as English documents:
- Workspace-scoped access: Files are accessible only to users within the originating workspace.
- Encrypted storage: All data is encrypted at rest with AES-256. All connections use TLS.
- No AI training: Documents you upload are processed only for your specific filling session and are never used to train AI models.
- Data residency: Instafill.ai is hosted on Microsoft Azure in US, European, and Australian regions. Enterprise customers can select the region that meets their compliance requirements.
- Stateless option: Enable "Remove files immediately after processing" and all source content is deleted from Instafill.ai's servers as soon as the form is filled - including sensitive foreign-language documents such as passports, civil records, and medical files.
Learn more on the Security page.
Common Questions
Which languages are fully supported?
The following languages are fully tested for form filling: English, Spanish (Latin America and Spain), French (France, Canada, and Switzerland), German (Germany, Austria, and Switzerland), Italian (Italy and Switzerland), Portuguese (Portugal and Brazil), Ukrainian, Dutch, Polish, Romanian, Hungarian, Turkish, Catalan, and Romansh.
For source documents - the files you attach to extract data from - the AI reads content in 100+ languages and maps it to form fields automatically.
Additionally, these languages are in active beta (field-tested and improving): Vietnamese, Tagalog, Arabic (left-to-right fields only), Chinese Simplified, Korean, and Japanese.
Can I attach a source document in one language and fill a form in another?
Yes. This is one of the most common multilingual workflows. A Spanish-language intake form submitted by a client can be attached as a source to fill an English PDF. A foreign-language passport or employment letter can supply data for a US government form.
The AI understands field label equivalents across languages - "Nombre" corresponds to "Name," "Fecha de nacimiento" corresponds to "Date of birth" - without any configuration required.
How accurate is multilingual form filling compared to English?
For primary supported languages with Latin-based scripts (Spanish, French, German, Portuguese, Italian, Dutch, Polish, and similar), accuracy is close to English because the underlying script and structure are similar.
For beta languages - Arabic, Chinese, Japanese, Korean, Vietnamese, Tagalog - accuracy depends on the quality and clarity of the source document and the form's PDF layout. Plan to review the filled result more carefully on the first fill of any new form type in these languages. Arabic is additionally limited to left-to-right field orientation at this time.
Do special characters and accented letters render correctly in the downloaded PDF?
Yes. Special characters - accented vowels, umlauts, cedillas, tildes, and other diacritical marks - are preserved in the filled form and in the downloaded PDF. The PDF embeds the fonts needed to display those characters so the file opens and prints correctly on any device, regardless of what fonts are installed locally.
Can I use multilingual data in Profiles?
Yes. Profiles store structured data for clients, patients, or employees and can hold information in any language. A profile for a Spanish-speaking client stores their name, address, and other details with special characters intact. When you apply that profile to fill a form - whether in English or Spanish - the AI maps the profile data to the correct fields and preserves the characters exactly as stored.
Does the AI translate the content of forms or source documents?
No. Instafill.ai does not translate document content. When filling a Spanish form from English source data, the AI fills the Spanish fields with the original values from the source - "John Smith" goes into the "Nombre" field as "John Smith," not as a translated name.
What the AI does is understand field label equivalence across languages - it knows "Nombre" means "Name" - so it places data in the correct field. The data values themselves are not translated. If your workflow requires full document translation, Instafill.ai can be combined with external translation APIs.
Do date formats and name ordering work correctly across languages?
Yes. The AI detects the date format used in your source document and converts dates to the format the form expects - MM/DD/YYYY for US forms, DD/MM/YYYY for many European forms, or other formats as needed. Name ordering conventions for Chinese, Japanese, and Korean names (family name first) are handled based on the form's context. You do not need to reformat dates or reorder names manually before attaching source documents.
My language is not listed. Can it be added?
Contact the support team to discuss your requirements. If you work with a specific form in a language not currently on the primary or beta list, Instafill.ai can be configured to support that form set. The AI can be trained on new form types regardless of language.