Source Management

Organize and reuse documents as data sources across any form filling session

Overview

When you add a document to the source library, it is available for any form filling session in your workspace without re-uploading. Upload a physician CV, a client intake packet, or a prior completed form once, and any future session that needs it can pull from the library directly.

This is different from how profiles work. Profiles store structured data - names, addresses, dates - that maps directly to form fields. Source library documents are unstructured files that the AI reads and interprets to extract values. In practice, most workflows use both: the profile handles the standard identity fields while source documents carry case-specific detail like prior applications, employment records, or supporting certificates.

The library supports tagging, folder organization, version history, and usage tracking across sessions. Workspace-level sharing gives teams a single, maintained document set rather than individual uploads duplicated across accounts.

Key Capabilities

  • Source Library: Centralized repository for frequently-used documents, scoped to workspace
  • Multiple Format Support: PDF, Word, Excel, images, text files, and plain text
  • Source Organization: Tag and folder organization for easy discovery
  • Version History: Track changes when updating source documents; prior versions are preserved
  • Usage Tracking: See which sources are used in which sessions
  • Quick Access: Recently-used sources appear first for easy selection
  • Bulk Upload: Upload multiple source documents at once for batch processing workflows
  • Source Sharing: Share sources with workspace members for team collaboration
  • OCR for Images: Google Cloud Vision API extracts text from scanned documents and photographed IDs
  • Source Metadata: Add descriptions, tags, and notes to each source

How It Works

  1. Add Sources to Library:

    • Upload documents from your computer
    • Paste text directly
    • Capture via email forwarding
    • Add programmatically via RESTful API: POST /api/sources
  2. Processing on Upload:

    • Text extracted from PDF (PyMuPDF), Word, Excel, and plain text
    • Images and scanned PDFs processed via Google Cloud Vision API (OCR), with PIL/Pillow and OpenCV 4.9.0 for pre-processing (rotation correction, contrast normalization)
    • Vector embeddings created per source; source text mapped to form page numbers for page-scoped retrieval during sessions
    • Text content encrypted with workspace-scoped keys before storage
  3. Organize Sources:

    • Create folders: "Client Documents", "Reference Materials", "Templates"
    • Tag sources: "resume", "financial-info", "medical-history"
    • Add descriptions for team members
  4. Use in Form Sessions:

    • Start a form filling session
    • Select "Add Source" - "From Library"
    • Sources are available to the session before autofill begins
    • AI extracts data automatically from each source for the relevant fields

    Example: Teams using the n8n integration push source documents from Google Sheets, CRMs, or email inboxes into Instafill.ai via the API, so sessions start with sources already attached and no manual upload step is needed.

  5. Update Sources: When information changes:

    • Upload an updated version
    • The system creates a new version while preserving previous versions
    • Future sessions use the latest version automatically
    • In-progress sessions continue with the version they started with, so a mid-session update does not affect an active fill
  6. Share with Team:

    • Mark sources as "Shared" for workspace visibility
    • Team members reference common sources (company info, standard templates)
    • No duplicate uploads needed

Use Cases

Source management saves the most time in workflows where the same documents appear across many form submissions.

Medical credentialing - Hawkeye Physicians maintains a source library per radiologist containing CME certificates, malpractice coverage letters, and board certification records. When a new hospital contract requires a credentialing packet, the relevant sources are already organized and ready - no hunting across email or shared drives. Time per packet dropped from 3-4 hours to under 30 minutes.

Insurance authorization - Headstart Health processes ABA treatment authorization forms for 30+ payers. Each form pulls data from 3-5 documents: treatment plans, diagnostic reports, insurance cards, provider rosters. Keeping those in a shared source library means any team member can start a session without tracking down files. Time per form dropped from 20-30 minutes to around 2 minutes.

Immigration law - For firms managing multiple USCIS forms per client, source documents like passports, birth certificates, employment letters, and prior applications are uploaded once and reused across all filings in the case. Hong LLC achieved a 75-80% time reduction across I-485, I-765, I-130, and G-1145 filings this way.

HR and onboarding - Employee documents (I-9 supporting IDs, certifications, prior W-2s) are uploaded once per employee and reused across any onboarding or compliance form without the HR team re-uploading per form. When a new certification is earned, only one source needs updating for all future sessions to reflect it.

Benefits

  • Time savings: Upload documents once, use across any number of sessions without re-uploading
  • Consistency: The same source used across multiple forms ensures data extracted from it stays uniform
  • Version control: Update a source without losing previous versions
  • Team collaboration: Share sources across a workspace so every team member draws from the same maintained document set
  • Reduced errors: Referencing a single source eliminates transcription mistakes that occur when the same data is manually re-entered per form
  • Easy updates: Update a source once and all future sessions use the new version automatically
  • Better organization: Tag and folder structure keeps documents findable rather than buried in email threads or local folders

Security & Privacy

  • Access Control: All service-layer queries include a workspaceId filter - source data is isolated per workspace and protected via the shared JWT authentication middleware running in both the .NET and Python service layers
  • Encryption: Text source content encrypted with workspace-scoped keys stored in Azure Key Vault; binary files stored in Azure Blob Storage
  • Scope Restriction: Encryption includes scope metadata that prevents decryption outside the originating workspace, even for internal service calls
  • Retention Policies: Configure automatic deletion after a specified period via Data Retention & Cleanup Management
  • Audit Trail: Every access and modification to sources is logged - see Request Logging & Audit Trail for details
  • No AI Training: Source documents are never used to train AI models; they are processed only for your specific form filling sessions

Common Questions

How many sources can I store?

Storage limits depend on your subscription plan. See the pricing page for current plan limits, or contact support if you need a custom quota for high-volume use.

Tips for managing storage:

  • Delete unused sources periodically
  • Use text paste instead of file upload when possible (plain text takes far less space)
  • Compress large PDFs before uploading
Can I share sources with people outside my workspace?

Within Workspace: All workspace members see shared sources. Individual sources can be kept private so only you see them.

External Sharing: Not directly supported. If you need to share a source with an external party, export the file and have them upload it to their own workspace. For ongoing client collaboration, create a separate workspace and invite them as a member.

Public Sharing: Not supported. Sources are always private to the workspace.

What happens to form sessions if I delete a source?

Completed sessions: Already-filled forms retain their data. Source data was extracted into field values during the session, not stored as a live reference, so deleting the source has no impact on completed forms.

In-progress sessions: If the session has not yet extracted data from the source, the source will be missing when autofill runs. If extraction has already completed, there is no impact. The safest approach is to complete any active sessions before deleting sources they depend on.

Can I automatically pull sources from external systems?

Yes, through integrations and automation:

Email Integration: Each workspace has a unique email address. Forward a document to that address and Instafill.ai adds it as a source for the configured form. See Email Integration for setup details.

API Integration: Add sources programmatically via POST /api/sources and reference them by ID when creating sessions.

n8n, Make.com, Zapier: Connect any platform that sends HTTP requests to push documents into Instafill.ai and trigger form fills without writing custom code. The n8n setup guide walks through a ready-to-use workflow template.

Example Workflow:

  • A new employee record is created in the HR system
  • The HR system pushes the employee document package to Instafill.ai via POST /api/sources
  • HR fills onboarding forms using the auto-populated source library
  • No manual document upload needed
How do I update a source without creating a new one?

Replace Version:

  1. Open the source in the library
  2. Click "Upload New Version"
  3. Select the updated file
  4. The system creates a new version while preserving the previous ones

Edit Metadata: Update the source name, description, or tags without uploading a new file. Useful for organizational changes.

Impact on Sessions:

  • Future sessions use the new version automatically
  • In-progress sessions continue with the version they started with, preventing mid-session inconsistency

Example: An employee updates their resume with a new certification. HR uploads the new version to the source library. Future applications use the updated resume; any session already in progress completes with the original.

Related Features

Ready to get started?

Start automating your form filling process today with Instafill.ai

Try Instafill.ai View Pricing