Commercial Invoice Data Extraction: Capture Customs Fields to Excel and CSV

Upload a commercial invoice and the AI reads every field customs and your broker need: seller and buyer, HS or HTS codes, country of origin, Incoterms, currency, unit values, line items, freight, and total customs value. Download clean Excel or CSV in seconds, with no template to build per supplier.

PDF, JPG, PNG, BMP, HEIC, TIFF

Upload your invoices

HS Code & Country of Origin
Full Line-Item Capture
Excel & CSV Output
No Template to Build

Why Commercial Invoices Are Slow and Risky to Key by Hand

A commercial invoice is the document U.S. Customs and Border Protection uses to value a shipment and assess duty, so the data on it has to be exact. Retyping HS codes, values, and country of origin from dozens of supplier layouts is slow, and a single wrong figure can trigger a hold or a penalty.

Every Supplier and Forwarder Uses a Different Layout

Commercial invoices arrive from overseas sellers, freight forwarders, and carriers in hundreds of formats and languages. Template-based tools break the moment a layout changes, so someone ends up reading the odd ones by hand anyway.

Misclassification and Vague Descriptions Cause Penalties

Wrong HS or HTS codes and vague descriptions like "hardware" or "parts" are among the most common causes of CBP penalties and shipment holds. Rekeying by hand is exactly where those errors creep in.

Customs Deadlines Do Not Wait for Data Entry

Entry has to be filed on a schedule, and manual keying of line items, values, and charges is the bottleneck. A backlog of invoices means demurrage, storage fees, and delayed release at the port.

No Clean Data to Match Against the PO or Entry

Reconciling a commercial invoice against the purchase order and the customs entry needs structured fields: line items, quantities, unit values, and totals. When that data lives only on a PDF, every check is a manual lookup.

How InvoiceExtractor Turns Commercial Invoices Into Structured Customs Data

Upload the commercial invoice and the AI reads it the way an experienced broker would, then returns the fields customs and your ERP need as clean, structured columns.

HS Codes and Country of Origin

Capture the HS or HTS classification, country of origin, and Incoterms alongside the merchandise description, so entry preparation starts from structured data instead of a manual read.

Every Line and Value

Each line with description, quantity, unit value, and extended amount, plus freight and insurance charges, currency, and the total customs value. The full record, not just the invoice total.

Parties and Shipment Terms

Seller and buyer names and addresses, invoice number and date, purchase order reference, and terms of sale, all pulled into named fields ready to import.

Batch a Whole Shipment File

Drop a folder of commercial invoices, packing lists, and scans at once and get back one consolidated Excel or CSV, ready to feed a broker system or ERP.

From Commercial Invoice to Import-Ready Data in Three Steps

One invoice takes under 10 seconds. A full shipment folder takes minutes.

1

Upload the Commercial Invoices

Drag in one invoice or a batch straight from your email or shipment folder. Native PDFs, scans, and photos all work, including multi-page invoices and packing lists.

Tip: Upload mixed file types together; there is no need to separate scans from native PDFs first.

2

AI Extracts Every Field

The AI identifies parties, invoice number and date, HS codes, country of origin, Incoterms, currency, line items, quantities, unit values, charges, and total customs value.

3

Download and File

Export a structured Excel or CSV with consistent headers, ready to key into your customs entry, broker platform, or ERP without retyping.

Who Uses Commercial Invoice Extraction

From a single importer to a customs brokerage clearing thousands of entries a month.

Importers

Pull line items, HS codes, and values off supplier invoices to prepare entries and reconcile against the purchase order.

Customs Brokers

Turn a stack of client commercial invoices into structured data for entry filing, faster and with fewer keying errors.

Freight Forwarders

Capture shipment and merchandise data from invoices and packing lists to build documentation without manual entry.

Trade Compliance

Get consistent, auditable fields for classification and valuation reviews instead of reading each PDF by hand.

Commercial invoice data extraction, explained

Commercial invoice data extraction uses AI to read a commercial invoice and return the fields customs and your ERP need as structured columns: seller and buyer, invoice number and date, HS or HTS codes, country of origin, Incoterms, currency, line items, unit values, freight and insurance charges, and the total customs value. Instead of a broker or importer keying each field off a PDF, you upload the file and download a clean Excel or CSV in seconds. It works on native PDFs, scans, and photos from any supplier layout, with no template to build. Last updated July 2026.

What data lives on a commercial invoice

A commercial invoice is the document foreign customs agencies, including U.S. Customs and Border Protection, use to value a shipment and assess duties and taxes. Under CBP invoice rules, it has to show an adequate description of the merchandise, the quantities, the values, the appropriate Harmonized Tariff Schedule subheading, the name and address of the seller or manufacturer, the country of origin, the currency, the port of destination, and itemized charges such as freight and insurance. Extraction returns each of those as its own field, so the record is ready to file rather than ready to retype. For the full picture of what the AI captures across any invoice type, the invoice data extraction software page lists the fields, and invoice line item extraction covers pulling the full line table with quantities and unit values.

Why accurate extraction protects you at the border

The costly mistakes on a commercial invoice are wrong HS codes and vague descriptions, which are among the most common causes of CBP penalties and shipment holds. Reading fields off the page with AI, then reviewing them, is more accurate than retyping under a deadline, and it gives you consistent data to check classification and valuation before entry. Because the tool reads any layout rather than a fixed template, it keeps returning the same fields whether the invoice comes from a new supplier or a forwarder. The invoice data capture software page explains how the AI reads any format without a template per vendor.

From invoice to entry, then into your systems

Extraction is one step in the import workflow. Once the fields are in a spreadsheet, you can key them into a customs entry or import them into a broker platform or ERP, and reconcile the line items against the purchase order. Many commercial invoices arrive as email attachments, so it helps to pull data straight from incoming email with mailparse.ai before it reaches your queue, and to match a shipment back to what was ordered, dedicated purchase order management software keeps POs and receipts in one place. For document types beyond invoices, from bills of lading to certificates, docuocr.com handles general document data extraction. To move the extracted data into a spreadsheet workflow, use the invoice PDF to Excel converter or the invoice PDF to CSV converter.

Why Trade and AP Teams Pick InvoiceExtractor

99%+
Field Accuracy
<10s
Per Invoice
0
Templates to Build

Security & Privacy

  • Bank-grade TLS encryption
  • Files auto-deleted after processing
  • No invoice data stored permanently
  • SOC 2 compliant infrastructure

Commercial Invoice Data Extraction: Common Questions

A commercial invoice is a document used in international trade that details a shipment between a seller and a buyer. Customs agencies, including U.S. Customs and Border Protection, use it as the primary record to value goods and assess import duties and taxes, which is why the data on it has to be accurate and specific.

A commercial invoice lists the seller and buyer, invoice number and date, a description of the goods, quantities, unit and total values, currency, the Harmonized Tariff Schedule (HS) code, country of origin, Incoterms, and itemized charges such as freight and insurance. Extraction returns each of these as a separate field ready to file or import.

Upload the PDF, scan, or photo to an AI extraction tool and it reads the parties, HS codes, country of origin, line items, values, and charges, then exports a clean Excel or CSV. That replaces several minutes of manual keying per invoice with a quick review, and it works across any supplier layout without a template.

Yes. The AI reads the Harmonized Tariff Schedule code, country of origin, and Incoterms alongside the merchandise description and returns them as named fields. You review the result before filing, so classification and valuation start from structured data instead of a manual read of each PDF.

A regular invoice is a domestic bill for goods or services. A commercial invoice is for cross-border shipments and carries extra fields customs needs, such as HS codes, country of origin, Incoterms, and a declared customs value. Both can be extracted the same way, but the commercial invoice has more fields to capture.

Yes. The tool exports the extracted commercial invoice data as a clean Excel or CSV file with consistent column headers for parties, HS codes, line items, values, and charges. From there it drops into a customs entry worksheet, a broker platform, or an ERP without a second round of data entry.

No. A proforma invoice is a preliminary quote or offer sent before a sale is final, and it cannot be used for customs clearance. A commercial invoice is the final document that records the actual transaction and is used by customs to assess duties. The tool extracts data from both, but only the commercial invoice clears goods.

CBP invoice rules require an adequate description of the merchandise, quantities, values, the appropriate Harmonized Tariff Schedule subheading, the name and address of the seller or manufacturer, the country of origin, the currency, the port of destination, and itemized charges. Accurate capture of these fields helps avoid holds and penalties.