📄 Document Classification

Document Classification System

This AI-powered tool analyzes a PDF document and matches it to the product categories you define, ranked by content similarity.

Methods Available:

  • Smart Semantic: Uses an LLM to summarize the document, then matches the summary semantically against the product descriptions (recommended)
  • Semantic: Compares the document text directly with the product descriptions by semantic similarity
  • Keyword: Matches on the presence of each product's keywords in the document
  • Hybrid: Combines the semantic and keyword scores, weighted 70% semantic and 30% keyword
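
As a rough illustration of how the Keyword and Hybrid methods could score a single product, here is a minimal sketch. It is not the tool's actual implementation: the function names, the whole-word keyword matching, and the use of cosine similarity for the semantic score are assumptions. Only the 70/30 weighting comes from the description above.

```python
import math
import re


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length embedding vectors (assumed semantic score)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def keyword_score(text, keywords):
    """Fraction of keywords found in the text (case-insensitive, whole-word match).

    This matching rule is an assumption; the tool may count keywords differently.
    """
    if not keywords:
        return 0.0
    lowered = text.lower()
    hits = sum(
        1
        for kw in keywords
        if re.search(r"\b" + re.escape(kw.lower()) + r"\b", lowered)
    )
    return hits / len(keywords)


def hybrid_score(semantic, keyword, semantic_weight=0.7):
    """Weighted blend of the two scores: 70% semantic, 30% keyword by default."""
    return semantic_weight * semantic + (1 - semantic_weight) * keyword
```

For example, a product with a semantic score of 0.8 and half of its keywords present in the document would receive a hybrid score of 0.7 × 0.8 + 0.3 × 0.5 = 0.71 under these assumptions.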

How to use:

  1. Upload a PDF document
  2. Define your product categories with descriptions and keywords (JSON format), or use the examples at the bottom of the page
  3. Choose a classification method
  4. Review the top 3 matches and their confidence scores
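
Steps 2 and 4 can be sketched as follows. The JSON layout below (a product name mapped to a `description` and a `keywords` list) is a hypothetical schema inferred from the wording of step 2, and the product names are invented for illustration. The tool's real format may differ, so check the examples on the page before relying on it.

```python
import json

# Hypothetical product definitions; the schema and names are assumptions.
EXAMPLE_DEFINITIONS = """
{
  "Home Insurance": {
    "description": "Coverage for residential property damage and liability.",
    "keywords": ["property", "homeowner", "coverage"]
  },
  "Auto Insurance": {
    "description": "Coverage for vehicles, collisions, and driver liability.",
    "keywords": ["vehicle", "collision", "driver"]
  }
}
"""


def load_definitions(raw):
    """Parse the JSON text and check that each product has the assumed fields."""
    data = json.loads(raw)
    if not isinstance(data, dict):
        raise ValueError("Product definitions must be a JSON object")
    for name, entry in data.items():
        if not isinstance(entry, dict):
            raise ValueError(f"Product '{name}' must map to a JSON object")
        for field in ("description", "keywords"):
            if field not in entry:
                raise ValueError(f"Product '{name}' is missing '{field}'")
    return data


def top_matches(scores, k=3):
    """Return the k highest-scoring (product, score) pairs, best first."""
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)[:k]
```

Given a dictionary of per-product scores from any of the methods, `top_matches(scores)` yields the three best candidates in descending order of confidence.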

Upload PDF

Classification Method

Select classification method

Product Definitions

Classification Results

Document Summary

Product Definition Examples