Home
  • Parsee Cloud
  • Tutorials
  • Case Studies
  • Datasets
  • Github

  • All
  • Parsee
  • Data Extraction
All
Parsee
Data Extraction
  • finRAG Datasets & Study: Total Scores
    Data Extraction

    finRAG Datasets & Study

    We wanted to investigate how good the current state of the art (M)LLMs are at solving the relatively simple problem of extracting revenue figures from publicly available financial reports. To test this, we created 3 different datasets, all based on the same selection of 1,156 randomly selected annual reports for the year 2023 of publicly listed US companies. The resulting datasets contain a combined total of 10,404 rows, 37,536,847 tokens and 1,156 images. For our study, we are evaluating 8 state-of-the-art (M)LLMs on a subset of 100 reports.
    1 years ago, 10 min
  • Comparison between Parsee Document Loader  and Langchain Document Loader for PDFs
    Data Extraction

    Comparing Parsee Document Loader vs. Langchain Document Loaders for PDFs

    In the following we will be comparing the results of the Parsee Document Loader vs. the PyPDF Langchain Document Loader for various datasets. All datasets that are used here can be found on Huggingface (links below), so the results are all reproducible.
    1 years ago, 5 min
  • frame-1321315523
    Data Extraction

    Extraction Templates

    The core functionality of the Parsee Extraction Templates explained.
    1 years ago, 5 min
  • frame-1321315523
    Data Extraction

    Extraction Templates vs. Prompt Templates

    Exploring the advantages of Parsee extraction templates over simple prompt templates.
    1 years ago, 5 min read
  • 1
Home
  • Contact
  • Blog
  • Imprint & Terms of Service
  • Privacy Policy
Parsee.ai is developed and maintained by SimFin Analytics GmbH