Skip to content
Change the repository type filter

All

    Repositories list

    • docling

      Public
      Get your documents ready for gen AI
      Python
      MIT License
      94618k12614Updated Jan 10, 2025Jan 10, 2025
    • Python
      MIT License
      1061121Updated Jan 10, 2025Jan 10, 2025
    • A python library to define and validate data types in Docling.
      Python
      MIT License
      245072Updated Jan 10, 2025Jan 10, 2025
    • Docling Haystack integration
      Python
      MIT License
      01020Updated Jan 9, 2025Jan 9, 2025
    • Examples using the Deep Search functionalities
      Python
      MIT License
      185204Updated Jan 9, 2025Jan 9, 2025
    • Docling LangChain integration
      Python
      MIT License
      0400Updated Jan 9, 2025Jan 9, 2025
    • Running Docling as an API service
      Python
      MIT License
      103834Updated Dec 19, 2024Dec 19, 2024
    • .github

      Public
      0101Updated Dec 16, 2024Dec 16, 2024
    • Simple package to extract text with coordinates from programmatic PDFs
      C++
      MIT License
      1044120Updated Dec 16, 2024Dec 16, 2024
    • Interact with the Deep Search platform for new knowledge explorations and discoveries
      Python
      MIT License
      22144812Updated Dec 9, 2024Dec 9, 2024
    • Create fast graph language models from converted PDF documents for knowledge extraction and Q&A.
      C++
      MIT License
      73221Updated Dec 9, 2024Dec 9, 2024
    • CSS
      MIT License
      11000Updated Dec 2, 2024Dec 2, 2024
    • PatCID

      Public
      Python
      MIT License
      13620Updated Nov 28, 2024Nov 28, 2024
    • MolGrapher: Graph-based Visual Recognition of Chemical Structures
      Python
      MIT License
      0900Updated Nov 28, 2024Nov 28, 2024
    • MolGrapher: Graph-based Visual Recognition of Chemical Structures
      Python
      MIT License
      35900Updated Nov 22, 2024Nov 22, 2024
    • quackling

      Public archive
      Build document-native LLM applications
      Python
      MIT License
      25200Updated Sep 11, 2024Sep 11, 2024
    • Mognet is a fast, simple framework to build distributed applications using task queues.
      Python
      MIT License
      41001Updated Aug 7, 2024Aug 7, 2024
    • Python
      MIT License
      0700Updated Jul 8, 2024Jul 8, 2024
    • Python
      MIT License
      0800Updated Jul 8, 2024Jul 8, 2024
    • SemTabNet

      Public
      Repository for ACL paper: "Statements: Universal Information Extraction from Tables with Large Language Models for ESG KPIs"
      Python
      MIT License
      11000Updated Jul 1, 2024Jul 1, 2024
    • Repository to detect scientific software in documents for Chan Zuckerberg Initiative workshop
      Python
      MIT License
      1200Updated Oct 26, 2023Oct 26, 2023
    • langchain

      Public
      ⚡ Building applications with LLMs through composability ⚡
      Python
      MIT License
      16k100Updated May 18, 2023May 18, 2023
    • Website of the ICDAR 2023 DocLayNet competition
      2100Updated Apr 26, 2023Apr 26, 2023
    • DocLayNet

      Public
      DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis
      Other
      1630030Updated Feb 1, 2023Feb 1, 2023
    • Example NLP Annotator API used for integrating with the IBM DeepSearch CPS platform
      Python
      Apache License 2.0
      41000Updated Sep 8, 2022Sep 8, 2022