Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Grobid POC #10

Open
PieterjanMontens opened this issue Dec 3, 2020 · 0 comments
Open

Grobid POC #10

PieterjanMontens opened this issue Dec 3, 2020 · 0 comments

Comments

@PieterjanMontens
Copy link
Contributor

https://cwiki.apache.org/confluence/display/TIKA/GrobidJournalParser

The GrobidJournalParser uses the GROBID (or Grobid) GeneRation Of BIbliographic Data machine learning framework to parse PDF files and to extract information such as title, abstract, authors, affiliations, keywords, etc, from journal publications. The parser has been integrated into Tika. You can follow this guide to get it working on your system.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant