-
Notifications
You must be signed in to change notification settings - Fork 484
Insights: kermitt2/grobid
Overview
Could not load contribution data
Please try again later
1 Release published by 1 person
-
0.8.2
published
May 11, 2025
6 Pull requests merged by 2 people
-
Enable Trivy for security code scanning
#1295 merged
May 27, 2025 -
Add support for ARM on other OS
#1291 merged
May 27, 2025 -
Update jruby to 9.4.12.1
#1293 merged
May 26, 2025 -
updated eclipse-temurin docker images to 17.0.15_6
#1294 merged
May 26, 2025 -
address vulnerabilities in CRF image
#1261 merged
May 19, 2025 -
Revert text that does not belong to graphics as paragraph instead of discarding it
#1266 merged
May 11, 2025
4 Pull requests opened by 2 people
-
Uniform figDesc between sentence segmentation and non-sentence segmentation
#1287 opened
May 11, 2025 -
Relax block distance when multi columns and blocks at the same height
#1288 opened
May 11, 2025 -
new-figure-table-extraction - Extract figures from SVG
#1297 opened
May 30, 2025 -
Add segmentation + fulltext annotations.
#1301 opened
Jun 8, 2025
23 Issues closed by 2 people
-
Internal Server Error
#112 closed
Jun 8, 2025 -
Potential issue with consolidation of funders/citations
#1298 closed
Jun 5, 2025 -
Address vulnerabilities in docker images
#1262 closed
May 27, 2025 -
Lines/blocks are filtered when containing certain extensions, leading to error 500
#1281 closed
May 26, 2025 -
Formulas in a separate paragraph or in stand alone tags
#1252 closed
May 26, 2025 -
Ignored text before tables
#1251 closed
May 26, 2025 -
Wrongly placed figure reference
#1244 closed
May 26, 2025 -
<ref type="figure" target="#fig_5"> missed
#1233 closed
May 26, 2025 -
<p> duplicates
#1232 closed
May 26, 2025 -
Misclassified tables and/or figures maybe tossed incorrectly
#1206 closed
May 26, 2025 -
Annex and body misclassification
#1198 closed
May 26, 2025 -
Data availabilty extraction failure use cases
#1187 closed
May 26, 2025 -
Empty refs
#1175 closed
May 26, 2025 -
Incomplete funding statement extraction
#984 closed
May 26, 2025 -
Abstract for paper is not correctly extracted from PDF
#1155 closed
May 26, 2025 -
PDF source file containing "pdf" before ".pdf" extension breaks naming of training files
#776 closed
May 26, 2025 -
[Feature idea] Extract external links (github, dataset, ...)
#167 closed
May 26, 2025 -
Full text model layout features: BLOCKSTART missing, if very first block token is a new line
#712 closed
May 26, 2025 -
JRuby update crashes grobid when segmenting sentences
#1292 closed
May 26, 2025 -
Gradle problem
#1057 closed
May 19, 2025 -
"Author contributions" section content is skipped by grobid
#1231 closed
May 19, 2025 -
Identification of code excerpts or software development/project name
#116 closed
May 14, 2025
4 Issues opened by 3 people
-
Not able to process the request sent one after the another
#1299 opened
Jun 6, 2025 -
[grobid/multi-arch-docker-image] UnsatisfiedLinkError: /opt/grobid/grobid-home/lib/lin-64/libwapiti.so
#1296 opened
May 26, 2025 -
[ubuntu:plucky OpenJDK-21 Gradle-8.14] patch/pull request
#1290 opened
May 19, 2025 -
best practices for enhancing figure/table annotation (question)
#1289 opened
May 12, 2025
18 Unresolved conversations
Sometimes conversations happen on old items that aren’t yet closed. Here is a list of all the Issues and Pull Requests with unresolved conversations.
-
Support for ARM, updated PDFAlto, Docker multi-architecture build
#1165 commented on
Jun 11, 2025 • 10 new comments -
Support for python env managers
#1010 commented on
May 11, 2025 • 0 new comments -
Improvement of the recovery of Pragmatic Segmenter sentence segmentation text wrt to the original text offsets
#701 commented on
May 27, 2025 • 0 new comments -
Using the figure number for matching figues of different types (Extended, Supplementary) might not be enough
#1286 commented on
Jun 9, 2025 • 0 new comments -
Incorrectly extracted chemistry information.
#1249 commented on
Jun 8, 2025 • 0 new comments -
When ARM64 DLL will be available (present in lib and pdfalto)
#1219 commented on
May 28, 2025 • 0 new comments -
Docker on macOS arm64
#1089 commented on
May 27, 2025 • 0 new comments -
Errors using the lightweight docker container (v0.7.3)
#1014 commented on
May 27, 2025 • 0 new comments -
GROBID container image for linux/arm64
#928 commented on
May 27, 2025 • 0 new comments -
Docker container killed after uploading PDF file on Apple Silicon (macOS Sonoma 14.5)
#1119 commented on
May 27, 2025 • 0 new comments -
Regarding GROBID support ARM64
#1218 commented on
May 27, 2025 • 0 new comments -
grobid run giving '/tini: 1: Syntax error: "(" unexpected'
#1229 commented on
May 27, 2025 • 0 new comments -
GROBID 0.8.1 Docker Container Fails Due to cgroup v2 NullPointerException on MacOS
#1260 commented on
May 19, 2025 • 0 new comments -
is it feasible: inverse(GROBID)?
#502 commented on
May 19, 2025 • 0 new comments -
Question about identifying table content
#514 commented on
May 19, 2025 • 0 new comments -
Grobid seems to keep skipping entire columns
#1270 commented on
May 19, 2025 • 0 new comments -
Only first line of figure description extracted if distance between lines deemed too large
#683 commented on
May 14, 2025 • 0 new comments -
Inconsistency in <figDesc> when applying or not sentence segmentation
#1265 commented on
May 11, 2025 • 0 new comments