Skip to content

Commit be06452

Browse files
committed
data related to comparison with other agentic data extraction frameworks added
1 parent 71be033 commit be06452

File tree

98 files changed

+59231
-34
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

98 files changed

+59231
-34
lines changed

CHANGELOG.md

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,7 @@
11
## Unreleased
22

33
### Added
4+
- Data related to comparison with other agentic data extraction frameworks added for the ComProScanner paper in the `examples/piezo_test/comparing_existing_frameworks` folder.
45

56
- New parameter `apply_advanced_cleaning` added to data cleaning methods in `data_cleaner.py`. When set to `True`, it triggers the advanced cleaning pipeline.
67

@@ -38,6 +39,7 @@
3839
- [CITATION.cff](https://github.com/slimeslab/ComProScanner/blob/main/CITATION.cff) added for standardized citation information based on the latest release and arXiv preprint.
3940

4041
### Fixed
42+
- Data extraction failures fixed if composition-property text data is empty.
4143

4244
- CSV progress tracking in `elsevier_processor.py`:
4345

examples/piezo_test/comparing_existing_frameworks/CMEG-IITR_Agentic_data_extraction/Elsevier_xml_data/10.1016_j.ceramint.2024.10.251.xml

Lines changed: 3663 additions & 0 deletions
Large diffs are not rendered by default.

examples/piezo_test/comparing_existing_frameworks/CMEG-IITR_Agentic_data_extraction/Elsevier_xml_data/10.1016_j.jallcom.2024.177560.xml

Lines changed: 5384 additions & 0 deletions
Large diffs are not rendered by default.

examples/piezo_test/comparing_existing_frameworks/CMEG-IITR_Agentic_data_extraction/Elsevier_xml_data/10.1016_j.jallcom.2024.177690.xml

Lines changed: 4415 additions & 0 deletions
Large diffs are not rendered by default.

examples/piezo_test/comparing_existing_frameworks/CMEG-IITR_Agentic_data_extraction/Elsevier_xml_data/10.1016_j.jallcom.2024.178322.xml

Lines changed: 4464 additions & 0 deletions
Large diffs are not rendered by default.

examples/piezo_test/comparing_existing_frameworks/CMEG-IITR_Agentic_data_extraction/Elsevier_xml_data/10.1016_j.jallcom.2025.178686.xml

Lines changed: 4292 additions & 0 deletions
Large diffs are not rendered by default.

examples/piezo_test/comparing_existing_frameworks/CMEG-IITR_Agentic_data_extraction/Elsevier_xml_data/10.1016_j.jece.2024.115049.xml

Lines changed: 3612 additions & 0 deletions
Large diffs are not rendered by default.

examples/piezo_test/comparing_existing_frameworks/CMEG-IITR_Agentic_data_extraction/Elsevier_xml_data/10.1016_j.jeurceramsoc.2024.117101.xml

Lines changed: 5267 additions & 0 deletions
Large diffs are not rendered by default.

examples/piezo_test/comparing_existing_frameworks/CMEG-IITR_Agentic_data_extraction/Elsevier_xml_data/10.1016_j.matchemphys.2024.129999.xml

Lines changed: 4329 additions & 0 deletions
Large diffs are not rendered by default.

examples/piezo_test/comparing_existing_frameworks/CMEG-IITR_Agentic_data_extraction/Elsevier_xml_data/10.1016_j.mtcomm.2024.111118.xml

Lines changed: 4042 additions & 0 deletions
Large diffs are not rendered by default.

0 commit comments

Comments
 (0)