SC20 Proceedings

The International Conference for High Performance Computing, Networking, Storage, and Analysis

Diving for Treasure in a Sea of Scientific Literature: Extracting Scientific Information from Free Text Articles


Student: Aarthi Koripelly (University of Chicago)
Supervisor: Kyle Chard (University of Chicago, Argonne National Laboratory (ANL))

Abstract: It has become impossible for researchers to keep up with the more than 2.5 million publications published every year. We explore scalable approaches for automatically extracting relations from scientific papers (e.g., melting point of a polymer). We implement a dependency parser-based relation extraction model to understand relationships without the need for a Named Entity tagger, and integrate several word embeddings models and custom tokenization to boost learning performance for scientific text. We then evaluated the models based on run-time, hyperparameter tuning, and scalability.

ACM-SRC Semi-Finalist: yes

Poster: PDF
Poster Summary: PDF


Back to Poster Archive Listing