Scholarly Insight through the ArXivExplore Paclet
Daniele Gregori, Soochow University
Institute for Advanced Study
#WolframTechConf
ABSTRACT
In[]:=
abstract="ArXivExplore supports deep data analysis of all 2.6M articles on ArXiv (physics, math, computer science, etc.), providing functionality for title/abstract word statistics, dissection of TeX sources, formulae and citations, neural networks for classification or recommendation, and LLM-automated concept explanations and author reports.";
Paclet loading
Introduction
On ArXiv
A “deep data” problem
Academic “chat” vs scientific “insight”
Some ArXiv questions
ArXiv data mining
ArXiv main data
Categories
Full TeX scraping
Citations API
Real insights from “just counting”
Submission trends
Counting total words
Word popularity trends
Word logic combinations
Some Machine Learning
Basic classification
Advanced training
Feature extraction
Clustering
LLMs to enhance understanding
The importance of introductions
Explaining specific concepts
Creating author reports
Conclusions
Unlimited explorations
Planned improvements
New horizons in “deep data”
Thank you!