**Guen Prawiroatmodjo** @guenp@physics.social · 2022-11-22T03:09:53Z

Guen Prawiroatmodjo @guenp@physics.social

Guen Prawiroatmodjo @guenp@physics.social

I'm tempted to go and build an #arxiv bot but one that also shares the first figure in the toot automatically.. someone build that yet? #science #papers #physics

Nov 22, 2022, 03:09 · · Web · · ·

**Junghyeon Park** @j824h@mathstodon.xyz · Nov 22, 2022, 03:25

**Junghyeon Park** @j824h@mathstodon.xyz · Nov 22, 2022, 03:25

Nov 22, 2022, 03:25

Junghyeon Park @j824h@mathstodon.xyz

@guenp arXiv itself does not contain Figures & Tables data, so generally you will need to process the PDF file from https://export.arxiv.org/pdf/*
Defining which visible part of the document is a figure might be again non-trivial.

**Guen Prawiroatmodjo** @guenp@physics.social · Nov 22, 2022, 03:31

**Guen Prawiroatmodjo** @guenp@physics.social · Nov 22, 2022, 03:31

Nov 22, 2022, 03:31

Guen Prawiroatmodjo @guenp@physics.social

@j824h indeed, I was not planning to parse the PDF, that seems like a tedious route

**Junghyeon Park** @j824h@mathstodon.xyz · Nov 22, 2022, 03:32

**Junghyeon Park** @j824h@mathstodon.xyz · Nov 22, 2022, 03:32

Nov 22, 2022, 03:32

Junghyeon Park @j824h@mathstodon.xyz

@guenp Copyright also concerns.
Unlike the abstract, the article content is not in public domain by default.
I think we can only repost the articles explicitly licensed for re-use.
https://arxiv.org/help/bulk_data#bulk-full-text-access

ab480f9c067c002d.png

**Guen Prawiroatmodjo** @guenp@physics.social · Nov 22, 2022, 03:36

**Guen Prawiroatmodjo** @guenp@physics.social · Nov 22, 2022, 03:36

Nov 22, 2022, 03:36

Guen Prawiroatmodjo @guenp@physics.social

@j824h huh good to know.. how does arxiv-vanity deal with this limitation?

**Junghyeon Park** @j824h@mathstodon.xyz · Nov 22, 2022, 03:48

**Junghyeon Park** @j824h@mathstodon.xyz · Nov 22, 2022, 03:48

Nov 22, 2022, 03:48

Junghyeon Park @j824h@mathstodon.xyz

@guenp No clue! Maybe @bfirsh would assert fair use? You reminded me that arXiv seems to overlook certain gray area so maybe we don't have to be excessively cautious.

Trending now

Resources

Developers

What is Mastodon?

physics.social

More…