This is code to download and save search page results from bioRxiv, one day at a time. It was used in the data-gathering stage of a study that led to these results.

It’s on Github.

Why didn’t we just ask Cold Spring Harbor Laboratories (the maintainers of bioRxiv) for the data? We were in a hurry.

Here’s a sample search page.

Part 1: Load the necessary libraries

Part 2: Define functions for downloading, parsing, and summarizing search results

Part 3: Run the searches for a set date range, saving the results