For our Open Trials project, we are aiming to index and make links between different data sources on clinical trials, drugs, and health conditons. Toward this end, we’re looking to incorporate structured data from ClinicalTrials.gov. We know lots work has been done on scraping Clinical Trials in the past (including by Open Knowledge ). We’ve come up with the following list on past work. Does anyone have experience here? Any pitfalls to avoid?
https://wwwcf2.nlm.nih.gov/nlm_eresources/eresources/search_database.cfm
https://cran.r-project.org/web/packages/rclinicaltrials/vignettes/basics.html
https://github.com/tinfante/ClinicalTrialsScraper
https://classic.scraperwiki.com/views/clinicaltrialsgov_test/
Also this: