Get a full-text article in XML text file from PubMed Central


1. Search articles at PubMed and click "Display Settings" and select "PMID list" format and "200" Items per page.



2. Go to "PMCID - PMID - Manuscript ID - DOI Converter" page and copy & paste PMID list to convert to PubMed Central ID (PMCID).

- PMCID - PMID- Manuscript ID -DOI Converter
http://www.ncbi.nlm.nih.gov/pmc/pmctopmid/


3. Providing full-text article XML text file format .

(1) PubMed Central
- Able to bulk download XMLs of every articles but can't download specific article.
- URL: http://europepmc.org/RestfulWebService

(2) PubMed Central Open Access
- Developed by someone.
- URL: http://pmc.jensenlab.org/
- Example: http://pmc.jensenlab.org/pmcid/13900.nxml

(3) European
- Web service provided by EBI.
- URL: http://www.ebi.ac.uk/europepmc/
- Example: http://www.ebi.ac.uk/europepmc/webservices/rest/PMC2652837/fullTextXML

4. Automatically download XML files by PMCID list from PubMed Central Open Access.

--------------------Shell script---------------------
#!/bin/bash

cat all_pmcid.txt | while read line;
do
url="http://www.ebi.ac.uk/europepmc/webservices/rest/${line}/fullTextXML";
save="${line}.xml";
wget -O ${save} ${url};
done
------------------------------------------------------

Share on Google Plus

About Piyoko

    Blogger Comment
    Facebook Comment

0 コメント:

コメントを投稿