0

我想知道Bio.Entrez是否检索 PubMed 文章的efetch()所有元数据,给定一个 PMID 作为输入。对于所有元数据,我的意思是 PubMed 是否有比efetch()检索到的更多的元数据。


例如,我看到对于 PMID 23954024efetch()检索的摘要包含的信息比 PubMed 网站 ( http://www.ncbi.nlm.nih.gov/pubmed/23954024 ) 上的摘要少:

efetch()

"AbstractText": [
    "Rotator cuff tendinopathy is a common source of shoulder pain characterised by persistent and/or recurrent problems for a proportion of sufferers. The aim of this study was to pilot the methods proposed to conduct a substantive study to evaluate the effectiveness of a self-managed loaded exercise programme versus usual physiotherapy treatment for rotator cuff tendinopathy.", 
    "A single-centre pragmatic unblinded parallel group pilot randomised controlled trial.", 
    "One private physiotherapy clinic, northern England.", 
    "Twenty-four participants with rotator cuff tendinopathy.", 
    "The intervention was a programme of self-managed loaded exercise. The control group received usual physiotherapy treatment.", 
    "Baseline assessment comprised the Shoulder Pain and Disability Index (SPADI) and the Short-Form 36, repeated three months post randomisation.", 
    "The recruitment target was met and the majority of participants (98%) were willing to be randomised. 100% retention was attained with all participants completing the SPADI at three months. Exercise adherence rates were excellent (90%). The mean change in SPADI score was -23.7 (95% CI -14.4 to -33.3) points for the self-managed exercise group and -19.0 (95% CI -6.0 to -31.9) points for the usual physiotherapy treatment group. The difference in three month SPADI scores was 0.1 (95% CI -16.6 to 16.9) points in favour of the usual physiotherapy treatment group.", 
    "In keeping with previous research which indicates the need for further evaluation of self-managed loaded exercise for rotator cuff tendinopathy, these methods and the preliminary evaluation of outcome offer a foundation and stimulus to conduct a substantive study."
], 

http://www.ncbi.nlm.nih.gov/pubmed/23954024:摘要 目标:肩袖肌腱病是肩痛的常见来源,其特征是部分患者出现持续和/或反复出现的问题。本研究的目的是试验提出的方法,以进行实质性研究,以评估自我管理的负荷运动计划与常规物理疗法治疗肩袖肌腱病的有效性。

DESIGN:
A single-centre pragmatic unblinded parallel group pilot randomised controlled trial.

SETTING:
One private physiotherapy clinic, northern England.

PARTICIPANTS:
Twenty-four participants with rotator cuff tendinopathy.

INTERVENTIONS:
The intervention was a programme of self-managed loaded exercise. The control group received usual physiotherapy treatment.

MAIN OUTCOMES:
Baseline assessment comprised the Shoulder Pain and Disability Index (SPADI) and the Short-Form 36, repeated three months post randomisation.

RESULTS:
The recruitment target was met and the majority of participants (98%) were willing to be randomised. 100% retention was attained with all participants completing the SPADI at three months. Exercise adherence rates were excellent (90%). The mean change in SPADI score was -23.7 (95% CI -14.4 to -33.3) points for the self-managed exercise group and -19.0 (95% CI -6.0 to -31.9) points for the usual physiotherapy treatment group. The difference in three month SPADI scores was 0.1 (95% CI -16.6 to 16.9) points in favour of the usual physiotherapy treatment group.

CONCLUSIONS:
In keeping with previous research which indicates the need for further evaluation of self-managed loaded exercise for rotator cuff tendinopathy, these methods and the preliminary evaluation of outcome offer a foundation and stimulus to conduct a substantive study.

(的摘要中缺少OBJECTIVESDESIGNSETTING等。)efetch()

还遗漏了哪些其他元数据efetch(),有没有办法以编程方式检索丢失的信息?

4

2 回答 2

2

为了扩展 xbello 的答案,不,信息没有丢失,但有点隐藏。

from Bio import Entrez

Entrez.email = "Your.Name.Here@example.org"
handle = Entrez.efetch(db="pubmed", id="23954024", rettype="xml")
records = Entrez.read(handle)

for record in records:

    m = record['MedlineCitation']['Article']['Abstract']['AbstractText']
    for subsection in m:
        print(subsection.attributes['Label'])
        print(subsection)

截断输出:

目标

肩袖肌腱病是肩部疼痛的常见来源,其特征是一部分患者出现持续和/或反复出现的问题。本研究的目的是试验提出的方法,以进行实质性研究,以评估自我管理的负荷运动计划与常规物理疗法治疗肩袖肌腱病的有效性。

设计

单中心实用非盲平行组试点随机对照试验。

于 2016-06-27T09:08:49.093 回答
1

信息没有丢失:

from Bio import Entrez
Entrez.email = "sample@sample.org"

handle = Entrez.efetch(db="pubmed", id="23954024", rettype="xml")

print(handle.read())

部分输出:

<Abstract>
 <AbstractText Label="OBJECTIVES" NlmCategory="OBJECTIVE">Rotator cuff tendinopathy is a common source of shoulder pain characterised by persistent and/or recurrent problems for a proportion of sufferers. The aim of this study was to pilot the methods proposed to conduct a substantive study to evaluate the effectiveness of a self-managed loaded exercise programme versus usual physiotherapy treatment for rotator cuff tendinopathy.</AbstractText>
 <AbstractText Label="DESIGN" NlmCategory="METHODS">A single-centre pragmatic unblinded parallel group pilot randomised controlled trial.</AbstractText>
 <AbstractText Label="SETTING" NlmCategory="METHODS">One private physiotherapy clinic, northern England.</AbstractText>
[...]
于 2016-06-27T06:57:12.713 回答