问题标签 [ncbi]

For questions regarding programming in ECMAScript (JavaScript/JS) and its various dialects/implementations (excluding ActionScript). Note JavaScript is NOT the same as Java! Please include all relevant tags on your question; e.g., [node.js], [jquery], [json], [reactjs], [angular], [ember.js], [vue.js], [typescript], [svelte], etc.

0 投票
4 回答
1414 浏览

javascript - 如何解决超链接的 VERY LARGE URL 问题...使用 jQuery 或 Javascript 在背面调用任何 perl、php 等脚本

我已经构建了 UI,它就像一个用于 BioProcess/Disease--> Genes 的搜索引擎。例如,用户可以查询:“干细胞”或“脑肿瘤”,结果将给出 50 到 5000 个 GeneID(本质上是代表 NCBI 数据库中唯一基因的数字)。

它是免费的,你可以试试: http: //proteogenomics.musc.edu/genemesh/

现在,问题是我喜欢那些 GeneIDs 到 NCBI 站点一次。对于多达 200 或 400 个 GeneID,我可以获得很好的结果。但是对于超过 500 个 GeneID,我会收到“URI TOO LARGE”错误,或者 NCBI 网站无法接受巨大的查询。我正在尝试解决这个问题。

Javascript, OnClick 可以在 UNIX /tmp 文件中写入文件吗?或者有没有更好的解决方案?

在这一点上,任何帮助/指南都很棒......

非常感谢你,索林

网址过长:

http://view.ncbi.nlm.nih.gov/gene/30980,31208,31442,31789,32083,32855,33156,33392,33498,34309,35197,35974,36057,38545,38701,42945,43162, 43256,43288,43739,44900,45339,45775,3772082,260437,393632,406845,379537,379620,399306,100037057,100037058,100037249,10,40,43,60,104,133,174,182,185,186,187,197,207,208,210,238,240,269,284,285,317,324,328,332,335,351,355,356,358,361,366,379,387,388,396,429,462,466,472,482,558,567,573,575,576,577,581,595,596,598,604,648,650,664,672,673,675,706,712,754,761,768,771,800,836,841,842,847,857,860,875,885,889,890,891,902,960,970,978,983,999,1000,1018,1019,1021,1025, 1026,1027,1029,1030,1031,1051,1082,1111,1116,1124,1131,1152,1231,1234,1272,1282,1284,1285,1286,1287,1288,1316,1432,1457,1459, 1462,1464,1471,1474,1485,1490,1493,1499,1508,1512,1520,1543,1545,1571,1594,1600,1605,1612,1620,1622,1630,1633,1638,1641,1643, 1649,1728,1755,1756,1803,1809,1814,1839,1854,1869,1909,1910,1942,1950,1956,1969,2012,2013,2014,2019,2020,2022,2034,2035,2044,2045,2046, 2048,2050,2051,2052,2064,2066,2067,2068,2071,2100,2149,2166,2173,2246,2247,2250,2251,2254,2260,2263,2264,2272,2289,2290,2305, 2308,2309,2321,2335,2475,2542,2547,2574,2579,2621,2627,2670,2734,2735,2736,2737,2738,2740,2849,2890,2896,2901,2908,2925,2932, 2936,2939,2944,2947,2950,2952,2956,3073,3074,3082,3090,3091,3104,3105,3106,3115,3122,3123,3146,3161,3162,3181,3191,3215,3240, 3265,3309,3315,3324,3371,3373,3383,3384,3397,3398,3417,3479,3480,3481,3482,3485,3486,3487,3491,3553,3558,3561,3565,3566,3569, 3574,3575,3596,3597,3598,3603,3611,3621,3630,3632,3645,3672,3685,3688,3717,3732,3738,3741,3766,3785,3791,3814,3815,3845,3897, 3910,3912,3913,3915,3918,3925,3945,3956,3958,3981,3987,4035,4087,4088,4089,4133,4137,4145,4147,4152,4155,4172,4175,4192,4193,4194,4241,4255,4267,4288,4292,4303,4312,4313,4314,4316,4318,4320,4321,4323,4327,4350,4436,4440,4524,4548,4549, 4552,4574,4601,4609,4654,4684,4691,4735,4745,4763,4771,4790,4804,4807,4808,4829,4830,4843,4851,4856,4893,4907,4914,4915,4916, 4950,4978,4982,5015,5028,5030,5054,5080,5111,5118,5154,5155,5156,5159,5178,5241,5243,5266,5274,5276,5290,5294,5295,5328,5329, 5334,5340,5395,5444,5468,5536,5538,5563,5566,5578,5579,5580,5581,5583,5584,5591,5594,5595,5599,5602,5629,5653,5663,5702,5708, 5713,5725,5727,5728,5730,5743,5745,5747,5764,5781,5803,5805,5834,5835,5879,5880,5881,5888,5894,5898,5899,5900,5915,5921,5925, 5934,5970,5978,5981,5992,6048,6091,6118,6165,6195,6275,6278,6284,6347,6382,6387,6​​416,6464,6469,6490,6502,6506,6507,6513,6598, 6606,6608,6622,6647,6648,6649,6657,6659,6660,6662,6663,6664,6667,6670,6678,6695,6697,6714,6717,6770,6772,6774,6790,6853,6855,6863,6892,6900,6948,7012,7013,7015,7018,7020,7025,7040,7042,7054,7057,7076,7078,7105,7124,7153,7157,7161,7168, 7175,7185,7186,7248,7249,7258,7262,7276,7277,7283,7298,7422,7428,7431,7442,7447,7468,7474,7490,7508,7515,7517,7518,7520,7525, 7545,7799,7849,7852,7980,8028,8038,8061,8089,8140,8190,8301,8372,8484,8507,8577,8605,8650,8678,8682,8692,8718,8737,8741,8742, 8743,8745,8754,8771,8772,8788,8795,8797,8811,8828,8829,8841,8842,8848,8851,8862,8871,8877,8928,8930,9077,9100,9113,9141,9148, 9156,9173,9211,9212,9232,9334,9353,9423,9429,9444,9445,9447,9459,9507,9518,9535,9545,9588,9681,9806,9833,9844,9961,10036,10153 10209101050111010101010101010101050105051050107010505505505505550555550555555555155 11095,11096,11141,11156,11162,11186,11191,11200,11235,11284,22865,22933,22943,23136235055855 ,,2.23155858858585858585858585858585858585585585858555585855858585585585588555855555855885 291085 ,,512905085 ,,512915185085 ,,51251851851851851518515185.51515151851851851518518515 54850555 ,,55055555 ,,5.5555855558558555855558558558555785555578555578575558558575855557858555755755555785555755557585855585557555557855855578555555755855557555855555575 80309858858804858 ,,880480488048585885885858585858858585885858585858585858585858585855855855855855855858558558555 129787,139065,140885,146691,146956,160728,162979,163732,200895,253260,253738,259266,284217,284459,286527,338030,342945,375790,399473,406907,406991,407043,646555,728239,100133941, 403412,396562,396783,751862,100125840,100125841,282862,100008679,11539,11545,11601,11622111790118211178117811117817801178011780117811781178117811781178178117817811781178117817811781785181781178111781781178117811781178178117811785 12578,126061111111111111,1111111111111111,111111111111111111111111384611,1386611450,14802,1439711111111111111111111115111151151145 14634,14714111111111111111111111111111111111111111155114811859118511481855 17920,18008,18125,1818,18181818208,18386,18507,1801,18595,18616,18708,18787,80792,187995,807,18808,18595,1860,185.18708,18787,80792,18793,8096,18908,18787,80792,18993,18996,1899793,18996,18997,19090,19118,19122, 19206,192111950 ,19508080 ,191119080895 ,19811980859598598595985985985985985985 21948,22059,22339,22596,22612,23796,26362,26413,26417,26434,30878,30957,50490,50913,50914,54725,55992,56458,567111111111111111111111111111111111111111111111111111111111185.8908556451855609085 23228625454504545468045 ,,2454680454545454545545455455454555455454545 2526580560568056805056805055454560554580554554545545545545545455454554545545545505545454555055455545455055545455055454550554545 5908685 ,,8185085 ,,8185888085 ,,818584578588588584578578578585785885785785785885885785785785885785785785888585857857857857858857858885885858858857 117273,170568,192248,292994,297893,306464,308435,338474,360457,362317,369119,2828259,155871,156110,908122,4981003,4981004,4981005,4981006,4981007,4981008,4981009,4981010,4981011,4981012, 4981013,4981014,4981015,4981016,4981017,4981018,4981019,4981020,4981021,4981022,4981023,4981024,4981025,4981026,4981027,4981028,4981029,4981030,4981031,4981032,4981033,4981034,4981035,4981036,4981037,4981038,4981039,4981040,4981041,4981042,4981043, 4981045,4981046,4981047,4981048,4981049,4981050,4981051,4981052,4981053,4981054,4981055,4981056,4981057,4981058,4981059,4981060,4981061,4981062,4981063,4981064,4981065,4981066,4981067,4981068,4981069, 4981070,4981071,4981072,4981073,4981074,4981075,4981076,4981077,4981078,4981079,4981080,4981081,4981082,4981083,4981084,4981085,4981086,4981087,4981088,4981089,4981090,4981091,4981092,4981093,4981094, 4981095,4981096,4981097,4981098,4981099,4981100,4981101,4981102,4981103,4981104,4981105,4981106,4981107,4981108,4981109,4981110,4981111,4981112,4981113,4981115,4981116,4981117,4981118,4981119,4981120, 4981121,4981125,4981126,4981128,4981129,4981130,4981131,49811498114981114981149811498114981149811498114981149811498114981114981149811981149811981149811498114981198114981198119811981198119811981198119811981198119811981198119811981149811981198119811981198114981198115 49811011011111111011111111111111111111111111179811811811811811117981168118118118111798111681181181181181181181168 4981183,4981184,4981185,4981186,4981187,4981188,4981189,4981190,4981191,4981192,4981193,4981194,4981195,4981196,4981197,4981198,4981199,4981200,4981201,4981202,4981203,4981204,4981205,4981206,4981207, 4981208,4981209,4981210,4981211,4981212,4981213,4981214,4981215,4981216,4981217,4981218,4981219,4981220,4981221,4981222,4981223,4981224,4981225,4981226,4981227,4981228,4981229,4981230,4981231,4981232, 4981233,4981234,4981235,4981236,4981237,4981238,4981239,4981240,4981241,4981242,4981243,4981244,4981245,4981246,4981247,4981248,4981249,4981250,4981251,4981252,4981253,4981254,4981255,4981256,4981257,4981258,4981259,4981260,4981261,4981262,4981263,4981264, 4981265,4981266,4981267,4981268,4981269,4981270,4981271,944996,946069

0 投票
1 回答
130 浏览

c++ - 在 NCBI c++ 工具包应用程序中设置版本号

如何在NCBI C++ Toolkit应用程序中设置版本号?

我的意思是当我使用参数 -version 启动程序时显示的版本号。

我通读了文档,但还没有找到。

(我知道这是一个非常具体的问题,但我认为值得一试)

0 投票
3 回答
7673 浏览

python - 如何从 XML NCBI BLAST 文件中提取第一个命中元素?

我试图只从 NCBI xml BLAST 文件中提取第一个命中。接下来我想只获得第一个 HSP。在最后阶段,我想根据最好成绩获得这些。为了清楚起见,这里有一个 xml 文件的示例:

基本上每个查询搜索都会创建一个迭代元素。每次迭代都可以有多个命中,而这些命中又可以有多个 HSP。我只想获得第一个命中,它是每次迭代的第一个 HSP。如果 BLAST 没有找到命中,我想忽略迭代。我编写了这个简单的代码:

任何帮助将不胜感激!

0 投票
2 回答
980 浏览

c++ - 如何设置 QNetworkReply 属性以获取正确的 NCBI 页面?

我尝试使用 downloadURL 函数获取以下 url,如下所示:

http://www.ncbi.nlm.nih.gov/nuccore/27884304

但是数据不是我们通过浏览器可以看到的,现在我知道是因为需要一些正确的信息(例如浏览器类型)。如何知道需要设置哪些信息,如何设置?(通过 setHeader 函数或其他方式??)

在VC++中,我们可以使用CInternetSession和CHttpConnection对象来获取正确的数据,而无需设置任何其他详细信息,Qt或其他跨平台C++网络库中是否有类似的方法?(是的,我需要跨平台属性。)

通过VC,我们可以做到这一点,那么正确的数据就在CHttpFile中。

0 投票
1 回答
556 浏览

bioinformatics - NCBI基因数据库问题

我正在尝试查找带有基因名和染色体位置的gene_info 文件。但是,我似乎无法在 NCBI FTP 站点上找到它。谁能给我指点?

0 投票
1 回答
393 浏览

java - 有人用过pubchemdb吗?有没有类似的API?

更新:答案中的链接既有趣又有用,但不幸的是没有解决对 java API 的需求,所以我仍然期待任何输入。

我正在建立一个化合物数据库。我需要所有同义词(IUPAC 和通用名称)以及每个同义词的安全数据。
我将使用 PubChem (http://pubchem.ncbi.nlm.nih.gov/) 上免费提供的数据

有一种简单的方法可以通过简单的 HTTP 获取来查询每个化合物。例如,要获取甘油数据,URL 是:

以下 URL 将返回一个易于解析的格式:

但它只会响应非常基本的信息,缺乏安全数据和几个常用名称。

有一个用于 JAVA 的公共领域 API,看起来非常完整,由 Scripps 的一个小组开发(引用)。代码在这里

不幸的是,这个 API 没有很好的文档记录,并且由于所涉及的数据的复杂性而很难遵循。对于我收集的信息,pubchemdb 正在使用PubChem Power User Gateway (PUG) XML API

有没有人使用过这个 API(或任何其他可用的 API)?我将不胜感激有关如何开始使用它的简短描述或教程。

0 投票
1 回答
2591 浏览

python - 获得 BLAST 结果的前 10 个序列 Bio Python

我想获得 BLAST 结果的前 10 个序列(只是序列,没有比对或分数或 e 值等)。我正在输入一个包含 5 个 fasta 文件的文本文件。所以我的输出应该是每个 fasta 文件的前 10 个爆炸命中。因此我的输出文件将有 50 个序列。

我正在通过 Bio.SeqIO 读取每个输入的 fasta 文件,将其写入 temp.faa,然后通过子进程将其传递给命令行 BLAST

输出有很多其他信息。我现在应该解析这个输出还是有更好的方法。

谢谢

PS XML 可能是一种方法,但我没有找到相关的 NCBIXML 解析器语法。

0 投票
1 回答
258 浏览

xml - 使用 SimpleXML 解释 BLAST XML 输出——连字符问题?嵌套对象访问语法问题?

我正在尝试使用 SimpleXML 来读取一些 NCBI BLAST XML 输出,并且我能够访问一些输出,但不能访问它的其他位。

这是 XML 的相关部分(为了便于阅读,删除了一些不相关的部分):

这是我的代码(注意:$qdef 和 $qlen 的到达方式不同,以确保我在设置/使用 $output 变量时没有犯一些愚蠢的错误):

这是输出:

如果我删除 Iteration_query-def 和 Iteration_query-len 周围的 {''},它会将它们视为整数并为两者返回零。

难道我做错了什么?除了 BlastOutput_program 位和其他两个变量之间的 {''} 东西之外,我无法弄清楚我在做什么不同。但是,如果我将 {''} 内容添加到 BlastOutput_program,它仍然可以正常工作并为此产生正确的输出。这是怎么回事?

更新:它使用 xpath 工作,如下所示:

但我仍然很想知道这是否是唯一的方法,或者是否有办法像我上面展示的那样做。

0 投票
0 回答
248 浏览

java - wsimport 导入失败并出现 NCBI/Blast

我正在尝试生成处理此处记录的 SOAP 服务的文件:http: //www.ncbi.nlm.nih.gov/books/NBK55699/

但是 wsimport 失败了:

但是,我测试的 WSDL 验证器没有显示任何错误(http://xmethods.net/ve2/Tools.pohttp://www.validwsdl.com/ ...)

我该如何解决这个问题?

谢谢,

0 投票
1 回答
777 浏览

bioinformatics - 转换 pmc-id -> pmid

是否可以通过 ncbi api 将 pmc-ids (pubmed central ids) 转换为 pmids (pubmed ids)?你可以通过网络表单来做,但我想使用一个程序——当然我总是可以写一个屏幕刮板......谢谢