wget - dealing with '#' symbol in the filepath

Question

I need to download some tif files from the link: http://ntsg.umt.edu/project/mod17#data-product

I stored the path and file names in a text file in wget directory but the data is not getting downloaded correctly. I believe '#' symbol in the filepath is the issue.

for example keeping: http://ntsg.umt.edu/project/mod17#data-product/MOD17A2_GPP.2008.M02.tif in a text file (temp.txt) and then giving command: wget -i temp.txt is not downloading the desired file correctly. Please help me with this issue. I tried backslash symbol as '#' but it doesn't work either.

the above weblink doesn't change when I browse through the folders, the sub-folder where desired tif files are kept can be reached by following path. /pub/MODIS/NTSG_Products/MOD17/GeoTIFF/Monthly_MOD17A2/GeoTIFF_0.05degree/

score 1 · Accepted Answer

'#' 确实提出了一些有趣的问题，即必须小心用反斜杠或围绕 '...' 或 "..." 来引用它。让我们看看发生了什么。另一个问题是“#...”语法在 URL 中是特殊的，并且不是网络服务器上基础文件名的一部分。

wget -nd -np -nH 'http://ntsg.umt.edu/project/mod17#data-product/MOD17A2_GPP.2008.M02.tif' -O temp.txt

该temp.txt文件存在，但没有特定的基本路径，因此几乎所有 URL 都会尝试使用过于简单的命令访问本地文件系统 - 我们需要使用--base=http://ntsg.umt.edu/project/mod17

wget -nd -np -nH --base=http://ntsg.umt.edu/project/mod17 --force-html --input-file temp.txt

这会下载原始 mod17 文档引用的大量文件，但几乎可以肯定这不是您真正想要的。

从您的问题来看，您似乎正在寻找在“数据产品”部分中找到的单个文件，并且需要通过复制地址栏而不是通过鼠标右键单击来获取 URL（在 Firefox 中，对于例如）在您想要的文件上并使用“复制链接位置”。将结果粘贴到某处应该会给您需要提供给的字符串wget。相同的方法可以让您右键单击“Up to Higher level directory”链接以获取它们的位置，然后可能使用 wget 的递归获取功能来获取其中的内容。

祝你好运。

wget - dealing with '#' symbol in the filepath

1 回答 1

Related

Reference