I am trying to fetch the Dilbert comic image for the current day. I can get the full page with the following:
var todayDate = DateTime.Now.ToString("yyyy-MM-dd");
var web = new HtmlWeb();
web.UseCookies = true;
var wp = new WebProxy("http://myproxy:8080");
wp.UseDefaultCredentials = true;
NetworkCredential nc = (NetworkCredential)CredentialCache.DefaultCredentials;
HtmlDocument document = web.Load("http://www.dilbert.com/strips/comic/" + todayDate, "GET", wp, nc);
If I look at the document's full HTML, I see the image listed several times on the page, for example:
<meta property="og:image" content="http://assets.amuniversal.com/c2168fa0c45a0132d8f0005056a9545d"/>
or:
<meta name="twitter:image" content="http://assets.amuniversal.com/c2168fa0c45a0132d8f0005056a9545d">
or:
<img alt="Squirrel In The Large Hadron Collider - Dilbert by Scott Adams" class="img-responsive img-comic" height="280" src="http://assets.amuniversal.com/c2168fa0c45a0132d8f0005056a9545d" width="900" />
What is the best way to extract this image's URL from the page?
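For reference, here is a minimal sketch of one possible approach, not necessarily the best one: query the already loaded HtmlAgilityPack document with XPath and read the relevant attribute. The class name `ComicUrlExtractor` and the method name `ExtractComicUrl` are made up for illustration. The fallback order (og:image, then twitter:image, then the `img-comic` tag) is an assumption based only on the three snippets shown above, and the page markup may change.

```csharp
using HtmlAgilityPack;

// Hypothetical helper, named here for illustration only.
public static class ComicUrlExtractor
{
    // Returns the comic image URL, or null if none of the known tags is present.
    public static string ExtractComicUrl(HtmlDocument document)
    {
        // Each entry pairs an XPath query with the attribute that holds the URL.
        // The order is an assumption: meta tags first, visible <img> last.
        var candidates = new[]
        {
            new[] { "//meta[@property='og:image']", "content" },
            new[] { "//meta[@name='twitter:image']", "content" },
            new[] { "//img[contains(@class, 'img-comic')]", "src" },
        };

        foreach (var candidate in candidates)
        {
            // SelectSingleNode returns null when nothing matches the XPath.
            var node = document.DocumentNode.SelectSingleNode(candidate[0]);
            if (node == null)
            {
                continue;
            }

            // An empty string is used as the default for a missing attribute.
            var url = node.GetAttributeValue(candidate[1], "");
            if (!string.IsNullOrEmpty(url))
            {
                return url;
            }
        }

        return null;
    }
}
```

With the `document` variable from the code above, this would be called as `var imageUrl = ComicUrlExtractor.ExtractComicUrl(document);`, and a null result would mean that none of the three tags was found.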